Publications

You can also find my articles on my Google Scholar profile.

Conference Papers

CDI: Copyrighted Data Identification in Diffusion Models

Published in Conference on Computer Vision and Pattern Recognition (CVPR), 2025

We show that existing membership inference attacks are ineffective for large diffusion models, and we propose CDI, a dataset inference approach that aggregates signals across many samples to reliably detect copyrighted training data with over 99% confidence.

Paper

Maybe I Should Not Answer That, but… Do LLMs Understand The Safety of Their Inputs?

Published in ICLR Workshop on Building Trust in Language Models and Applications, 2025

We investigate whether LLMs implicitly encode safety information, introducing a training-free moderation method that leverages the hidden states of an LLM to detect unsafe inputs.

Paper

Privacy Attacks on Image Autoregressive Models

Published in International Conference on Machine Learning (ICML), 2025

We show that image autoregressive models (IARs) are empirically less private than diffusion models. We introduce the first membership inference attack tailored to IARs, and execute membership inference, dataset inference, and sample extraction to reveal their vulnerability.

Paper

Learning Graph Representation of Agent Diffuser

Published in International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2025

LGR-AD models the generation process as a distributed system of interacting agents, each representing an expert diffusion model. These agents dynamically adapt to varying conditions and collaborate through a graph neural network that encodes their relationships and performance metrics.

Paper

Efficient Model-Stealing Attacks Against Inductive Graph Neural Networks

Published in European Conference on Artificial Intelligence (ECAI), 2024

We present efficient model-stealing attacks tailored to inductive graph neural networks.

Paper

Towards More Realistic Membership Inference Attacks on Large Diffusion Models

Published in Winter Conference on Computer Vision (WACV), 2024

We design a fair evaluation framework for membership inference on Stable Diffusion, apply existing and new attacks, and show that prior setups overestimate attack success, while true membership detection remains difficult.

Paper

Bucks for Buckets (B4B): Active Defenses Against Stealing Encoders

Published in Advances in Neural Information Processing Systems (NeurIPS), 2023

B4B is an active defense against encoder model stealing.

Paper

Selectively Increasing the Diversity of GAN-generated Samples

Published in International Conference on Neural Information Processing (ICONIP), 2022

We propose a simple regularizer that selectively increases the diversity of GAN outputs where variety is desired.

Paper

Progressive Latent Replay for Efficient Generative Rehearsal

Published in International Conference on Neural Information Processing (ICONIP), 2022

We reduce the cost of generative rehearsal for continual learning by modulating the frequency of rehearsal based on the depth of the network.

Paper

Jan Dubiński
