CV
Education
Ph.D. in Computer Science
Warsaw University of Technology • Oct 2021 – Oct 2025 (awaiting defence)
Thesis: “Reliable and Safe Generative Models”M.Sc. in Computer Science
Warsaw University of Technology • Feb 2018 – Jun 2021
Specialization: Information and Decision Systems
Erasmus Exchange – University of Vienna, Mar 2020 – Jun 2020M.Sc. in Power Engineering
Warsaw University of Technology • Feb 2018 – Sep 2020
Erasmus Exchange – Polytechnic University of Catalonia, Sep 2018 – Jan 2019B.Sc. in Quantitative Methods in Economics and Information Systems
Warsaw School of Economics • Oct 2016 – Feb 2020B.Sc. in Power Engineering
Warsaw University of Technology • Oct 2014 – Feb 2018
Work experience
Astra Fellow, Constellation Institute at Berkeley, USA • Jan 2025 – Present
Working with Owain Evans’s TruthfulAI on safety and alignment of Large Language ModelsMARS Fellow, Cambridge AI Safety Hub • Dec 2022 – Jan 2026
Research on safety of Vision-Language Models with Yossi GandelsmanSenior Specialist, NASK National Research Institute • May 2025 – Present
Department of Security and Transparency of Artificial IntelligenceVisiting Researcher, CISPA – Helmholtz Center for Information Security • Mar 2024 – May 2024
Working on identifying the use of copyrighted data in Diffusion Models with Adam DziedzicPhD Student Researcher, IDEAS NCBR • Oct 2023 – Apr 2024
Research in Systems Security and Data PrivacyCollaborator, CERN, the European Organization for Nuclear Research • Feb 2019 – Present
Fast simulations of a particle calorimeter at the Large Hadron Collider using generative adversarial networks and variational autoencodersJunior Machine Learning Engineer, Samsung R&D Institute Poland • Jul 2019 – Oct 2021
Development of machine learning systems for automatic code repairIAESTE Internship, National Technical University in Bahia Blanca, Argentina • Aug 2017 – Sep 2017
Development of grid power failure simulations
Skills
- Strong background in machine learning, with a focus on generative models and their safety, including Large Language Models, Generative Vision Models, and Vision-Language Models
- Practical research experience across academia and international institutions
- Experience in frontier model adaptation and evaluation
- Advanced programming proficiency in Python 3
- In-depth knowledge of:
- Machine Learning: PyTorch, TensorFlow, Keras
- Data Science: Pandas, NumPy
- Data Visualization: Matplotlib, Seaborn
- Practical knowledge of:
- Version control: Git
- Operating systems: Linux
- Containerization: Docker
- Foreign languages: English C2 (fluent), German B1, Japanese A2
Selected Publications
Universal Properties of Activation Sparsity in Modern Large Language Models
F. Szatkowski, P. Będkowski, A. Devoto, J. Dubiński, P. Minervini, M. Piórczyński, S. Scardapane, B. Wójcik
International Conference on Learning Representations (ICLR), 2026, CORE A*On Stealing Graph Neural Network Models
M. Podhajski, J. Dubiński, F. Boenisch, A. Dziedzic, A. Pręgowska, T. Michalak
AAAI Conference on Artificial Intelligence (AAAI), 2026, CORE A*Privacy Attacks on Image Autoregressive Models
A. Kowalczuk*, J. Dubiński*, F. Boenisch, A. Dziedzic
*International Conference on Machine Learning (ICML), 2025, CORE A***CDI: Copyrighted Data Identification in Diffusion Models
J. Dubiński*, A. Kowalczuk*, F. Boenisch, A. Dziedzic
*Conference on Computer Vision and Pattern Recognition (CVPR), 2025, CORE A***Learning Graph Representation of Agent Diffuser
Y. Djenouri, N. Belmecheri, T. Michalak, J. Dubiński, A. N. Belbachir, A. Yazidi
International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2025, CORE A*Bucks for Buckets (B4B): Active Defenses Against Stealing Encoders
J. Dubiński*, S. Pawlak*, F. Boenisch*, T. Trzcinski, A. Dziedzic
Neural Information Processing Systems (NeurIPS), 2023, CORE A*Towards More Realistic Membership Inference Attacks on Large Diffusion Models
J. Dubiński, A. Kowalczuk, S. Pawlak, P. Rokita, T. Trzciński, P. Morawiecki
Winter Conference on Computer Vision (WACV), 2024, CORE AEfficient Model-Stealing Attacks Against Inductive Graph Neural Networks
M. Podhajski, J. Dubiński, F. Boenisch, A. Dziedzic, A. Pregowska, T. Michalak
European Conference on Artificial Intelligence (ECAI), 2024, CORE ASelectively Increasing the Diversity of GAN-generated Samples
J. Dubiński, K. Deja, S. Wenzel, P. Rokita, T. Trzciński
International Conference on Neural Information Processing (ICONIP), 2022, CORE AProgressive Latent Replay for Efficient Generative Rehearsal
S. Pawlak, F. Szatkowski, M. Bortkiewicz, J. Dubiński, T. Trzciński
International Conference on Neural Information Processing (ICONIP), 2022, CORE AEfficient LLM Moderation with Multi-Layer Latent Prototypes
M. Chrabąszcz, F. Szatkowski, B. Wójcik, J. Dubiński, T. Trzciński
ICLR Workshop on Building Trust in Language Models and Applications, 2025
