publications

2026

  1. Sockpuppetting: Jailbreaking LLMs by Combining Prefilling with Optimization
    Asen Dotsinski and Panagiotis Eustratiadis
    arXiv preprint, 2026

2025

  1. CLaRE: CLIP with Latent Reconstruction Errors for Generated Face Detection
    Udit Thakur, Mohammad Hafeez Khan, Meher Changlani, Asen Dotsinski, and 3 more authors
    In Proceedings of the 1st ACM Workshop on Deepfake, Deception, and Disinformation Security, 2025
  2. On the Generalizability of "Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals"
    Asen Dotsinski, Udit Thakur, Marko Ivanov, Mohammad Hafeez Khan, and 1 more author
    Transactions on Machine Learning Research, 2025