Publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2025

  1. NAACL 2025
    Breaking ReAct Agents: Foot-in-the-Door Attack Will Get You In
    Itay Nakash, George Kour, Guy Uziel, and 1 more author
    In Findings of the Association for Computational Linguistics: NAACL 2025, Apr 2025
  2. COLM 2025
    AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation
    Itay Nakash, Nitay Calderon, Eyal Ben David, and 2 more authors
    Apr 2025
  3. ACL 2025
    Think Again! The Effect of Test-Time Compute on Preferences, Opinions, and Beliefs of Large Language Models
    George Kour, Itay Nakash, Michal Shmueli-Scheuer, and 1 more author
    In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track), Jul 2025
  4. Under Review
    Effective Red-Teaming of Policy-Adherent Agents
    Itay Nakash, George Kour, Koren Lazar, and 3 more authors
    Jul 2025