Itay Nakash

NLP, AI Security & Safety Researcher @ IBM Research

prof_pic_circel.png

I’m a Research Scientist at IBM Research, working on NLP and LLM-based agents.

My research focuses on improving how language models and agents are trained, evaluated, and deployed in practical settings. I have worked on AI safety and security, red-teaming, agent evaluation, alignment, and LLM efficiency, with publications at conferences including NAACL, ACL, EMNLP, and COLM.

More broadly, I’m interested in applied research that connects new technical ideas with real-world AI systems: building methods that are useful, measurable, and can provide real value beyond paper settings.

News

Apr 29, 2026 Excited to share that our paper Efficient Agent Evaluation via Diversity-Guided User Simulation was accepted to ACL 2026! TL;DR: DIVERT offer an efficient and coverage based user simulator to evaluate LLM agents.
Jan 15, 2026 Happy to share that ideas and findings from our CRAFT paper (EMNLP 2025) made their way into IBM watsonx Orchestrate’s LLM agent vulnerability testing. We advised and co-designed the red-teaming components of the Agent Evaluation system together with the software teams.
Jan 01, 2025 I’m starting my full-time role as an NLP Researcher at IBM Research, focusing on Gen-AI safety and agentic AI security. Looking forward to tackling new challenges in the field!

Selected Publications

  1. ACL 2026
    Efficient Agent Evaluation via Diversity-Guided User Simulation
    Itay Nakash, George Kour, and Ateret Anaby-Tavor
    2026
  2. NAACL 2025
    Breaking ReAct Agents: Foot-in-the-Door Attack Will Get You In
    Itay Nakash, George Kour, Guy Uziel, and 1 more author
    In Findings of the Association for Computational Linguistics: NAACL 2025, Apr 2025
  3. COLM 2025
    AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation
    Itay Nakash, Nitay Calderon, Eyal Ben David, and 2 more authors
    In Second Conference on Language Modeling, Apr 2025
  4. EMNLP 2025
    Effective Red-Teaming of Policy-Adherent Agents
    Itay Nakash, George Kour, Koren Lazar, and 3 more authors
    In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, Nov 2025

Latest Posts

Dec 02, 2025 EMNLP 2025 Highlights