Itay Nakash
NLP, AI Security & Safety Researcher @ IBM Research
I’m a Research Scientist at IBM Research, working on NLP and LLM-based agents.
My research focuses on improving how language models and agents are trained, evaluated, and deployed in practical settings. I have worked on AI safety and security, red-teaming, agent evaluation, alignment, and LLM efficiency, with publications at conferences including NAACL, ACL, EMNLP, and COLM.
More broadly, I’m interested in applied research that connects new technical ideas with real-world AI systems: building methods that are useful, measurable, and can provide real value beyond paper settings.
News
| Apr 29, 2026 | Excited to share that our paper Efficient Agent Evaluation via Diversity-Guided User Simulation was accepted to ACL 2026! TL;DR: DIVERT offer an efficient and coverage based user simulator to evaluate LLM agents. |
|---|---|
| Jan 15, 2026 | Happy to share that ideas and findings from our CRAFT paper (EMNLP 2025) made their way into IBM watsonx Orchestrate’s LLM agent vulnerability testing. We advised and co-designed the red-teaming components of the Agent Evaluation system together with the software teams. |
| Jan 01, 2025 | I’m starting my full-time role as an NLP Researcher at IBM Research, focusing on Gen-AI safety and agentic AI security. Looking forward to tackling new challenges in the field! |
Selected Publications
Latest Posts
| Dec 02, 2025 | EMNLP 2025 Highlights |
|---|