IBM Research

company

https://research.ibm.com/

AI & ML interests

Enterprise AI and ML, Foundation Models, Responsible AI

Recent Activity

nhinds updated a dataset 29 minutes ago

ibm-research/nhinds-dataset

nhinds published a dataset 29 minutes ago

ibm-research/nhinds-dataset

DhavalPatel submitted a paper about 22 hours ago

Beyond Final Answers: Auditing Trajectory-Level Hallucinations in Multi-Agent Industrial Workflows

View all activity

Papers

Beyond Final Answers: Auditing Trajectory-Level Hallucinations in Multi-Agent Industrial Workflows

Agentic CLEAR: Automating Multi-Level Evaluation of LLM Agents

View all Papers

Articles

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

about 8 hours ago

The Open Agent Leaderboard

Inside VAKRA: Reasoning, Tool Use, and Failure Modes of Agents

ALTK‑Evolve: On‑the‑Job Learning for AI Agents

IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST

AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality

CUGA on Hugging Face: Democratizing Configurable AI Agents

View all articles

ibm-research 's collections 9