|
USA-IL-SULLIVAN Azienda Directories
|
Azienda News:
- LLM agents flunk CRM and confidentiality tasks • The Register
A new benchmark developed by academics shows that LLM-based AI agents perform below par on standard CRM tests and fail to understand the need for customer confidentiality A team led by Kung-Hsiang Huang, a Salesforce AI researcher, showed that using a new benchmark relying on synthetic data, LLM
- Salesforce study warns against rushing LLMs into CRM . . . - CIO
According to the study, this points to a broader issue LLM agents still lack built-in awareness of confidentiality protocols The findings echo rising enterprise caution
- Study shows AI agents struggle with CRM and confidentiality
Large Language Model (LLM) agents aren’t very good at key parts of CRM, according to a study led by Salesforce AI scientist Kung-Hsiang Huang The report showed AI agents had a roughly 58% success rate on single-step tasks that didn’t require follow-up actions or information That dropped to 35% when a task required multiple steps
- Salesforce study finds LLM agents flunk CRM and . . .
A new benchmark developed by academics shows that LLM-based AI agents perform below par on standard CRM tests and fail to understand the need for customer confidentiality A team led by Kung-Hsiang Huang, a Salesforce AI researcher, showed that using a new benchmark relying on synthetic data, LLM agents achieve around a 58 …
- AI Agents Fall Short in Professional Business Tasks, New . . .
The research, published in a paper titled "CRMArena-Pro: Holistic Assessment of LLM Agents Across Diverse Business Scenarios and Interactions," found that leading AI agents reached approximately 58% success in single-turn business tasks, with performance dropping dramatically to just 35% in multi-turn conversational settings
- CRM AI: Confidentiality Concerns Persist - SUCCESS QUARTERLY . . .
The study paints a picture of stark contrasts in AI performance On the one hand, LLM agents demonstrated a modest 58% success rate on straightforward, single-step tasks that required no follow-up This indicates a baseline capability, a certain aptitude for simple, isolated actions
- AI in CRM: Promise and Pitfalls Revealed by Salesforce Study
The study, as covered by The Register, highlights that LLM agents consistently underperformed across essential business skills, with success rates below 38 percent for multi-step tasks This suggests a fundamental disconnect between the raw intelligence of these models and their practical application in professional environments where precision
|
|