Large language models continue to struggle with hallucinations, presenting a major roadblock for real-world enterprise applications. Reducing these errors is a messy business, forcing model developers [...]
According to physician Zhu Kelei, AI has definitively saved the lives of patients whose scans were only flagged by PANDA, an AI tool developed by Alibaba researchers. The system analyzes non-contrast [...]
Resolve AI, the production-operations startup backed by Greylock and Lightspeed Venture Partners, today announced a sweeping expansion of its platform that introduces always-on background agents, a re [...]
LLMs designed for reasoning, like Claude 3.7 and Deepseek-R1, are supposed to excel at complex problem-solving by simulating thought processes. But a new study by Apple researchers suggests that these [...]
A Google study finds that the standard three to five human raters per test example often aren't enough for reliable AI benchmarks, and that splitting your annotation budget the right way matters [...]
A new study finds that narcissism, Machiavellianism, materialism, and psychopathy are closely linked to academic dishonesty and heavier use of generative AI tools like ChatGPT and Midjourney.<br /& [...]