Humans beat AI at international math contest despite gold-level AI scores

Hacker News - AI

Jul 23, 2025 04:09

moneil971


hackernews, ai, discussion

Summary

In an international math contest, human participants outperformed advanced AI systems, even though the AI systems achieved gold-level scores. The result shows that, although AI has made significant progress in mathematical problem-solving, humans still hold an edge in competitive settings. The outcome suggests ongoing challenges for AI in matching human intuition and adaptability in high-level mathematics.

Article URL: https://phys.org/news/2025-07-humans-ai-international-math-contest.amp
Comments URL: https://news.ycombinator.com/item?id=44655662
Points: 1
# Comments: 0

More News

Trump’s AI strategy trades guardrails for growth in race against China

AI News - TechCrunch

Jul 23

The Trump administration’s new AI Action Plan prioritizes rapid AI development, national security, and competition with China, marking a departure from Biden’s more cautious, risk-focused policies. This shift signals fewer regulatory guardrails in favor of accelerating innovation and asserting U.S. leadership in the global AI race.

Show HN: Kafka, the first AI employee (NEW SOTA ON GAIA BY 20%)

Hacker News - AI

Jul 23

Brainbase Labs has introduced Kafka, an AI "employee" capable of handling tasks via email, phone, and Slack, and performing real-world work such as coding and project management. Kafka achieves a state-of-the-art 77.2% on the GAIA Level 3 benchmark, thanks to a new "structured planning" algorithm that enables long-term, reliable task execution. This development marks a significant step toward practical, autonomous AI agents that can integrate seamlessly into human workflows.

AI might not recursively self improve (part 2)