Show HN: I Made a Hot or Not Benchmark for AI Design

Hacker News - AI
Jul 5, 2025 16:08
grxxxce

Summary

A team built a "Hot or Not" style voting game to rank AI-generated frontend designs across categories such as websites, game development, and 3D models. The authors report that output quality varies widely across models and categories: they were especially impressed by DeepSeek and Grok, while OpenAI's models did very well in game development but poorly elsewhere. The crowdsourced leaderboard shows both how impressive current AI design output can be and how limited it still is.

We noticed most AI-generated frontend looks and feels vibe-coded, but couldn't put our finger on why. So we built a voting game to figure out the best ranking internally. It was surprisingly fun (and useful), so we refined it and wanted to share it here!

State-of-the-art models go head-to-head in design across websites, game dev, 3D models, and more. The things that get generated are at times very impressive, and at times make AGI feel far, far away.

We were especially impressed with the quality of DeepSeek and Grok, and by the variance between categories (OpenAI is very good for game dev, but seems to suck everywhere else).

Leaderboard: https://www.designarena.ai/leaderboard
Voting: https://www.designarena.ai/vote

Give us your thoughts (and if you make something cool, we want to see it :)!

Comments URL: https://news.ycombinator.com/item?id=44473673
Points: 5
# Comments: 1
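The post doesn't say how head-to-head votes become a leaderboard. One common way to rank from pairwise votes is an Elo-style update, so here is a minimal sketch of that idea. Everything in it is an assumption for illustration: the `K` and `START` constants, the function names, and the placeholder model names are hypothetical, and this is not necessarily what Design Arena does.

```python
from collections import defaultdict

K = 32          # assumed update step size, not taken from the post
START = 1000.0  # assumed starting rating for every model


def expected_score(r_a, r_b):
    """Elo-style probability that the model rated r_a beats the one rated r_b."""
    return 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400.0))


def rank_from_votes(votes):
    """Turn (winner, loser) votes into a list of (model, rating), best first."""
    ratings = defaultdict(lambda: START)
    for winner, loser in votes:
        # The less expected the win was, the more points change hands.
        delta = K * (1.0 - expected_score(ratings[winner], ratings[loser]))
        ratings[winner] += delta
        ratings[loser] -= delta
    return sorted(ratings.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical votes: each tuple is (design the voter picked, design they passed on).
votes = [
    ("model_a", "model_b"),
    ("model_a", "model_c"),
    ("model_c", "model_b"),
]
leaderboard = rank_from_votes(votes)
for name, rating in leaderboard:
    print(f"{name}: {rating:.1f}")
```

Running a separate set of ratings per category (websites, game dev, 3D models) would be one way to surface the kind of per-category variance the post describes.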
