Saturday, July 5, 2025
Show HN: I Made a Hot or Not Benchmark for AI Design https://ift.tt/ZtUNiVG
We noticed that most AI-generated frontend looks and feels vibe-coded, but we couldn't put our finger on why. So we built a voting game to work out the best ranking internally. It was surprisingly fun (and useful), so we refined it and wanted to share it here!

State-of-the-art models go head-to-head on design across websites, game dev, 3D models, and more. The things they generate are at times very impressive, and at times make AGI feel far, far away. We were especially impressed by the quality of DeepSeek and Grok, and we noticed a lot of variance between categories (OpenAI is very good for game dev, but seems to suck everywhere else).

Leaderboard: https://ift.tt/D5kdrvl
Voting: https://ift.tt/vqjnrGX

Give us your thoughts (and if you make something cool, we want to see it :)!

July 5, 2025 at 11:08PM