| Entries with tag comparison | |||||
|---|---|---|---|---|---|
| List of free coding LLMs https://github.com/vava-nessa/free-coding-models | 32345 | Creator | 1000 | 0 | 0 |
| Fuel Prices in EU https://www.fuel-prices.eu | 31980 | Creator | 1000 | 0 | 0 |
| Why isn't AMD's MI300X competitive? https://newsletter.semianalysis.com/p/mi300x-vs-h100-vs-h200-benchmark-part-1-training | 30846 | Creator | 1000 | 0 | 0 |
| Can I Run AI locally? https://www.canirun.ai | 23892 | Creator | 1000 | 0 | 0 |
| Self-Hosted LLM Leaderboard https://onyx.app/self-hosted-llm-leaderboard | 23612 | Creator | 1000 | 0 | 0 |
| Show HN: A real-time strategy game that AI agents can play https://llmskirmish.com | 21298 | Creator | 1000 | 0 | 0 |
| AI Benchy - private AI benchmark https://aibenchy.com | 21226 | Creator | 1000 | 0 | 0 |
| "Car Wash" test with 53 models https://opper.ai/blog/car-wash-test | 21016 | Creator | 1000 | 0 | 0 |
| Epoch.ai - tracking AI enhancements https://epoch.ai | 18529 | Creator | 1000 | 0 | 0 |
| Metabenchmark of the LLMs https://metabench.organisons.com | 9279 | Creator | 1000 | 0 | 0 |
| Openrouter model rankings https://openrouter.ai/rankings | 6601 | Creator | 1000 | 0 | 0 |
| SWE-rebench: A Continuously Evolving and Decontaminated Benchmark for Software Engineering LLMs https://swe-rebench.com | 6600 | Creator | 1000 | 0 | 0 |
| Poker Tournament for LLMs https://pokerbattle.ai/event | 4218 | Creator | 1000 | 0 | 0 |
| I built the same app 10 times: Evaluating frameworks for mobile performance https://www.lorenstew.art/blog/10-kanban-boards | 4206 | Creator | 1000 | 0 | 0 |
| Show HN: I tracked the adoption of AI coding extensions in VS Code since 2022 https://bloomberry.com/coding-tools.html | 2260 | Creator | 1000 | 0 | 0 |
| Altindex - Alternative financial data https://altindex.com | 2092 | Creator | 1000 | 0 | 0 |
| Koyfin - financial data https://www.koyfin.com | 2086 | Creator | 1000 | 0 | 0 |
| GuruFocus - financial data https://www.gurufocus.com | 2085 | Creator | 1000 | 0 | 0 |
| Stockanalysis - financial data https://stockanalysis.com | 2084 | Creator | 1000 | 0 | 0 |
| Finviz - financial data https://finviz.com | 2083 | Creator | 1000 | 0 | 0 |
| Finbox - financial data https://finbox.com | 2082 | Creator | 1000 | 0 | 0 |
| EQ-Bench - AI writing benchmarks https://eqbench.com | 561 | Creator | 1000 | 0 | 0 |
| Rust template engine comparisons by Askama https://github.com/askama-rs/template-benchmark | 543 | Creator | 1000 | 0 | 0 |
| LLM comparison in Register-Transfer Level generation for hardware design https://huggingface.co/spaces/HPAI-BSC/TuRTLe-Leaderboard | 541 | Creator | 1000 | 0 | 0 |
| Artificial Analysis LLM Leaderboard https://artificialanalysis.ai/leaderboards/models | 536 | Creator | 1000 | 0 | 0 |
| @techfren Coding LLM Benchmarks https://leaderboard.techfren.net | 470 | Creator | 1000 | 0 | 0 |
| CadEval - CAD performance of the LLMs https://willpatrick.xyz/cadevalresults_20250422_095709 | 468 | Creator | 1000 | 0 | 0 |
| LiveSWEBench - A Challenging, Contamination-Free Benchmark for AI Software Engineers https://liveswebench.ai | 453 | Creator | 1000 | 0 | 0 |
| MathArena: Evaluating LLMs on Uncontaminated Math Competitions https://matharena.ai | 452 | Creator | 1000 | 0 | 0 |
| Vellum LLM Leaderboard https://www.vellum.ai/llm-leaderboard | 397 | Creator | 1000 | 0 | 0 |
| ProLLM Leaderboards https://www.prollm.ai/leaderboard | 396 | Creator | 1000 | 0 | 0 |
| Humanity's Last Exam - AI benchmark https://lastexam.ai | 391 | Creator | 1000 | 0 | 0 |
| BigCodeBench Leaderboard - Evaluates LLMs with practical and challenging programming tasks https://bigcode-bench.github.io | 387 | Creator | 1000 | 0 | 0 |
| Open LLM Leaderboard https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard | 376 | Creator | 1000 | 0 | 0 |
| LLM Explorer - A Curated Large Language Model Directory and Analytics https://llm.extractum.io | 370 | Creator | 1000 | 0 | 0 |
| Shadeform - compare GPUs on demand services https://www.shadeform.ai | 363 | Creator | 1000 | 0 | 0 |
| EVKX - Electric Vehicles information site https://evkx.net | 258 | Creator | 1000 | 0 | 0 |
| Tranco - A Research-Oriented Top Sites Ranking Hardened Against Manipulation https://tranco-list.eu | 256 | Creator | 1000 | 0 | 0 |
| Claude 3.5 Sonnet vs GPT-4o: Does Claude outperform GPT-4o? https://blog.getbind.co/2024/06/21/claude-3-5-sonnet-does-it-outperform-gpt-4o?ref=rc | 254 | Creator | 100 | 0 | 0 |
| SWE-bench - Can Language Models Resolve Real-World GitHub Issues? https://www.swebench.com | 233 | Creator | 1000 | 0 | 0 |
| NYT Connections LLM Benchmark https://github.com/lechmazur/nyt-connections | 232 | Creator | 1000 | 0 | 0 |
| StackUnseen AI benchmark https://prollm.toqan.ai/leaderboard/stack-unseen | 231 | Creator | 1000 | 0 | 0 |
| Database-like ops benchmark https://h2oai.github.io/db-benchmark | 220 | Creator | 1000 | 0 | 0 |
| LiveCodeBench - AI Benchmark https://livecodebench.github.io/leaderboard.html | 219 | Creator | 1000 | 0 | 0 |
| Artificial Analysis - AI comparison https://artificialanalysis.ai | 218 | Creator | 1000 | 0 | 0 |
| SciCode AI benchmark https://github.com/scicode-bench/SciCode | 191 | No Creator | 1 | 0 | 0 |
| SEAL LLM Leaderboards https://scale.com/leaderboard | 190 | No Creator | 0 | 0 | 0 |
| DB Performance tests https://github.com/MaibornWolff/database-performance-comparison | 179 | Creator | 1000 | 0 | 0 |
| ARC Prize for AGI https://arcprize.org/blog | 92 | Creator | 100 | 0 | 0 |
| Aider AI leaderboard https://aider.chat/docs/leaderboards | 80 | Creator | 100 | 0 | 0 |
| LLM benchmark https://dubesor.de/benchtable | 43 | Creator | 100 | 0 | 0 |
| OpenRouter - AI router https://openrouter.ai | 62 | Creator | 100 | 0 | 0 |
| List of certain products https://www.productchart.com/ | 18 | Creator | 100 | 0 | 0 |
| LiveBench - A Challenging, Contamination-Free LLM Benchmark https://livebench.ai | 13 | Creator | 100 | 0 | 0 |
| LMSYS Chatbot Arena https://lmarena.ai | 60 | Creator | 100 | 0 | 0 |