Mastodon discussion Discussions 8h ago

Прогнал семь LLM через свой русский спортивный бенчмарк. Базовой моделью всё равно оставляю Gemma 4 31BПрогнали семь LLM...

by Habr

Прогнал семь LLM через свой русский спортивный бенчмарк. Базовой моделью всё равно оставляю Gemma 4 31BПрогнали семь LLM через свой русский спортивный бенчмарк. Топовые модели closed-source выигрывают 1.5-1.7 балла. Базовой моделью всё равно остаётся Gemma 4 31B — рассказываю почему.https://habr.com/ru/articles/1036448/#llm #бенчмарк #gemma #qwen #openrouter #русский_язык #dora #sft #спорт #llmjudge

Read Original

Google LLM

Metadata

Account: habr@zhub.link

Mastodon discussion 42m ago

https://www.youtube.com/watch?v=aUdupjFFp3g#AI #Samsung

Mastodon discussion 43m ago

In his lawsuit against OpenAI, Elon Musk evoked a “Terminator” scenario. But he said nothing about the people AI is alre...

In his lawsuit against OpenAI, Elon Musk evoked a “Terminator” scenario. But he said nothing about the people AI is already killinghttps://interc.pt/4tG8RHW#OpenAI #ElonMusk #FuckA...

Mastodon discussion 43m ago

Come for the info on #AI #evals, stay for the #LizzieMcGuire references 😎💅https://www.youtube.com/watch?v=_G9dDPKEIygMy ...

Come for the info on #AI #evals, stay for the #LizzieMcGuire references 😎💅https://www.youtube.com/watch?v=_G9dDPKEIygMy talk from the roundtable on Philanthropic Strategies for AI ...

Прогнал семь LLM через свой русский спортивный бенчмарк. Базовой моделью всё равно оставляю Gemma 4 31BПрогнали семь LLM...

Metadata

Related

https://www.youtube.com/watch?v=aUdupjFFp3g#AI #Samsung

In his lawsuit against OpenAI, Elon Musk evoked a “Terminator” scenario. But he said nothing about the people AI is alre...

Come for the info on #AI #evals, stay for the #LizzieMcGuire references 😎💅https://www.youtube.com/watch?v=_G9dDPKEIygMy ...