Introducing the Open Chain of Thought Leaderboard

06 Apr 2024

The Open Chain of Thought (CoT) Leaderboard by Hugging Face tracks how effectively large language models (LLMs) generate reasoning traces for challenging tasks. Rather than reporting raw accuracy alone, the leaderboard focuses on the accuracy gain from CoT prompting: each model is scored on the same task with and without a reasoning trace, and the difference between the two scores shows how much structured reasoning actually helps.
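The with/without comparison described above comes down to a difference of two accuracies. The following is a minimal sketch of that arithmetic, not the leaderboard's own code: the function names and the toy answers are hypothetical and chosen only for illustration.

```python
def accuracy(predictions, gold):
    """Fraction of predictions that match the gold answers."""
    if len(predictions) != len(gold):
        raise ValueError("predictions and gold must have the same length")
    if not gold:
        raise ValueError("at least one example is required")
    return sum(p == g for p, g in zip(predictions, gold)) / len(gold)


def cot_accuracy_gain(baseline_predictions, cot_predictions, gold):
    """Accuracy with CoT minus accuracy without CoT, on the same items."""
    return accuracy(cot_predictions, gold) - accuracy(baseline_predictions, gold)


# Toy multiple-choice data (hypothetical, not taken from the leaderboard).
gold = ["A", "C", "B", "D"]
baseline = ["A", "B", "B", "A"]   # answers given without a reasoning trace
with_cot = ["A", "C", "B", "A"]   # answers given after a reasoning trace

gain = cot_accuracy_gain(baseline, with_cot, gold)
print(gain)  # 0.25 (0.75 with CoT minus 0.50 without)
```

A positive value means the reasoning trace helped on this task, and a negative value means it hurt.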

Evaluations include tasks like LogiQA and LSAT, chosen for their relevance and difficulty. Each model is assessed under several CoT generation strategies, such as step-by-step reasoning and reflective prompts, and each strategy is run with multiple decoding parameters so that results do not hinge on a single configuration.
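Because a model is evaluated under several combinations of prompting strategy and decoding parameters, its per-combination gains have to be summarized somehow. The sketch below shows one plausible way to do that, reporting the best and the mean gain. It is an assumption made for illustration, not a description of how the leaderboard aggregates results, and the regime labels and numbers are made up.

```python
from statistics import mean


def summarize_gains(gains_by_regime):
    """Return the best regime, its gain, and the mean gain across regimes.

    gains_by_regime maps a (strategy, decoding) label pair to an accuracy gain.
    """
    if not gains_by_regime:
        raise ValueError("at least one regime is required")
    best_regime = max(gains_by_regime, key=gains_by_regime.get)
    return {
        "best_regime": best_regime,
        "best_gain": gains_by_regime[best_regime],
        "mean_gain": mean(list(gains_by_regime.values())),
    }


# Hypothetical gains for one model on one task; the labels are illustrative.
gains = {
    ("step-by-step", "greedy"): 0.25,
    ("step-by-step", "sampling"): 0.125,
    ("reflective", "greedy"): 0.0,
}

summary = summarize_gains(gains)
print(summary["best_regime"], summary["best_gain"], summary["mean_gain"])
```

Reporting both numbers separates a model that benefits from CoT only under one favorable configuration from one that benefits consistently.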

Initial findings indicate that smaller, fine-tuned models can outperform larger ones in specific scenarios, which suggests that the benefit of CoT depends on more than model size. Future plans include expanding the leaderboard's range of tasks, developing a comprehensive dashboard, and inviting community contributions to improve this open benchmarking tool.

Contributors can submit models for evaluation, analyze evaluation results, or help develop new CoT strategies and tasks. The Open CoT Leaderboard aims to refine and democratize the assessment of reasoning capabilities in AI.
