News
Students shouldn’t encounter word problems for the first time in an assessment. Teachers need to bring the right word ...
Official repository for the paper "MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?". 🌟 For more details, please refer to the project page with dataset exploration ...
The new lineup performs better at software engineering tasks, follows instructions more precisely, and can process up to one million tokens of context, equivalent to about 750,000 words.
As the vision encoders of most MLLMs are trained on natural scenes, they often struggle to understand geometric diagrams, performing no better in geometry problem-solving than LLMs that only process ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results