Reasoning skills of large language models are often overestimated
New CSAIL research highlights how LLMs excel in familiar scenarios but struggle in novel ones, questioning their true reasoning abilities versus reliance on memorization.
Learn about artificial intelligence, GPT usage, prompt engineering and other technology news and updates from Land of GPT. The site aggregates articles from official RSS feeds under their original authorship. Each article has a do-follow link to the original source.
New CSAIL research highlights how LLMs excel in familiar scenarios but struggle in novel ones, questioning their true reasoning abilities versus reliance on memorization.
More accurate uncertainty estimates could help users decide about how and when to use machine-learning models in the real world.
Developed by MIT RAISE, the Day of AI curriculum empowers K-12 students to collaborate on local and global challenges using AI.
This new tool offers an easier way for people to analyze complex tabular data.
MosaicML, co-founded by an MIT alumnus and a professor, made deep-learning models faster and more efficient. Its acquisition by Databricks broadened that mission.
The program focused on AI in health care, drawing on Takeda’s R&D experience in drug development and MIT’s deep expertise in AI.
LLMs trained primarily on text can generate complex visual concepts through code with self-correction. Researchers used these illustrations to train an image-free computer vision system to recognize real photos.
The SPARROW algorithm automatically identifies the best molecules to test as potential new medicines, given the vast number of factors affecting each choice.
Combining natural language and programming, the method enables LLMs to solve numerical, analytical, and language-based tasks transparently.
The method uses language-based inputs instead of costly visual data to direct a robot through a multistep navigation task.