Rich Washburn · Oct 13 · 5 min read
OpenAI Takes a Giant Leap Towards Self-Improving AI: The MLE-Bench and Its Implications
OpenAI recently unveiled the MLE-Bench, a new framework designed to assess how well AI agents perform in machine learning engineering...

Rich Washburn · Aug 22 · 3 min read
AI Scientist 2.0: A Step Closer to AGI?
Following the groundbreaking debut of SakanaAI's original AI Scientist, the research world was electrified by the idea of an AI capable...