Home Technology? Autoresearch and the experimental society...
Technology⭐ Featured

? Autoresearch and the experimental society

The most important thing happening in AI right now is not just the intelligence of the models, but the harnesses that make that intelligence usable.

6 April 2026 at 02:01 pm
1 views
? Autoresearch and the experimental society

The most significant development in the world of AI right now isn't just about the models' intelligence, but the tools that make this intelligence accessible and usable. This is where Autoresearch, a Python code released a few weeks ago, comes in. Autoresearch, as described in EV#565, is an autonomous experimental loop that transforms the way knowledge is produced.

In its initial experiment, Autoresearch trained a GPT-2-level model over just two days, achieving a 11% faster training time and discovering 20 genuine improvements. Andrej Karpathy, the creator of Autoresearch, demonstrated its potential by running it on Shopify's internal model. The result was impressive: Autoresearch ran 37 experiments overnight, producing a 0.8-billion-parameter model that outperformed the previous 1.6-billion-parameter version by 19%. Notably, Shopify's CEO, Toby Lütke, is not a machine learning engineer, yet Autoresearch's ease of use and efficiency allowed him to harness its power effectively.

Autoresearch is revolutionary because it addresses two critical challenges simultaneously. Firstly, it automates parts of the knowledge-production process, making it more efficient and accessible. Secondly, it solves the agent control problem, ensuring that AI remains focused on the task at hand. Often, AI systems can drift if given an open-ended brief or if optimized for the wrong metrics. Autoresearch prevents this by design, as it keeps the AI on track with the human's strategic direction.

The human sets the destination, while Autoresearch manages the execution, much like a self-driving car where the driver decides the route, and the vehicle handles the driving. This balance between human guidance and AI execution is key to harnessing the full potential of AI without losing control.

Recognizing the broader applicability of Autoresearch, Andrej Karpathy spent the last month adapting it for knowledge work beyond machine learning. His goal was to create a system that could run structured, low-cost experiments on the kinds of decisions teams make weekly. He named this version AutoBeta and made the full playbook and skills available to paying members.

The measurement problem, which often plagues AI experiments, is also addressed by Autoresearch. By providing a clear, iterative process of hypothesis, testing, scoring, and iteration, Autoresearch ensures that progress is measurable and consistent. This approach not only streamlines the experimentation process but also enhances the reliability of the results.

In conclusion, Autoresearch represents a paradigm shift in how we approach AI and knowledge production. By automating parts of the process and ensuring agent control, it democratizes access to powerful AI tools. The ability to apply Autoresearch beyond machine learning opens up new possibilities for various industries and teams, enabling them to make data-driven decisions more efficiently. As Andrej Karpathy continues to refine and expand AutoBeta, the potential for transforming knowledge work and AI applications becomes increasingly clear. The future of AI is not just about building smarter models, but about creating the right tools to make those models useful and accessible to everyone.

📰 Related News
Ekaya Banaras Founder Palak Shah’s ₹40 Lakh Billboard Mistake Became a Masterclass in Startup Marketing
Ekaya Banaras Founder Palak Shah’s ₹40 Lakh Billboard Mistake Became a Masterclass in Startup Marketing
Ekaya Banaras founder Palak Shah recently opened up about one of the most expensive mistakes she made while building her luxury textile brand. During the early years of the company, Shah rented a premium billboard near Delhi’s DLF Emporio to increase brand visibility. However, after forgetting to cancel the campaign, the hoarding reportedly continued running for months — resulting in losses of nearly ₹40 lakh. The incident has now become a viral example of how small operational oversights can turn into costly business lessons for startups and entrepreneurs.
28 May
Betting On AI: Jensen Huang And NVIDIA’s Rise To The Top
Betting On AI: Jensen Huang And NVIDIA’s Rise To The Top
Before AI was inevitable, it was a gamble—and Jensen Huang went all in.
14 Apr
Red Hat OpenShift sandboxed containers 1.12 and Red Hat build of Trustee 1.1 bring confidential computing to bare metal and AI workloads
Red Hat OpenShift sandboxed containers 1.12 and Red Hat build of Trustee 1.1 bring confidential computing to bare metal and AI workloads
Red Hat is excited to announce the release of Red Hat OpenShift sandboxed containers 1.12 and Red Hat build of Trustee 1.1, marking a major leap forward in our confidential computing journey. These releases graduate confidential containers on bare metal from …
14 Apr
Large AI firms hoovering maximum funding, not enough for smaller startups: Y Combinator’s Ankit Gupta
Large AI firms hoovering maximum funding, not enough for smaller startups: Y Combinator’s Ankit Gupta
YC Startup School: India’s talent pool across colleges and universities are key for building next-gen startups, which is what YC is looking to tap into. It wants to target entrepreneurs building for global markets, focussed on fintech, consumer, B2B, and ecom…
14 Apr
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
TSMC-RESULTS/ (PREVIEW, PIX):PREVIEW-TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
14 Apr
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
Any profit result ‌above T$505.7 billion would mark the company's highest-ever quarterly net income ​and its ninth consecutive quarter of profit growth
14 Apr
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
On Thursday, ​TSMC is expected to report a net profit of $17.1 billion for the quarter, according to an LSEG SmartEstimate compiled from 19 analysts. The war in the Middle East threatens to disrupt the supply of production materials for semiconductors such as…
14 Apr
If we can’t kick the habit, how do we manage AI’s energy needs?
If we can’t kick the habit, how do we manage AI’s energy needs?
One can only hope that OpenAI’s Sam Altman was joking when he sought to justify the immense energy consumption of artificial intelligence
14 Apr
What caused Nvidia Blackwell GPU prices to spike? #tech
What caused Nvidia Blackwell GPU prices to spike? #tech
Blackwell GPU hourly “rent” surges on agentic AI demand A compute pricing index tracking hourly costs for Nvidia Blackwell GPUs shows a sharp climb: hourly rental hit $4.08 , up 48% from $2.75 just two months earlier. The reported driver is rising demand tied…
14 Apr
Anthropic Releases Claude Mythos Preview with Cybersecurity Capabilities but Withholds Public Access
Anthropic Releases Claude Mythos Preview with Cybersecurity Capabilities but Withholds Public Access
Anthropic has introduced Claude Mythos Preview, its most advanced AI model, improving significantly in reasoning, coding, and cybersecurity. Unlike previous releases, it will not be publicly available. Access is limited to a consortium of tech companies throu…
14 Apr