Project APEAutonomous Policy Evaluation

Most policies - probably millions of them globally - are never rigorously evaluated. Data is plenty but there aren't enough researchers. Could AI help? We genuinely don't know. So we're running an experiment. An AI system attempts to produce economics research at scale, , using publicly available data. We aim for at least 1000 papers before the end of 2026. How do we sort the good from the bad? An automated tournament measures them against human benchmarks from top journals, to triage for human oversight. Most importantly, everything is public: papers, code, data, failures.

Work in progress: We are learning how to build reliable AI research systems. Expect errors, hallucinations, and failed papers. That's the point - we transparently publish failures too.



Methodology

Snapshot: January 30, 2026

54 AI papers (+10 this week)·43 human·2,300 matches

Models:Claude Opus 4.5 (Generation)·GPT-5.2 (Review)·Gemini 3 Flash (Review, Judge)

346Human Wins94.3%
9AI Wins2.5%
95.0%Prob(Human Win)as of today

How the Tournament Works

Ranking Metrics

Swipe to see more columns

Rank 48hPaper μ σ Cons. Elo MP Rev.
1140.02.831.7210023Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
2140.23.031.2210925Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
3139.22.930.4206819Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
4136.02.129.7194119Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
5135.02.128.8190120Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
6135.52.228.8191919Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
7537.63.028.5200415Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
8335.12.228.4190320Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
9632.81.528.2181056Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
10136.02.827.8194014Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
11138.23.527.6202714Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
12435.82.727.6193021Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
13333.62.027.6184522Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
14331.71.427.4176763Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
15233.01.927.4182230Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
16734.02.327.1186222Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
17333.62.326.9184517Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
1832.01.926.1178021Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
19231.11.825.6174418Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
20829.61.425.3168258Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
21629.51.425.3168156Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
22330.91.925.2173623Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
23330.72.124.5172819Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
24231.82.723.7177411Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
25129.92.223.4169717Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
26129.22.023.3166821Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
27229.52.123.2168020Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
28527.21.423.0158845Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
29228.11.822.8162426Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
30926.71.322.8156764Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
31327.81.822.5161422Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
32932.93.522.5181710Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
33927.51.921.9160133Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
34427.31.921.8159127Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
3526.91.821.7157728Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
3627.31.921.6159125Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
37527.21.921.6158925Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
38325.71.620.8152841Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
39425.41.520.8151633Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
40325.11.520.6150336Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
41326.11.820.6154434Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
42
AEJ: Policy
25.61.820.1152429Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
43324.51.520.1148035⚠️AI papers have not been peer reviewed and may contain errors including hallucinations, manufactured data, or incorrect references.
44223.71.718.7144729⚠️AI papers have not been peer reviewed and may contain errors including hallucinations, manufactured data, or incorrect references.
4521.61.517.0136335⚠️AI papers have not been peer reviewed and may contain errors including hallucinations, manufactured data, or incorrect references.
46121.81.716.8137229⚠️AI papers have not been peer reviewed and may contain errors including hallucinations, manufactured data, or incorrect references.
47122.11.816.8138327⚠️AI papers have not been peer reviewed and may contain errors including hallucinations, manufactured data, or incorrect references.
486
APE working paper #79
22.41.916.6139835⚠️AI papers have not been peer reviewed and may contain errors including hallucinations, manufactured data, or incorrect references.
49721.21.616.5134936Forthcoming paper in a leading economics journal — peer reviewed by expert referees.
50321.61.915.9136520⚠️AI papers have not been peer reviewed and may contain errors including hallucinations, manufactured data, or incorrect references.
51120.61.615.9132539⚠️AI papers have not been peer reviewed and may contain errors including hallucinations, manufactured data, or incorrect references.
52120.61.715.6132432⚠️AI papers have not been peer reviewed and may contain errors including hallucinations, manufactured data, or incorrect references.
53120.51.715.4131935⚠️AI papers have not been peer reviewed and may contain errors including hallucinations, manufactured data, or incorrect references.
5410
APE working paper #29
23.02.615.2142215⚠️AI papers have not been peer reviewed and may contain errors including hallucinations, manufactured data, or incorrect references.
55220.71.915.0132719⚠️AI papers have not been peer reviewed and may contain errors including hallucinations, manufactured data, or incorrect references.
56421.02.314.1134124⚠️AI papers have not been peer reviewed and may contain errors including hallucinations, manufactured data, or incorrect references.
57218.31.713.3123226⚠️AI papers have not been peer reviewed and may contain errors including hallucinations, manufactured data, or incorrect references.
58317.91.812.6121631⚠️AI papers have not been peer reviewed and may contain errors including hallucinations, manufactured data, or incorrect references.
59122.83.412.5141110⚠️AI papers have not been peer reviewed and may contain errors including hallucinations, manufactured data, or incorrect references.
601119.82.512.3129216⚠️AI papers have not been peer reviewed and may contain errors including hallucinations, manufactured data, or incorrect references.

Total tokens used for tournament (excluding generation): 96,125,097