통합 참고문헌 (References)

257 references

[1] Adam, David, "The AI co-scientist is here," Nature Medicine, 2026-03-16. [Adam, 2026] #11

[2] Anthropic, "Automated Alignment Researchers — Using LLMs to scale scalable oversight," Anthropic Research, 2026-04-14. [Anthropic, 2026] #28

[3] Astro-Han, "karpathy-llm-wiki — Agent Skills-compatible LLM Wiki for Claude Code/Codex," GitHub, 2026-04. [Astro-Han, 2026]

[4] Astorian, Lucas, "lucasastorian/llmwiki — Open-source LLM Wiki with document upload + Claude MCP," GitHub, 2026-04. [Astorian, 2026]

[5] Ahrens, Sönke (2017). How to Take Smart Notes. CreateSpace.

[6] Bush, Vannevar (1945). As We May Think (the Memex proposal). The Atlantic, July 1945.

[7] BSWEN, "What Results Did 700 Autoresearch Experiments Achieve Overnight?," Medium, 2026-03-30. [BSWEN, 2026]

[8] 0xchamin, "Mcptube — Karpathy's LLM Wiki applied to YouTube (transcripts + vision frames)," GitHub, 2026-04. [0xchamin, 2026]

[9] Clark, Andy and Chalmers, David (1998). The Extended Mind. Analysis 58(1): 7-19.

[10] Clark, Jack, "Import AI 454: Automating alignment research," Import AI, 2026-04-20. [Clark, 2026]

[11] ekadetov, "ekadetov/llm-wiki — Claude Code plugin for persistent compounding KBs in Obsidian," GitHub, 2026-04. [ekadetov, 2026]

[12] Gottweis, Juraj et al. (2025). Towards an AI co-scientist. arXiv:2502.18864. #11

[13] Guan et al. (2026). AI-Assisted Drug Re-Purposing for Human Liver Fibrosis. Advanced Science. [Guan et al., 2026]

[14] Jumper, John et al. (2021). Highly accurate protein structure prediction with AlphaFold. Nature 596: 583-589.

[15] Karpathy, Andrej, "LLM Wiki — A pattern for building personal knowledge bases using LLMs," GitHub Gist, 2026-04-04. [Karpathy, 2026]

[16] Karpathy, Andrej, "LLM Wiki announcement (Twitter/X thread)," Twitter/X, 2026-04-04. [Karpathy, 2026]

[17] Karpathy, Andrej, "Farzapedia reply — personalization argument for LLM Wiki," Twitter/X, 2026-04-12. [Karpathy, 2026]

[18] Karpathy, Andrej, "karpathy/autoresearch — AI agents running research on single-GPU nanochat training," GitHub, 2026-03-07. [Karpathy, 2026] #30

[19] Karpathy, Andrej, "Autoresearch first-run tweet — 12h / 110 changes on nanochat," Twitter/X, 2026-03-07. [Karpathy, 2026] #30

[20] Karpathy, Andrej, "Autoresearch Round 1 tweet — 700 experiments / 11% Time-to-GPT-2 reduction," Twitter/X, 2026-03-09. [Karpathy, 2026] #30

[21] Karpathy, Andrej (2017). Software 2.0. Medium. [Karpathy, 2017]

[22] King, Ross D. et al. (2009). The Automation of Science. Science 324: 85-89. [King et al., 2009]

[23] Langley, Pat (1981). Data-Driven Discovery of Physical Laws (BACON). Cognitive Science 5(1): 31-54. [Langley, 1981]

[24] Lu, Chris et al. (2024). The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery. arXiv:2408.06292. [Lu et al., 2024]

[25] Luhmann, Niklas (1992). Communicating with Slip Boxes — An Empirical Account. Essay. [Luhmann, 1992]

[26] Packer, Charles et al. (2023). MemGPT: Towards LLMs as Operating Systems. arXiv:2310.08560. [Packer et al., 2023]

[27] Park, Joon Sung et al. (2023). Generative Agents: Interactive Simulacra of Human Behavior. UIST 2023. [Park et al., 2023]

[28] 박재홍 (Park Jaehong), "RAG는 잊어라, Karpathy가 제안하는 'LLM 위키'라는 새로운 지식 관리 패러다임," GeekNews, 2026-05. [Park, 2026]

[29] Pilon, Simone et al. (2026). A flexible and affordable self-driving laboratory for automated reaction optimization. Nature Synthesis. [Pilon et al., 2026] #31

[30] Schmidgall et al. (2025). Evaluating Sakana's AI Scientist for Autonomous Research. arXiv:2502.14297. [Schmidgall et al., 2025]

[31] Silver, David et al. (2016). Mastering the game of Go with deep neural networks and tree search. Nature 529: 484-489. [Silver et al., 2016]

[32] skyllwt, "OmegaWiki — Wiki-centric full-lifecycle AI research platform on Claude Code (DAIR Lab, Peking University)," GitHub, 2026-04. [skyllwt, 2026]

[33] The New Stack, "Andrej Karpathy's 630-line Python script ran 50 experiments overnight without any human," The New Stack, 2026-03. [The New Stack, 2026]

[34] Um, Taewoong, "Brain Augmentation — manifesto for AI-era self-generating knowledge environments," terryum.ai, 2026-03-10. [Um, 2026]

[35] Um, Taewoong, "Democratization of research — three stages (document → in silico → physical)," terryum.ai, 2026-04-15. [Um, 2026]

[36] Um, Taewoong, "Claude Code → Codex 이관 전략," terryum.ai, 2026-04-24. [Um, 2026]

[37] ussumant, "ussumant/llm-wiki-compiler — Claude Code plugin: markdown knowledge → topic-based wiki," GitHub, 2026-04. [ussumant, 2026]

[38] Wang, Guanzhi et al. (2023). Voyager: An Open-Ended Embodied Agent with Large Language Models. TMLR 2024. [Wang et al., 2023]

[39] Agentic Researcher, "The Agentic Researcher: A Practical Guide to AI-Assisted Research," arXiv:2603.15914, 2026. [Agentic Researcher, 2026]

[40] Agentpedia, "Karpathy's LLM Wiki: The Complete Guide to His Idea File," Agentpedia, 2026. [Agentpedia, 2026]

[41] AIwire, "Stanford's Paper2Agent Reimagines Scientific Papers as Interactive AI Agents," HPCwire AIwire, 2025-10-10. [AIwire, 2025]

[42] Anthropic, "Claude Code memory + subagent documentation," Anthropic Docs, 2026. [Anthropic, 2026]

[43] Denser.ai, "From RAG to LLM Wiki: What Karpathy's idea means for AI knowledge bases," Denser.ai Blog, 2026. [Denser, 2026]

[44] Ghafarollahi, Alireza et al. (2024). SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning. arXiv:2409.05556. [Ghafarollahi et al., 2024]

[45] Gottweis, Juraj et al. (2025). Towards an AI co-scientist (Google AI Co-Scientist). arXiv:2502.18864. [Gottweis et al., 2025] #11

[46] HKUDS (2025). AI-Researcher: Autonomous Scientific Innovation. arXiv:2505.18705. [HKUDS, 2025]

[47] InfoQ, "Paper2Agent Converts Scientific Papers into Interactive AI Agents," InfoQ, 2025-10. [InfoQ, 2025]

[48] Izacard, Gautier et al. (2022). Atlas: Few-shot Learning with Retrieval Augmented Language Models. arXiv:2208.03299. [Izacard et al., 2022]

[49] Lala, J. et al. (2024). PaperQA2 — Language agents achieve superhuman synthesis of scientific knowledge. arXiv:2409.13740. [Lala et al., 2024]

[50] Lewis, Patrick et al. (2020). Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. NeurIPS 2020. [Lewis et al., 2020]

[51] OpenAI, "Codex /goal Command," Ralphable, 2026. [OpenAI, 2026]

[52] Stanford team (2025). Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents. arXiv:2509.06917. [Stanford, 2025]

[53] Tecton & Tide, "/goal: The Six-Hour Codex Run That Survived a Five-Hour Pause," Tecton & Tide Blog, 2026-04. [Tecton & Tide, 2026]

[54] Willison, Simon, "Codex CLI 0.128.0 adds /goal," Simon Willison's Blog, 2026-04-30. [Willison, 2026]

[55] Yamada, Yutaro et al. (2025). The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search. arXiv:2504.08066. [Yamada et al., 2025]

[56] Papers2Code, "Papers2Code — AI Research to Code," Papers2Code, 2026. [Papers2Code, 2026]

[57] Zhang, Xiangyue (2026). Deep Researcher Agent: An Autonomous Framework for 24/7 Deep Learning Experimentation. arXiv:2604.05854. [Zhang, 2026]

[58] Adam, D. (2026). The AI co-scientist is here. Nature Medicine, 2026-03-16. #11

[59] Agentic Researcher (2026). The Agentic Researcher: A Practical Guide to AI-Assisted Research. arXiv:2603.15914.

[60] Aimaker (2026). 4-Month Obsidian + LLM Wiki Longitudinal Report. Aimaker blog.

[61] Anthropic (2026). Automated Alignment Researchers — Using LLMs to scale scalable oversight. Anthropic Research, 2026-04-14. #28

[62] Astorian, L. (2026). lucasastorian/llmwiki — MCP-based LLM Wiki service. GitHub.

[63] Astro-Han (2026). Astro-Han/karpathy-llm-wiki — Agent-Skills package. GitHub.

[64] Boiko, D. A., MacKnight, R., & Gomes, G. (2023). Emergent autonomous scientific research capabilities of large language models. arXiv:2304.05332; Nature, 2023.

[65] Bowman, S. R. et al. (2022). Measuring Progress on Scalable Oversight for Large Language Models. arXiv:2211.03540.

[66] Bran, A. M. et al. (2023). ChemCrow: Augmenting large-language models with chemistry tools. arXiv:2304.05376; Nature Machine Intelligence, 2024.

[67] Brazil, R. (2026). Inside the self-driving lab revolution. Nature, 2026-03-30. #31

[68] Burns, C. et al. (2023). Weak-to-Strong Generalization. arXiv:2312.09390; ICML 2024.

[69] Bush, V. (1945). As We May Think. The Atlantic, 1945-07.

[70] Chamin, 0x (2026). Mcptube — YouTube-to-LLM-Wiki converter. GitHub.

[71] Chen, W. et al. (2023). AgentVerse: Facilitating Multi-Agent Collaboration. arXiv:2308.10848.

[72] Clark, A., & Chalmers, D. J. (1998). The Extended Mind. Analysis, 58(1), 7-19.

[73] Clark, J. (2026). Import AI 454 — Reading AAR carefully. Substack, 2026-04-20. #28

[74] Ekadetov (2026). ekadetov/llm-wiki — Obsidian plugin for Claude Code. GitHub.

[75] Ghafarollahi, A., & Buehler, M. J. (2024). SciAgents: Automating Scientific Discovery through Multi-Agent Intelligent Graph Reasoning. arXiv:2409.05556.

[76] Gottweis, J. et al. (2025). Towards an AI co-scientist. arXiv:2502.18864. #11

[77] Guan, J. et al. (2026). AI-Assisted Drug Re-Purposing for Human Liver Fibrosis. Advanced Science.

[78] Hendrycks, D. et al. (2020). Measuring Massive Multitask Language Understanding. arXiv:2009.03300; ICLR 2021.

[79] HN (2026). LLM Wiki front-page thread (item 47640875). Hacker News, 2026-04-04.

[80] Hong, S. et al. (2023). MetaGPT: Meta Programming for Multi-Agent Collaborative Framework. arXiv:2308.00352.

[81] Izacard, G. et al. (2022). Atlas: Few-shot Learning with Retrieval Augmented Language Models. arXiv:2208.03299; JMLR 2023.

[82] Jumper, J. et al. (2021). Highly accurate protein structure prediction with AlphaFold. Nature, 596: 583-589.

[83] Karpathy, A. (2026a). karpathy/autoresearch. GitHub. #30

[84] Karpathy, A. (2026b). Autoresearch first overnight run tweet. Twitter/X, 2026-03-07. #30

[85] Karpathy, A. (2026c). Autoresearch Round 1 tweet. Twitter/X, ~2026-03-09. #30

[86] Karpathy, A. (2026d). LLM Wiki gist (karpathy/442a6bf555914893e9891c11519de94f). GitHub Gist, 2026-04-04.

[87] Karpathy, A. (2026f). Farzapedia follow-up thread. Twitter/X, 2026-04-12.

[88] King, R. D. et al. (2009). The Automation of Science. Science, 324: 85-89.

[89] Langley, P. (1981). Data-Driven Discovery of Physical Laws. Cognitive Science, 5(1).

[90] Lála, J., White, A. D. et al. (2024). PaperQA2: Faster, better, free research agents. arXiv:2409.13740.

[91] Lewis, P. et al. (2020). Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. arXiv:2005.11401; NeurIPS 2020.

[92] Li, G. et al. (2023). CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society. arXiv:2303.17760; NeurIPS 2023.

[93] Lu, C. et al. (2024). The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery. arXiv:2408.06292.

[94] Madaan, A. et al. (2023). Self-Refine: Iterative Refinement with Self-Feedback. arXiv:2303.17651; NeurIPS 2023.

[95] Nature News (2026). How to build an AI scientist: first peer-reviewed paper spills the secrets. Nature.

[96] OpenAI (2026a). Codex CLI 0.128.0 changelog. GitHub.

[97] Packer, C. et al. (2023). MemGPT: Towards LLMs as Operating Systems. arXiv:2310.08560; COLM 2024.

[98] Park, J. S. et al. (2023). Generative Agents: Interactive Simulacra of Human Behavior. arXiv:2304.03442; UIST 2023.

[99] Pilon, T. et al. (2026). RoboChem-Flex: A ~$5,000 modular self-driving laboratory. Nature Synthesis. #31

[100] Rein, D. et al. (2023). GPQA: A Graduate-Level Google-Proof Q&A Benchmark. arXiv:2311.12022; COLM 2024.

[101] Schick, T. et al. (2023). Toolformer: Language Models Can Teach Themselves to Use Tools. arXiv:2302.04761; NeurIPS 2023.

[102] Schmidgall, S. et al. (2025). Evaluating Sakana's AI Scientist for Autonomous Research. arXiv:2502.14297; SIGIR Forum.

[103] Schmidt, M., & Lipson, H. (2009). Distilling Free-Form Natural Laws from Experimental Data. Science, 324: 81-85.

[104] Shen, Y. et al. (2023). HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face. arXiv:2303.17580; NeurIPS 2023.

[105] Shinn, N. et al. (2023). Reflexion: Language Agents with Verbal Reinforcement Learning. arXiv:2303.11366; NeurIPS 2023.

[106] Silver, D. et al. (2016). Mastering the game of Go with deep neural networks and tree search. Nature, 529: 484-489.

[107] Skyllwt (2026). OmegaWiki — Full research-lifecycle LLM Wiki implementation. GitHub.

[108] Srivastava, A. et al. (2022). Beyond the Imitation Game: BIG-Bench. arXiv:2206.04615; TMLR 2023.

[109] Stanford Paper2Agent team (2025). Paper2Agent: Converting Papers to MCP Servers. arXiv:2509.06917.

[110] Tecton & Tide (2026). The Six-Hour /goal Run That Survived a Five-Hour Pause. Tecton & Tide blog, 2026-05-01.

[111] The New Stack (2026). Autoresearch — the 630-line script that runs while you sleep. The New Stack.

[112] Um, T. (2025). Conductor — LLM Orchestration Patterns. terryum.ai post.

[113] Um, T. (2026). Brain Augmentation / Democratization of Research / AAR + autoresearch syntheses. terryum.ai posts (2026-03-10, 2026-04-14, 2026-04-15). #28

[114] Ussumant (2026). ussumant/llm-wiki-compiler. GitHub.

[115] Wang, G. et al. (2023). Voyager: An Open-Ended Embodied Agent with Large Language Models. arXiv:2305.16291; TMLR 2024.

[116] Wei, J. et al. (2022). Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. arXiv:2201.11903; NeurIPS 2022.

[117] Wenhao Yu (2026). A Zettelkasten user's critical review of Karpathy LLM Wiki. Personal blog.

[118] Willison, S. (2026). Codex /goal — the canonical English explainer. simonwillison.net, 2026-04-30.

[119] Wu, F. et al. (2026). Towards a Medical AI Scientist. arXiv:2603.28589. #21

[120] Wu, Q. et al. (2023). AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation. arXiv:2308.08155.

[121] Yamada, Y. et al. (2025). The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search. arXiv:2504.08066.

[122] Yao, S. et al. (2022). ReAct: Synergizing Reasoning and Acting in Language Models. arXiv:2210.03629; ICLR 2023.

[123] Yao, S. et al. (2023). Tree of Thoughts: Deliberate Problem Solving with Large Language Models. arXiv:2305.10601; NeurIPS 2023.

[124] Zhang, X. (2026). Deep Researcher Agent: An Autonomous Framework for 24/7 Deep Learning Experimentation with Zero-Cost Monitoring. arXiv:2604.05854.

[125] Karpathy, A. (2026). LLM Wiki — A pattern for building personal knowledge bases using LLMs. GitHub Gist, 2026-04-04.

[126] Karpathy, A., "LLM Wiki announcement (Twitter/X thread)," 2026-04-04. [Karpathy, 2026]

[127] Karpathy, A., "Farzapedia reply — personalization argument for LLM Wiki," 2026-04-12. [Karpathy, 2026]

[128] Karpathy, A. (2017). Software 2.0. Medium.

[129] MindStudio (2026). What Is Andrej Karpathy's LLM Wiki? How to Build a Personal Knowledge Base With Claude Code. MindStudio Blog. [MindStudio, 2026]

[130] Cognition AI (2026). llm-wiki: the reference implementation of Karpathy's self-building AI memory pattern. Cognition blog (re-syndicated). [Cognition, 2026]

[131] Denser.ai (2026). From RAG to LLM Wiki: What Karpathy's idea means for AI knowledge bases. Denser.ai blog. [Denser, 2026]

[132] Analytics Vidhya (2026). LLM Wiki Revolution: How Andrej Karpathy's Idea is Changing AI. Analytics Vidhya blog. [Analytics Vidhya, 2026]

[133] Agentpedia (2026). Karpathy's LLM Wiki: The Complete Guide to His Idea File. Agentpedia blog. [Agentpedia, 2026]

[134] Lobster Pack (2026). Karpathy's LLM Wiki and the rise of "idea files" — why sharing instructions beats sharing code. Lobster Pack blog. [Lobster Pack, 2026]

[135] WebEdge (2026). Karpathy's LLM Knowledge Base System: Full Breakdown of His CLAUDE.md Schema. MindStudio Blog (WebEdge attribution). [WebEdge, 2026]

[136] Starmorph (2026). Karpathy's LLM Wiki: Step-by-step setup guide. Starmorph blog. [Starmorph, 2026]

[137] Park, J. (2026). RAG는 잊어라, Karpathy가 제안하는 'LLM 위키'라는 새로운 지식 관리 패러다임. GeekNews / WikiDocs blog. [Park, 2026]

[138] Anthropic (2026). Claude Code documentation. Anthropic docs. [Anthropic, 2026]

[139] OpenAI (2026). Custom instructions with AGENTS.md (Codex). OpenAI Developers Portal. [OpenAI, 2026]

[140] Fulkerson, A. (2026). Karpathy's Pattern for an LLM Wiki in Production. aaronfulkerson.com blog. [Fulkerson, 2026]

[141] Aimaker (2026). AI-powered second brain from LLM Wiki — 4-month report. Aimaker Substack. [Aimaker, 2026]

[142] Hacker News community, "LLM Wiki — example of an 'idea file' (Hacker News front-page thread)," 2026-04-04. [HN, 2026]

[143] Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., et al. (2020). Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. NeurIPS 2020. arXiv:2005.11401.

[144] Karpukhin, V., Oğuz, B., Min, S., Lewis, P., Wu, L., Edunov, S., et al. (2020). Dense Passage Retrieval for Open-Domain Question Answering. EMNLP 2020. arXiv:2004.04906.

[145] Johnson, J., Douze, M., and Jégou, H. (2019). Billion-scale similarity search with GPUs (FAISS). IEEE Transactions on Big Data. arXiv:1702.08734. DOI:10.1109/TBDATA.2019.2921572.

[146] Izacard, G., Lewis, P., Lomeli, M., Hosseini, L., Petroni, F., Schick, T., et al. (2022). Atlas: Few-shot Learning with Retrieval Augmented Language Models. JMLR 2023. arXiv:2208.03299.

[147] Bush, V. (1945). As We May Think (the Memex proposal). The Atlantic Monthly, July 1945.

[148] Luhmann, N. (1992). Communicating with Slip Boxes — An Empirical Account. Universität Bielefeld (translated essay).

[149] Clark, A. and Chalmers, D. (1998). The Extended Mind. Analysis 58 (1): 7-19. DOI:10.1093/analys/58.1.7.

[150] Ahrens, S. (2017). How to Take Smart Notes. Book (CreateSpace / Independently Published).

[151] Packer, C., Wooders, S., Lin, K., Fang, V., Patil, S. G., Stoica, I., and Gonzalez, J. E. (2023). MemGPT: Towards LLMs as Operating Systems. arXiv:2310.08560.

[152] Wang, G., Xie, Y., Jiang, Y., Mandlekar, A., Xiao, C., Zhu, Y., Fan, L., and Anandkumar, A. (2023). Voyager: An Open-Ended Embodied Agent with Large Language Models. TMLR 2024. arXiv:2305.16291.

[153] Park, J. S., O'Brien, J. C., Cai, C. J., Morris, M. R., Liang, P., and Bernstein, M. S. (2023). Generative Agents: Interactive Simulacra of Human Behavior. UIST 2023. arXiv:2304.03442. DOI:10.1145/3586183.3606763.

[154] Shinn, N., Cassano, F., Berman, E., Gopinath, A., Narasimhan, K., and Yao, S. (2023). Reflexion: Language Agents with Verbal Reinforcement Learning. NeurIPS 2023. arXiv:2303.11366.

[155] Yao, S., Zhao, J., Yu, D., Du, N., Shafran, I., Narasimhan, K., and Cao, Y. (2022). ReAct: Synergizing Reasoning and Acting in Language Models. ICLR 2023. arXiv:2210.03629.

[156] Schick, T., Dwivedi-Yu, J., Dessì, R., Raileanu, R., Lomeli, M., Zettlemoyer, L., Cancedda, N., and Scialom, T. (2023). Toolformer: Language Models Can Teach Themselves to Use Tools. NeurIPS 2023. arXiv:2302.04761.

[157] Wei, J., Wang, X., Schuurmans, D., Bosma, M., Ichter, B., Xia, F., Chi, E., Le, Q., and Zhou, D. (2022). Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. NeurIPS 2022. arXiv:2201.11903.

[158] FutureHouse (2024). PaperQA2: Superhuman scientific literature search (FutureHouse announcement). FutureHouse blog. [FutureHouse, 2024]

[159] skyllwt (DAIR Lab, PKU) (2026). OmegaWiki — Wiki-centric full-lifecycle AI research platform on Claude Code. GitHub. [skyllwt, 2026]

[160] Astro-Han (2026). Astro-Han/karpathy-llm-wiki — Agent Skills-compatible LLM Wiki for Claude Code/Cursor/Codex. GitHub. [Astro-Han, 2026]

[161] ekadetov (2026). ekadetov/llm-wiki — Claude Code plugin for persistent compounding KBs in Obsidian. GitHub. [ekadetov, 2026]

[162] Yu, W. (2026). What Is Karpathy's LLM Wiki? A Zettelkasten User's Honest Review. yu-wenhao.com blog. [Yu, 2026]

[163] Infranodus (2026). Infranodus on LLM Wiki — graph DBs as the missing layer. Infranodus blog. [Infranodus, 2026]

[164] innobu (2026). Karpathy's LLM Wiki: Second Brain and the Enterprise Reality Check 2026. innobu blog. [innobu, 2026]

[165] AI Critique (2026). Andrej Karpathy's latest concept 'LLM Wiki' and the future of enterprise knowledge. AI Critique blog. [AI Critique, 2026]

[166] Critical Analyst (2026). Research gap analysis — gaps.md (internal). terry-surveys repo. [Critical Analyst, 2026]

[167] Astorian, L. (2026). lucasastorian/llmwiki — Open-source LLM Wiki with document upload + Claude MCP. GitHub. [Astorian, 2026]

[168] ussumant (2026). ussumant/llm-wiki-compiler — Claude Code plugin: markdown knowledge → topic-based wiki. GitHub. [ussumant, 2026]

[169] 0xchamin (2026). Mcptube — Karpathy's LLM Wiki applied to YouTube (transcripts + vision frames). GitHub + Hacker News Show HN. [0xchamin, 2026]

[170] Astorian, L. and Hacker News community, "Show HN: LLM Wiki — Open-Source Implementation of Karpathy's LLM Wiki (lucasastorian)," 2026-04. [HN, 2026]

[171] Hacker News community, "Show HN: A Karpathy-style LLM wiki your agents maintain (Markdown and Git)," 2026-05. [HN, 2026]

[172] 0xchamin and Hacker News community, "Show HN: Mcptube — Karpathy's LLM Wiki idea applied to YouTube videos," 2026-04. [HN, 2026]

[173] Starmorph (2026). Karpathy's LLM Wiki — Full Beginner Setup Guide (video). YouTube. [Starmorph, 2026]

[174] Data Science Dojo (2026). The LLM Wiki Pattern by Andrej Karpathy — 5-paper, 30-minute tutorial. Data Science Dojo blog. [Data Science Dojo, 2026]

[175] Joshi, U. (2026). Andrej Karpathy's LLM Wiki: Create your own knowledge base. Medium. [Joshi, 2026]

[176] Global Advisors / Quantified Strategy Consulting (2026). Term: LLM Wiki — Andrej Karpathy. Global Advisors blog. [Global Advisors, 2026]

[177] TiddlyWiki community (2026). Riding the wave of Andrej Karpathy's 'LLM Wiki' (Talk TW). TiddlyWiki Talk forum. [TiddlyWiki, 2026]

[178] Herk, N. (2026). Karpathy 10x'ed Claude Code (LLM Wiki framing video). YouTube. [Herk, 2026]

[179] Paige (2026). Second-brain setup using Karpathy's LLM Wiki (video). YouTube. [Paige, 2026]

[180] Clark, J. (2026). Import AI 454: Automating alignment research. Import AI newsletter. [Clark, 2026]

[181] Boiko, D. A., MacKnight, R., Kline, B., and Gomes, G. (2023). Emergent autonomous scientific research capabilities of large language models. Nature 624, 570-578. arXiv:2304.05332. [Boiko et al., 2023]

[182] Schmidgall, S., Su, Y., Wang, Z., Sun, X., Wu, J., Yu, X., Liu, J., Liu, Z., and Barsoum, E. (2025). Evaluating Sakana's AI Scientist for Autonomous Research. arXiv:2502.14297. [Schmidgall et al., 2025]

[183] Sakana AI (2025). The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search. arXiv:2504.08066. [Sakana, 2025]

[184] Gottweis, J., Weng, W.-H., Daryin, A., Tu, T., Palepu, A., Sirkovic, P., et al. (2025). Towards an AI co-scientist. arXiv:2502.18864. [Gottweis et al., 2025] #11

[185] Lála, J., O'Donoghue, O., Shtedritski, A., Cox, S., Rodriques, S. G., and White, A. D. (2024). PaperQA2 — Language agents achieve superhuman synthesis of scientific knowledge. arXiv:2409.13740. [Lála et al., 2024]

[186] Karpathy, A. (2026). karpathy/autoresearch — AI agents running research on single-GPU nanochat training. GitHub. [Karpathy, 2026] #30

[187] Hacker News community (2026). Show HN: A Karpathy-style LLM wiki your agents maintain. [HN, 2026]

[188] Willison, S. (2026). Notes on Codex /goal. simonwillison.net. [Willison, 2026]

[189] Bran, A. M., Cox, S., Schilter, O., Baldassari, C., White, A. D., & Schwaller, P. (2023). ChemCrow: Augmenting large-language models with chemistry tools. arXiv:2304.05376; Nature Machine Intelligence 2024.

[190] Google AI (2025). Accelerating scientific breakthroughs with an AI co-scientist. Google Research blog, 2025-02-19. #11

[191] Lu, C., Lu, C., Lange, R. T., Foerster, J., Clune, J., & Ha, D. (2024). The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery. arXiv:2408.06292.

[192] PsyPost (2026). Google's AI co-scientist just solved a biological mystery that took humans a decade. PsyPost. #11

[193] Sakana AI (2025). SakanaAI/AI-Scientist-ICLR2025-Workshop-Experiment — Code release. GitHub.

[194] Shen, Y., Song, K., Tan, X., Li, D., Lu, W., & Zhuang, Y. (2023). HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face. arXiv:2303.17580; NeurIPS 2023.

[195] Shinn, N., Cassano, F., Berman, E., Gopinath, A., Narasimhan, K., & Yao, S. (2023). Reflexion: Language Agents with Verbal Reinforcement Learning. arXiv:2303.11366; NeurIPS 2023.

[196] Sun, J. et al. (2023). Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph. arXiv:2307.07697; ICLR 2024.

[197] Um, T. (2026a). AI Co-Scientist 요약 + 분석. terryum.ai paper post. #11

[198] Yang, J. (2023). Auto-GPT: An Autonomous GPT-4 Experiment. GitHub.

[199] Yao, S., Yu, D., Zhao, J., Shafran, I., Griffiths, T. L., Cao, Y., & Narasimhan, K. (2023). Tree of Thoughts: Deliberate Problem Solving with Large Language Models. arXiv:2305.10601; NeurIPS 2023.

[200] Yao, S., Zhao, J., Yu, D., Du, N., Shafran, I., Narasimhan, K., & Cao, Y. (2022). ReAct: Synergizing Reasoning and Acting in Language Models. arXiv:2210.03629; ICLR 2023.

[201] Bowman, S. R., Hyun, J., Perez, E., Chen, E., Pettit, C., Heiner, S., et al. (2022). Measuring Progress on Scalable Oversight for Large Language Models. arXiv:2211.03540.

[202] Burns, C., Izmailov, P., Kirchner, J. H., Baker, B., Gao, L., Aschenbrenner, L., Chen, Y., Ecoffet, A., Joglekar, M., Leike, J., Sutskever, I., & Wu, J. (2023). Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision. arXiv:2312.09390; ICML 2024.

[203] BSWEN (2026). What Results Did 700 Autoresearch Experiments Achieve Overnight? BSWEN Medium, 2026-03-30. [BSWEN, 2026]

[204] FutureHouse (2024a). Engineering Blog: Journey to superhuman performance on scientific tasks. FutureHouse blog.

[205] FutureHouse (2024b). PaperQA2 — FutureHouse Cookbook entry. FutureHouse Cookbook.

[206] FutureHouse (2024c). PaperQA2: Superhuman scientific literature search (WikiCrow announcement). FutureHouse blog.

[207] Karpathy, A. (2026b). karpathy/autoresearch. GitHub. #30

[208] Karpathy, A. (2026c). Autoresearch Round 1 tweet — 700 experiments / 11% Time-to-GPT-2 reduction. X (Twitter). #30

[209] Karpathy, A. (2026d). Autoresearch first-run tweet — 12h / 110 changes on nanochat. X (Twitter). #30

[210] Karpathy, A. (2026e). karpathy/nanochat. GitHub.

[211] Lála, J., Skarlinski, M., White, A. D., et al. (2024). PaperQA2 — Language agents achieve superhuman synthesis of scientific knowledge. arXiv:2409.13740.

[212] Stanford (2025). Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents. arXiv:2509.06917.

[213] The New Stack (2026). Andrej Karpathy's 630-line Python script ran 50 experiments overnight without any human input. The New Stack.

[214] Um, T. (2026a). autoresearch 요약 + 분석. terryum.ai paper post.

[215] Um, T. (2026b). AAR (Automated Alignment Researchers) 요약 + 분석. terryum.ai paper post. #28

[216] Wu, H., Zheng, B., Song, D., Jiang, Y., Gao, J., Xing, L., Sun, L., & Yuan, Y. (2026). Towards a Medical AI Scientist. arXiv:2603.28589. #21

[217] Adam, D. (2026). The AI co-scientist is here (Nature Medicine feature). Nature Medicine. DOI:10.1038/s41591-026-04275-z. #11

[218] HIMS (2026). RoboChem Flex: democratisation of the autonomous synthesis robot. HIMS, University of Amsterdam, 2026. #31

[219] Karpathy, A. (2026). karpathy/autoresearch. GitHub. #30

[220] King, R. D., Rowland, J., Oliver, S. G., Young, M., Aubrey, W., Byrne, E., Liakata, M., Markham, M., Pir, P., Soldatova, L. N., Sparkes, A., Whelan, K. E., & Clare, A. (2009). The Automation of Science. Science 324(5923):85–89. DOI:10.1126/science.1165620.

[221] Phys.org (2026). Low-cost robotic chemistry system can be built and deployed in any lab. Phys.org, 2026-04.

[222] Pilon, S. et al., Noël, T. (2026). A flexible and affordable self-driving laboratory for automated reaction optimization (RoboChem-Flex). Nature Synthesis. DOI:10.1038/s44160-026-01053-0. #31

[223] QPillars (2026). Self-Driving Labs in 2026 — What Actually Works vs. What's Still Hype. QPillars blog. #31

[224] Um, T. (2026a). Medical AI Scientist 요약 + 분석. terryum.ai paper post. #21

[225] Um, T. (2026b). Self-Driving Labs 요약 + 분석. terryum.ai paper post. #31

[226] Anthropic (2026). Claude Code memory + subagent documentation. Anthropic Developer Docs.

[227] Karpathy, A., Y. He, X. Lee, et al. (2026). LLM Wiki — A pattern for building personal knowledge bases using LLM agents. GitHub Gist, 2026-04-04.

[228] Karpathy, A. (2026). Farzapedia reply — personalization argument for LLM Wiki. X (Twitter), 2026.

[229] OpenAI (2026). Custom instructions with AGENTS.md. OpenAI Codex Docs.

[230] OpenAI Codex Team (2026). Codex CLI 0.128.0 release notes. OpenAI Codex Changelog, 2026-04-30.

[231] Park, J. (GeekNews) (2026). RAG는 잊어라, Karpathy가 제안하는 'LLM 위키'라는 새로운 지식 관리 패러다임. GeekNews, 2026. [GeekNews / Park, 2026]

[232] Stanford Paper2Agent Team (2025). Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents. arXiv:2509.06917.

[233] Tecton and Tide (2026). /goal: The Six-Hour Codex Run That Survived a Five-Hour Pause. Tecton and Tide Blog. [Tecton and Tide, 2026]

[234] Um, T. (terryum) (2026). Claude Code → Codex 이관 전략. terryum.ai post, 2026-04-24. [Um, 2026]

[235] Willison, S. (2026). Codex CLI 0.128.0 adds /goal. Simon Willison's Weblog, 2026-04-30.

[236] Lu, C., Lu, C., et al. (2024). The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery. arXiv:2408.06292.

[237] Um, T. (terryum) (2026). Brain Augmentation — manifesto for AI-era self-generating knowledge environments. terryum.ai post #7, 2026-03-10. [Brain Augmentation, 2026; Um, 2026]

[238] Um, T. (terryum) (2026). Democratization of Research — three stages (document → in silico → physical). terryum.ai post #25, 2026-04-15. [Democratization of Research, 2026]

[239] Um, T. (terryum) (2026). AAR (Automated Alignment Researchers) 요약 + 분석. terryum.ai paper post, 2026. #28

[240] Um, T. (terryum) (2026). autoresearch 요약 + 분석. terryum.ai paper post, 2026.

[241] Um, T. (terryum) (2026). AI Co-Scientist 요약 + 분석. terryum.ai paper post, 2026. #11

[242] Um, T. (terryum) (2026). Self-Driving Labs 요약 + 분석. terryum.ai paper post, 2026. #31

[243] Um, T. (terryum) (2026). Medical AI Scientist 요약 + 분석. terryum.ai paper post, 2026. #21

[244] Um, T. (terryum) (2026). Harnessing Claude Intelligence. terryum.ai paper post, 2026.

[245] Um, T. (terryum) (2026). Meta-Harness Optimization. terryum.ai paper post, 2026. #22

[246] Um, T. (terryum) (2025). Conductor — LLM orchestration patterns. terryum.ai paper post, 2025.

[247] Wu, H., Zheng, B., et al. (2026). Towards a Medical AI Scientist. arXiv:2603.28589. #21

[248] Gottweis, J., et al. (2025). Towards an AI co-scientist (Google AI Co-Scientist). arXiv:2502.18864. #11

[249] Schmidgall, S., et al. (2025). Evaluating Sakana's AI Scientist for Autonomous Research. arXiv:2502.14297.

[250] Pilon, S., et al. (2026). A flexible and affordable self-driving laboratory for automated reaction optimization. Nature Synthesis, 2026. #31

[251] The New Stack (2026). Karpathy's AutoResearch Ran 700 ML Experiments in 2 Days Without Human Input. Reported by Um, T., terryum.ai, 2026. [The New Stack, 2026]

[252] Um, T. (terryum) (2026). Democratization of Research — three stages. terryum.ai post #25, 2026-04-15. [Democratization of Research, 2026]

[253] Um, T. (terryum) (2026). AAR summary and analysis. terryum.ai paper post, 2026. #28

[254] Adam, D. (2026). The AI co-scientist is here. Nature Medicine Feature, 2026-03-16. #11

[255] Guan, Y., et al. (2026). Independent wet-lab replication of liver fibrosis target validation. Reported on terryum.ai paper post, 2026. [Guan et al., 2026] #31

[256] Zhang, S., et al. (2026). Deep Researcher Agent — Think/Execute/Monitor/Reflect with zero-cost monitoring. Reported via terryum.ai, 2026. [Zhang et al., 2026]

[257] Restrepo, G. (2026). Expanding diversity in chemical space. Nature Chemistry, 2026-03-19. [Restrepo, 2026]

감사의 글

이 책은 저자의 블로그 포스트 #7 'Brain Augmentation'과 #25 '연구의 민주화', 그리고 한 달 전 만든 서베이 'Claude Code에서 Codex로'의 Part IV(ch10-12)를 출발점으로 한다.

Andrej Karpathy의 2026-04-04 LLM Wiki gist 공개와 그 후 한 달 반 동안 폭발한 OSS·블로그·영상 생태계, Anthropic의 Automated Alignment Researchers(2026-04), Google의 AI Co-Scientist(2025-02), Sakana AI의 The AI Scientist v1(2024-08) 계보가 이 책의 뼈대다.

Hacker News, Reddit r/LocalLLaMA·r/ClaudeAI, GeekNews의 한국어 토론과 'Karpathy's LLM Wiki Full Beginner Setup Guide' 계열 영상들이 Part II의 매트릭스가 되었다.

이 프로젝트는 황민호님의 Harness 스킬을 이용하여 제작되었습니다.

이 저작물의 제작에 AI 도구가 활용되었습니다. 문헌 조사, 콘텐츠 생성, 원고 작성에 Claude(Opus 4.6)를 사용하였습니다.