Artificial Intelligence
Awesome Artificial Intelligence¶
A curated collection of must-use, actively maintained resources for building and shipping AI systems.
Focus: AI engineering (RAG, agents, evals, guardrails, deploy) plus the best books, guides, papers, and a carefully selected set of tools.

π Learn¶
Deep, durable knowledge β still valuable five years from now.
Books¶
Modern & Practical - Designing Machine Learning Systems β Scalable, maintainable ML pipelines (Chip Huyen). - AI Engineering β End-to-end AI product building (Chip Huyen). - Build a Large Language Model from Scratch β Transformers in raw PyTorch, layer by layer (Sebastian Raschka). - Hands-On Large Language Models β Visual + practical guide to LLM applications (Jay Alammar, Maarten Grootendorst). - LLM Engineer's Handbook β Production LLMOps: fine-tuning, quantization, serving (Labonne, Iusztin). - The 100-Page Language Models Book β Concise, math-grounded path from n-grams to transformers (Andriy Burkov). - Generative Deep Learning (2nd Edition) β GANs, VAEs, diffusion models (David Foster).
Foundational - Artificial Intelligence: A Modern Approach β The canonical AI theory text (Russell, Norvig). - Deep Learning β Mathematical foundations of neural networks (Goodfellow, Bengio, Courville). - Deep Learning: Foundations and Concepts β Bishop's 2024 update; probability-grounded modern DL (Bishop & Bishop). - Understanding Deep Learning β Math + intuition + Python notebooks (Simon Prince). - Speech and Language Processing (3rd Edition) β The NLP reference, kept current through the deep learning era (Jurafsky, Martin). - Reinforcement Learning: An Introduction (2nd Edition) β RL foundations (Sutton, Barto).
Courses¶
Beginner - Google Generative AI Learning Path - Hugging Face LLM Course - Fast.ai β Practical Deep Learning
Intermediate / Advanced - Stanford CS324: Large Language Models - Full Stack Deep Learning - MIT 6.S191: Intro to Deep Learning
Focused - DeepLearning.AI Short Courses - Google DeepMind β Introduction to Reinforcement Learning - Karpathy β Neural Networks: Zero to Hero
Landmark Papers¶
Research that shaped modern AI β worth reading to understand the "why" behind today's architectures. - Attention Is All You Need β Transformer architecture. - Scaling Laws for Neural Language Models β Model/data/compute scaling. - Language Models are Few-Shot Learners β GPT-3 capabilities. - Constitutional AI β Safer model alignment.
π Build¶
The toolchain for building with AI. Personal note: you don't need tons of frameworks β start with simple LLM calls and work up.
Guides & Playbooks¶
- Building Effective Agents (Anthropic) β β Patterns, pitfalls, and tradeoffs for designing AI agents.
- OpenAI Agents Guide β Practical guide on building agents.
- Google AI Agents Paper β Practical guide to building AI agents from Google.
- Google Agents Companion Paper β Companion guide from Google.
- OpenAI Cookbook β Example code, recipes, and best practices for working with OpenAI APIs.
- LLM Engineer Handbook β A goldmine of useful links for AI engineers.
Frameworks¶
- PocketFlow β Extremely minimalist AI agent framework in just 100 lines of code. Fantastic way to learn.
- Google ADK β Google's Agent Development Kit (Python, Java). Great local development experience + A2A + MCP.
- Pydantic-AI β Typed, structured LLM orchestration framework built on Pydantic models for safe, predictable outputs.
- LangGraph β Build multi-agent workflows with stateful graphs on top of LangChain.
- CrewAI β Agent orchestration with structured tasks and human-in-the-loop controls.
- AutoGen β Microsoft's framework for multi-agent conversation and collaboration.
- LlamaIndex β Data framework for ingesting, indexing, and querying private data with LLMs.
- Haystack β Open-source search/RAG framework with modular pipelines.
- Docling β Great library for ingesting any kind of document for RAG β
Evals¶
- OpenAI Evals β OpenAI's framework for writing evals.
IDEs¶
- Cursor β LLM-powered IDE for multi-file edits and codebase-aware chat.
- GitHub Copilot β In-IDE code completion, chat, and refactors.
π€ Agents¶
Harnesses that turn LLMs into autonomous workers. The model is swappable; the harness is the product.
Coding¶
For live capability comparison, see Terminal-Bench and SWE-bench.
- Claude Code β Anthropic's CLI agent; multi-file codebase refactoring with long context.
- Codex CLI β OpenAI's Rust-based local terminal agent; lightweight and fast.
- Gemini CLI β Google's official open-source terminal agent; long-context repo exploration.
- Cursor CLI β Cursor's terminal-native agent with sandboxed permissions.
- Aider β Git-integrated pair programming with surgical edits and undo.
- OpenCode β Provider-agnostic terminal harness with a strong TUI.
- OpenHands β Open-source autonomous SWE platform; browser + shell + editor loop.
- Cline β Open-source agentic IDE extension with strong multi-provider support.
- Continue β Open-source IDE + CLI assistant with source-controlled rules.
- Goose β Block's extensible, MCP-driven local agent.
- Factory Droid β Benchmark-leading multi-model harness with BYOK local execution.
- Amp β Sourcegraph's commercial agentic coding tool with strong product UX.
- Mistral Vibe β Mistral's agentic coding CLI, powered by Devstral.
- Qwen Code β Alibaba's terminal coding agent, optimized for Qwen models.
- Pi β Highly customizable terminal harness; minimal base prompt, extension-driven.
- Nanocoder β Private, local-first agent for Ollama and LM Studio.
- Kilo CLI β Multi-mode agent with a unified gateway to 500+ models.
π§ Models¶
State-of-the-art models by modality.
π¬ Language¶
The major frontier labs.
- ChatGPT β Best for general reasoning, tool use, and the broadest ecosystem.
- Claude β Best for long-context analysis, coding, and structured thinking.
- Gemini β Best for multimodal tasks and Google ecosystem integration.
- Grok β Best for real-time information via X and very long context.
- Llama β Best open-weight family for self-hosting and fine-tuning.
- Mistral β Best for lightweight, high-performance open-weight models.
- DeepSeek β Best for cost-efficient reasoning with open weights.
- Qwen β Best for multilingual and Chinese-first applications.
- Kimi β Best for long-context instruction following.
- GLM β Frontier-tier Chinese model with open weights.
- Cohere β Best for enterprise LLMs with strong retrieval-augmented generation APIs.
πΌ Image¶
- GPT Image β OpenAI's integrated image generation with near-perfect text rendering.
- Midjourney β Artistic and photorealistic images.
- Adobe Firefly β Integrated into Creative Cloud; commercial-safe.
- Ideogram β Precise, legible text in generated images.
- Flux β High-res, prompt-editable, open-weight images.
π₯ Video¶
- Google Veo β High-quality video with synchronized audio.
- Runway β Video editing + generation with granular creative control.
- Kling β Cinematic, realistic video generation.
π Audio¶
- ElevenLabs β High-quality text-to-speech and voice cloning.
- Suno β AI music from text prompts.
π Compare¶
Live benchmarks, pricing, and the latest model versions. - OpenRouter β Unified API + live pricing across ~300 models. - LMArena β Human-preference Elo rankings for text, image, and video. - Artificial Analysis β Speed, price, and quality benchmarks across providers.
π‘ Follow¶
Stay current without drowning in noise.