Curated Links
Tools, Models & Courses
Curated links to GitHub repos, model hubs, and learning resources. Filter by category or browse all.
Sphinx
AI copilot for data science. One prompt runs the full workflow: EDA, cleaning, feature engineering, model selection and fitting, evaluation, and feature importance—in about 5 minutes. Free to get started.
sphinx.ai
Lovable
AI-powered app builder. Design in the browser, export to GitHub.
lovable.dev
Cursor
AI-first code editor. Built on VS Code, with Copilot-style assistance.
cursor.com
Claude Code
Command-line coding agent. Scaffold and edit projects from natural language.
claude.com
LLM Course by Maxime Labonne
Large language model course. From basics to RAG, agents, and fine-tuning.
GitHub
Awesome ML
Curated list of ML resources. Frameworks, papers, and tools.
GitHub
PydanticAI
Structured AI agents in Python. Type-safe, testable agent workflows.
GitHub
LangChain
Framework for LLM applications. Chains, agents, and integrations.
GitHub
PAL MCP
Provider-agnostic MCP server that turns your AI CLI or IDE into a coordinator for multiple models: spawn isolated sub-agents, run cross-model debates and code reviews, and hand off full context between models for planning and implementation.
GitHub
AGENTS.md
Open standard for AI coding agents: a consistent location for project setup, build steps, tests, and coding conventions. Used by 60,000+ open-source projects; keeps READMEs clean while improving agent reliability across tools.
agents.md
Promptify
Developer-friendly NLP wrapper for LLMs. Run NER, classification, and more with minimal code and zero training data. Converts unstructured model output into reliable, structured Python objects for production.
GitHub
MiniMax-M1
Open-weight reasoning model: large MoE backbone with hybrid attention, up to 1M-token context, less test-time compute than comparable models. Strong in software engineering and extended-input settings.
GitHub
collaborating-with-codex
Agent Skill that lets Claude delegate coding tasks to the OpenAI Codex CLI for multi-model collaboration. Claude coordinates and refines; Codex handles implementation, debugging, and code analysis in a sandbox.
GitHub
Happy Coder
Mobile, web, and CLI client for Claude Code and Codex. Run and monitor from anywhere with E2E encryption, push notifications when the agent needs attention, and seamless switching between desktop and phone.
GitHub
Docker Sandboxes for Claude Code
Run Claude Code in an isolated, reproducible Docker environment without changing how you use the CLI. Sandboxed file access and credentials for security and reliability; supports all Claude Code options.
Docker Docs
Playwriter MCP
Lets AI agents control your Chrome browser via a lightweight extension using the full Playwright API with minimal context. Reliable browser automation: screenshots, flow validation, logged-in pages—no custom automation to maintain.
GitHub
oh-my-claude-sisyphus
Claude Code plugin for native multi-agent orchestration: specialized subagents, hooks, and slash commands for parallel coding tasks. Automates delegation, search, planning, and “keep going until done” workflows with smart model routing.
GitHub
dlt Fundamentals
Course from dlthub on building robust ELT pipelines. Includes a holiday lesson (Dec 22) on integrating LLMs into your workflow, with 50 swag packs to compete for.
dlthub.com
FreeCodeCamp Course Certification: Data Engineering with Python and AI/LLMs – Data Loading Tutorial
Learn how to build modern, scalable data pipelines using Python and AI-assisted tools. This hands-on tutorial starts from the basics of data ingestion and takes you all the way to advanced techniques in data loading, transformation, deployment, and automation. By freeCodeCamp, featuring Alexey Grigoriev and Adrian Brudaru.
freeCodeCamp / dlthub
AI Agents Email Crash-Course (Cohort Edition)
Free cohort-based version running December and January. Complete the project and review three other submissions to receive a certificate of completion signed by Alexey.
alexeygrigorev.com
Claude Use Cases
Curated library of real-world Claude use cases across research, writing, coding, analysis, and everyday work. Organized by role, industry, and feature with concrete, end-to-end examples.
claude.com
AI Engineering Hub
Large open-source GitHub repo: 90+ production-ready projects, tutorials, and reference implementations for LLMs, RAG, agents, MCP, multimodal systems, and evaluation. Structured by difficulty and use case.
GitHub
Agentic AI Crash Course
A free introductory crash course on agentic AI that explains how modern AI agents work in practice, from tools and RAG to memory, planning, MCP, and multi-agent systems. Designed as a clear, realistic starting point focused on real-world system design and limitations rather than hype.
GitHub
Assignments for CS146S: The Modern Software Developer
Programming assignments for CS146S: The Modern Software Developer (Stanford University), focused on AI-assisted software development. Includes hands-on work with modern tooling and workflows like LLM-based coding, testing, and documentation (Fall 2025).
GitHub
Data Engineering Zoomcamp
New cohort starts on January 12, 2026. A free 9-week course on building production-ready data pipelines: ingestion, orchestration, warehousing, analytics, and more.
GitHub
LLM Fine-Tuning roadmap
Curated resource for practitioners: core fine-tuning concepts, transformer internals, training infrastructure, data preparation, PEFT and alignment methods, and tools for training and deploying LLMs.
GitHub
Claude Code and Large-Context Reasoning
Materials from Tim Warner's O'Reilly Live Learning course: production-ready AI-assisted development with Claude Code, large-context reasoning, MCP-based memory, agents, and custom skills. Code review, automation, and CI/CD examples.
O'Reilly
awesome-slash
Curated list of tools, patterns, and projects built around slash-command interfaces. Practical reference for command-driven workflows, bots, and developer tools.
GitHub
astronomer/agents
Open-source agent skills for data engineering: 13 skills to extend and automate data workflows with AI agents.
GitHub
500+ AI Agent Projects
Curated collection of AI agent use cases across healthcare, finance, education, retail, and more. Maps practical applications to open-source implementations and frameworks (CrewAI, AutoGen, Agno, LangGraph). Hands-on inspiration hub for builders and practitioners.
GitHub
PaperBanana
Agentic framework that automates publication-ready methodology diagrams and statistical plots directly from paper text, references, or rough sketches—optimized for scientific accuracy and visual consistency.
GitHub
Dexter
Autonomous financial research agent that plans, executes, and validates analysis using real-time market data, with safety features like loop detection and step limits.
GitHub
Pro Workflow
Battle-tested AI coding practices for Claude Code and Cursor to keep an effective 80/20 AI-to-review ratio using disciplined context management and intentional review rituals.
GitHub
You Could've Invented OpenClaw
Tutorial by Nader Dabit for building a persistent AI assistant from scratch (starting with a Telegram bot + Anthropic API) and iteratively adding sessions, memory, tool use, and scheduled tasks.
Gist
CS336: Language Modeling from Scratch
Implementation-heavy Stanford course that walks through building a language model end to end: data processing and tokenization, training, systems optimization, scaling laws, and alignment. Includes Python lecture scripts and slides, five programming assignments with public leaderboards, handouts, starter code, and the full schedule.
Stanford University
planning-with-files
Claude Code skill that turns long, complex tasks into a file-based workflow using persistent Markdown files. Stores plans, progress, and errors on disk to reduce goal drift and repeated mistakes across many tool calls or sessions.
GitHub
CLAUDE_CODEX_SKILL
Workflow showing how to use the OpenAI Codex CLI for deep, non-interactive debugging via a file-based question-and-answer process: write a detailed problem description and full code context into a file, then let Codex produce a structured written analysis.
Gist
Materials from my talk, "A Day of an AI Engineer"
Practical resource that combines a live webinar recording with the full planned write-up and a demo project. It also compares the AI Engineer role to traditional data team roles and maps modern AI projects to CRISP-DM.
Google Docs
Locality Sensitive Hashing: The Illustrated Guide
In-depth tutorial from Pinecone's FAISS learning series that covers LSH (theory + Python implementation) for approximate nearest-neighbor search.
Pinecone
Atomic Agents
Lightweight, modular framework for building AI agent pipelines emphasizing atomic, single-purpose components that are reusable, composable, and predictable (built on Instructor and Pydantic).
GitHub
Gas Town
Orchestration system for managing multiple Claude Code instances simultaneously. Coordinates parallel tasks, tracks work across agent instances, manages merge queues, and keeps persistent agent identities (tmux-based workflow).
GitHub