Curated Links

Tools, Models & Courses

Curated links to GitHub repos, model hubs, and learning resources. Filter by category or browse all.

Tools

Sphinx

AI copilot for data science. One prompt runs the full workflow: EDA, cleaning, feature engineering, model selection and fitting, evaluation, and feature importance—in about 5 minutes. Free to get started.

sphinx.ai

Tools

Lovable

AI-powered app builder. Design in the browser, export to GitHub.

lovable.dev

Tools

Cursor

AI-first code editor. Built on VS Code, with Copilot-style assistance.

cursor.com

Tools

Claude Code

Command-line coding agent. Scaffold and edit projects from natural language.

claude.com

Courses

LLM Course by Maxime Labonne

Large language model course. From basics to RAG, agents, and fine-tuning.

GitHub

Other

Awesome ML

Curated list of ML resources. Frameworks, papers, and tools.

GitHub

Tools

PydanticAI

Structured AI agents in Python. Type-safe, testable agent workflows.

GitHub

Tools

LangChain

Framework for LLM applications. Chains, agents, and integrations.

GitHub

Tools

PAL MCP

Provider-agnostic MCP server that turns your AI CLI or IDE into a coordinator for multiple models: spawn isolated sub-agents, run cross-model debates and code reviews, and hand off full context between models for planning and implementation.

GitHub

Other

AGENTS.md

Open standard for AI coding agents: a consistent location for project setup, build steps, tests, and coding conventions. Used by 60,000+ open-source projects; keeps READMEs clean while improving agent reliability across tools.

agents.md

Tools

Promptify

Developer-friendly NLP wrapper for LLMs. Run NER, classification, and more with minimal code and zero training data. Converts unstructured model output into reliable, structured Python objects for production.

GitHub

Models

MiniMax-M1

Open-weight reasoning model: large MoE backbone with hybrid attention, up to 1M-token context, less test-time compute than comparable models. Strong in software engineering and extended-input settings.

GitHub

Tools

collaborating-with-codex

Agent Skill that lets Claude delegate coding tasks to the OpenAI Codex CLI for multi-model collaboration. Claude coordinates and refines; Codex handles implementation, debugging, and code analysis in a sandbox.

GitHub

Tools

Happy Coder

Mobile, web, and CLI client for Claude Code and Codex. Run and monitor from anywhere with E2E encryption, push notifications when the agent needs attention, and seamless switching between desktop and phone.

GitHub

Tools

Docker Sandboxes for Claude Code

Run Claude Code in an isolated, reproducible Docker environment without changing how you use the CLI. Sandboxed file access and credentials for security and reliability; supports all Claude Code options.

Docker Docs

Tools

Playwriter MCP

Lets AI agents control your Chrome browser via a lightweight extension using the full Playwright API with minimal context. Reliable browser automation: screenshots, flow validation, logged-in pages—no custom automation to maintain.

GitHub

Tools

oh-my-claude-sisyphus

Claude Code plugin for native multi-agent orchestration: specialized subagents, hooks, and slash commands for parallel coding tasks. Automates delegation, search, planning, and “keep going until done” workflows with smart model routing.

GitHub

Courses

dlt Fundamentals

Course from dlthub on building robust ELT pipelines. Includes a holiday lesson (Dec 22) on integrating LLMs into your workflow, with 50 swag packs to compete for.

dlthub.com

Courses

FreeCodeCamp Course Certification: Data Engineering with Python and AI/LLMs – Data Loading Tutorial

Learn how to build modern, scalable data pipelines using Python and AI-assisted tools. This hands-on tutorial starts from the basics of data ingestion and takes you all the way to advanced techniques in data loading, transformation, deployment, and automation. By freeCodeCamp, featuring Alexey Grigoriev and Adrian Brudaru.

freeCodeCamp / dlthub

Courses

AI Agents Email Crash-Course (Cohort Edition)

Free cohort-based version running December and January. Complete the project and review three other submissions to receive a certificate of completion signed by Alexey.

alexeygrigorev.com

Other

Claude Use Cases

Curated library of real-world Claude use cases across research, writing, coding, analysis, and everyday work. Organized by role, industry, and feature with concrete, end-to-end examples.

claude.com

Other

AI Engineering Hub

Large open-source GitHub repo: 90+ production-ready projects, tutorials, and reference implementations for LLMs, RAG, agents, MCP, multimodal systems, and evaluation. Structured by difficulty and use case.

GitHub

Courses

Agentic AI Crash Course

A free introductory crash course on agentic AI that explains how modern AI agents work in practice, from tools and RAG to memory, planning, MCP, and multi-agent systems. Designed as a clear, realistic starting point focused on real-world system design and limitations rather than hype.

GitHub

Courses

Assignments for CS146S: The Modern Software Developer

Programming assignments for CS146S: The Modern Software Developer (Stanford University), focused on AI-assisted software development. Includes hands-on work with modern tooling and workflows like LLM-based coding, testing, and documentation (Fall 2025).

GitHub

Courses

Data Engineering Zoomcamp

New cohort starts on January 12, 2026. A free 9-week course on building production-ready data pipelines: ingestion, orchestration, warehousing, analytics, and more.

GitHub

Courses

LLM Fine-Tuning roadmap

Curated resource for practitioners: core fine-tuning concepts, transformer internals, training infrastructure, data preparation, PEFT and alignment methods, and tools for training and deploying LLMs.

GitHub

Courses

Claude Code and Large-Context Reasoning

Materials from Tim Warner's O'Reilly Live Learning course: production-ready AI-assisted development with Claude Code, large-context reasoning, MCP-based memory, agents, and custom skills. Code review, automation, and CI/CD examples.

O'Reilly

Other

awesome-slash

Curated list of tools, patterns, and projects built around slash-command interfaces. Practical reference for command-driven workflows, bots, and developer tools.

GitHub

Tools

astronomer/agents

Open-source agent skills for data engineering: 13 skills to extend and automate data workflows with AI agents.

GitHub

Other

500+ AI Agent Projects

Curated collection of AI agent use cases across healthcare, finance, education, retail, and more. Maps practical applications to open-source implementations and frameworks (CrewAI, AutoGen, Agno, LangGraph). Hands-on inspiration hub for builders and practitioners.

GitHub

Tools

PaperBanana

Agentic framework that automates publication-ready methodology diagrams and statistical plots directly from paper text, references, or rough sketches—optimized for scientific accuracy and visual consistency.

GitHub

Tools

Dexter

Autonomous financial research agent that plans, executes, and validates analysis using real-time market data, with safety features like loop detection and step limits.

GitHub

Tools

Pro Workflow

Battle-tested AI coding practices for Claude Code and Cursor to keep an effective 80/20 AI-to-review ratio using disciplined context management and intentional review rituals.

GitHub

Courses

You Could've Invented OpenClaw

Tutorial by Nader Dabit for building a persistent AI assistant from scratch (starting with a Telegram bot + Anthropic API) and iteratively adding sessions, memory, tool use, and scheduled tasks.

Gist

Courses

CS336: Language Modeling from Scratch

Implementation-heavy Stanford course that walks through building a language model end to end: data processing and tokenization, training, systems optimization, scaling laws, and alignment. Includes Python lecture scripts and slides, five programming assignments with public leaderboards, handouts, starter code, and the full schedule.

Stanford University

Tools

planning-with-files

Claude Code skill that turns long, complex tasks into a file-based workflow using persistent Markdown files. Stores plans, progress, and errors on disk to reduce goal drift and repeated mistakes across many tool calls or sessions.

GitHub

Tools

CLAUDE_CODEX_SKILL

Workflow showing how to use the OpenAI Codex CLI for deep, non-interactive debugging via a file-based question-and-answer process: write a detailed problem description and full code context into a file, then let Codex produce a structured written analysis.

Gist

Other

Materials from my talk, "A Day of an AI Engineer"

Practical resource that combines a live webinar recording with the full planned write-up and a demo project. It also compares the AI Engineer role to traditional data team roles and maps modern AI projects to CRISP-DM.

Google Docs

Other

Locality Sensitive Hashing: The Illustrated Guide

In-depth tutorial from Pinecone's FAISS learning series that covers LSH (theory + Python implementation) for approximate nearest-neighbor search.

Pinecone

Tools

Atomic Agents

Lightweight, modular framework for building AI agent pipelines emphasizing atomic, single-purpose components that are reusable, composable, and predictable (built on Instructor and Pydantic).

GitHub

Tools

Gas Town

Orchestration system for managing multiple Claude Code instances simultaneously. Coordinates parallel tasks, tracks work across agent instances, manages merge queues, and keeps persistent agent identities (tmux-based workflow).

GitHub