Learning Resources
Curated collection of articles, courses, tools, and tutorials from industry leaders to accelerate your AI and data engineering journey.
Resources
Categories
Access
Featured Resources
Attention Is All You Need - The Transformer Paper
The foundational paper that introduced the Transformer architecture, the basis for GPT, BERT, and all modern LLMs.
arXiv
LangChain Documentation
Official documentation for building applications with LLMs using LangChain framework.
LangChain
Stanford CS25: Transformers United
Stanford's comprehensive course on Transformer models and their applications across domains.
Stanford University
OpenAI Playground
Interactive environment to experiment with OpenAI models, test prompts, and understand model behavior.
OpenAI
Databricks Academy - Free Training
Official Databricks training covering Spark, Delta Lake, and data engineering best practices.
Databricks
The Data Engineering Handbook
Open-source handbook covering modern data engineering practices, tools, and architectures.
DataExpert.io
ChatGPT Prompt Engineering for Developers
DeepLearning.AI course teaching best practices for prompting and building with LLMs.
DeepLearning.AI
Prompt Engineering Guide
Comprehensive open-source guide covering prompt engineering techniques, tips, and applications.
DAIR.AI
MLOps Zoomcamp - Free Course
Free 9-week course covering ML experiment tracking, orchestration, deployment, and monitoring.
DataTalks.Club
The AI Engineer Roadmap 2024
Comprehensive roadmap for becoming an AI engineer, from fundamentals to production systems.
roadmap.sh
Hugging Face
Platform for discovering, sharing, and deploying machine learning models and datasets.
Hugging Face
All Resources
(22)RAG (Retrieval Augmented Generation) Explained
Comprehensive guide to understanding and implementing RAG systems for production applications.
Pinecone
Apache Spark Documentation
Official Apache Spark documentation with tutorials, API references, and best practices.
Apache
dbt (Data Build Tool) Best Practices
Learn how to build maintainable, tested, and documented data transformation pipelines with dbt.
dbt Labs
Apache Airflow
Open-source workflow orchestration platform for building and managing data pipelines.
Apache
PromptBase - Prompt Marketplace
Browse and purchase high-quality prompts for various AI models and use cases.
PromptBase
Anthropic's Prompt Engineering Guide
Official guide from Anthropic on how to get the best results from Claude and other LLMs.
Anthropic
MLflow Documentation
Learn to track experiments, package models, and deploy ML projects with MLflow.
MLflow
Weights & Biases
Platform for experiment tracking, model management, and collaboration in ML projects.
W&B
Data Engineering Interview Questions
Collection of common data engineering interview questions covering SQL, Python, Spark, and system design.
GitHub
Replicate
Run and deploy open-source machine learning models with a simple API.
Replicate
GitHub Copilot
AI pair programmer that helps you write code faster with AI-powered suggestions.
GitHub