Follow these steps to get started using these resources:
- Fork the Repository: Click
- Clone the Repository:
git clone https://github.com/microsoft/edgeai-for-beginners.git - Join The Azure AI Foundry Discord and meet experts and fellow developers
Arabic | Bengali | Bulgarian | Burmese (Myanmar) | Chinese (Simplified) | Chinese (Traditional, Hong Kong) | Chinese (Traditional, Macau) | Chinese (Traditional, Taiwan) | Croatian | Czech | Danish | Dutch | Estonian | Finnish | French | German | Greek | Hebrew | Hindi | Hungarian | Indonesian | Italian | Japanese | Kannada | Khmer | Korean | Lithuanian | Malay | Malayalam | Marathi | Nepali | Nigerian Pidgin | Norwegian | Persian (Farsi) | Polish | Portuguese (Brazil) | Portuguese (Portugal) | Punjabi (Gurmukhi) | Romanian | Russian | Serbian (Cyrillic) | Slovak | Slovenian | Spanish | Swahili | Swedish | Tagalog (Filipino) | Tamil | Telugu | Thai | Turkish | Ukrainian | Urdu | Vietnamese
Prefer to Clone Locally?
This repository includes 50+ language translations which significantly increases the download size. To clone without translations, use sparse checkout:
Bash / macOS / Linux:
git clone --filter=blob:none --sparse https://github.com/microsoft/edgeai-for-beginners.git cd edgeai-for-beginners git sparse-checkout set --no-cone '/*' '!translations' '!translated_images'CMD (Windows):
git clone --filter=blob:none --sparse https://github.com/microsoft/edgeai-for-beginners.git cd edgeai-for-beginners git sparse-checkout set --no-cone "/*" "!translations" "!translated_images"This gives you everything you need to complete the course with a much faster download.
If you wish to have additional translations languages supported are listed here
Welcome to EdgeAI for Beginners โ your comprehensive journey into the transformative world of Edge Artificial Intelligence. This course bridges the gap between powerful AI capabilities and practical, real-world deployment on edge devices, empowering you to harness AI's potential directly where data is generated and decisions need to be made.
This course takes you from fundamental concepts to production-ready implementations, covering:
- Small Language Models (SLMs) optimized for edge deployment
- Hardware-aware optimization across diverse platforms
- Real-time inference with privacy-preserving capabilities
- Production deployment strategies for enterprise applications
Edge AI represents a paradigm shift that addresses critical modern challenges:
- Privacy & Security: Process sensitive data locally without cloud exposure
- Real-time Performance: Eliminate network latency for time-critical applications
- Cost Efficiency: Reduce bandwidth and cloud computing expenses
- Resilient Operations: Maintain functionality during network outages
- Regulatory Compliance: Meet data sovereignty requirements
Edge AI refers to running AI algorithms and language models locally on hardware, close to where data is generated without relying on cloud resources for inference. It reduces latency, enhances privacy, and enables real-time decision-making.
- On-device inference: AI models run on edge devices (phones, routers, microcontrollers, industrial PCs)
- Offline capability: Functions without persistent internet connectivity
- Low latency: Immediate responses suited for real-time systems
- Data sovereignty: Keeps sensitive data local, improving security and compliance
SLMs like Phi-4, Mistral-7B, and Gemma are optimized versions of larger LLMsโtrained or distilled for:
- Reduced memory footprint: Efficient use of limited edge device memory
- Lower compute demand: Optimized for CPU and edge GPU performance
- Faster startup times: Quick initialization for responsive applications
They unlock powerful NLP capabilities while meeting the constraints of:
- Embedded systems: IoT devices and industrial controllers
- Mobile devices: Smartphones and tablets with offline capabilities
- IoT Devices: Sensors and smart devices with limited resources
- Edge servers: Local processing units with limited GPU resources
- Personal Computers: Desktop and laptop deployment scenarios
| Module | Topic | Focus Area | Key Content | Level | Duration |
|---|---|---|---|---|---|
| ๐ 00 | Introduction to EdgeAI | Foundation & Context | EdgeAI Overview โข Industry Applications โข SLM Introduction โข Learning Objectives | Beginner | 1-2 hrs |
| ๐ 01 | EdgeAI Fundamentals | Cloud vs Edge AI comparison | EdgeAI Fundamentals โข Real World Case Studies โข Implementation Guide โข Edge Deployment | Beginner | 3-4 hrs |
| ๐ง 02 | SLM Model Foundations | Model families & architecture | Phi Family โข Qwen Family โข Gemma Family โข BitNET โข ฮผModel โข Phi-Silica | Beginner | 4-5 hrs |
| ๐ 03 | SLM Deployment Practice | Local & cloud deployment | Advanced Learning โข Local Environment โข Cloud Deployment | Intermediate | 4-5 hrs |
| โ๏ธ 04 | Model Optimization Toolkit | Cross-platform optimization | Introduction โข Llama.cpp โข Microsoft Olive โข OpenVINO โข Apple MLX โข Workflow Synthesis | Intermediate | 5-6 hrs |
| ๐ง 05 | SLMOps Production | Production operations | SLMOps Introduction โข Model Distillation โข Fine-tuning โข Production Deployment | Advanced | 5-6 hrs |
| ๐ค 06 | AI Agents & Function Calling | Agent frameworks & MCP | Agent Introduction โข Function Calling โข Model Context Protocol | Advanced | 4-5 hrs |
| ๐ป 07 | Platform Implementation | Cross-platform samples | AI Toolkit โข Foundry Local โข Windows Development | Advanced | 3-4 hrs |
| ๐ญ 08 | Foundry Local Toolkit | Production-ready samples | Sample applications (see details below) | Expert | 8-10 hrs |
- 01: REST Chat Quickstart
- 02: OpenAI SDK Integration
- 03: Model Discovery & Benchmarking
- 04: Chainlit RAG Application
- 05: Multi-Agent Orchestration
- 06: Models-as-Tools Router
- 07: Direct API Client
- 08: Windows 11 Chat App
- 09: Advanced Multi-Agent System
- 10: Foundry Tools Framework
Comprehensive hands-on workshop materials with production-ready implementations:
- Workshop Guide - Complete learning objectives, outcomes, and resource navigation
- Python Samples (6 sessions) - Updated with best practices, error handling, and comprehensive documentation
- Jupyter Notebooks (8 interactive) - Step-by-step tutorials with benchmarks and performance monitoring
- Session Guides - Detailed markdown guides for each workshop session
- Validation Tools - Scripts to verify code quality and run smoke tests
What You'll Build:
- Local AI chat applications with streaming support
- RAG pipelines with quality evaluation (RAGAS)
- Multi-model benchmarking and comparison tools
- Multi-agent orchestration systems
- Intelligent model routing with task-based selection
Build an AI-powered podcast production pipeline from scratch! This immersive workshop teaches you to create a complete multi-agent system that transforms ideas into professional podcast episodes.
๐ฌ Start The AI Podcast Studio Workshop
Your Mission: Launch "Future Bytes" โ a tech podcast powered entirely by AI agents you'll build yourself. No cloud dependencies, no API costs โ everything runs locally on your machine.
What Makes This Unique:
- ๐ค Real Multi-Agent Orchestration - Build specialized AI agents that research, write, and produce audio
- ๐ฏ Complete Production Pipeline - From topic selection to final podcast audio output
- ๐ป 100% Local Deployment - Uses Ollama and local models (Qwen-3-8B) for full privacy and control
- ๐ค Text-to-Speech Integration - Transform scripts into natural-sounding multi-speaker conversations
- โ Human-in-the-Loop Workflows - Approval gates ensure quality while maintaining automation
Three-Act Learning Journey:
| Act | Focus | Key Skills | Duration |
|---|---|---|---|
| Act 1: Meet Your AI Assistants | Build your first AI agent | Tool integration โข Web search โข Problem-solving โข Agentic reasoning | 2-3 hrs |
| Act 2: Assemble Your Production Team | Orchestrate multiple agents | Team coordination โข Approval workflows โข DevUI interface โข Human oversight | 3-4 hrs |
| Act 3: Bring Your Podcast to Life | Generate podcast audio | Text-to-speech โข Multi-speaker synthesis โข Long-form audio โข Full automation | 2-3 hrs |
Technologies Used:
- Microsoft Agent Framework - Multi-agent orchestration and coordination
- Ollama - Local AI model runtime (no cloud required)
- Qwen-3-8B - Open-source language model optimized for agentic tasks
- Text-to-Speech APIs - Natural voice synthesis for podcast generation
Hardware Support:
- โ CPU Mode - Works on any modern computer (8GB+ RAM recommended)
- ๐ GPU Acceleration - Significantly faster inference with NVIDIA/AMD GPUs
- โก NPU Support - Next-generation neural processing unit acceleration
Perfect For:
- Developers learning multi-agent AI systems
- Anyone interested in AI automation and workflows
- Content creators exploring AI-assisted production
- Students studying practical AI orchestration patterns
Start Building: ๐๏ธ The AI Podcast Studio Workshop โ
- Total Duration: 36-45 hours
- Beginner Path: Modules 01-02 (7-9 hours)
- Intermediate Path: Modules 03-04 (9-11 hours)
- Advanced Path: Modules 05-07 (12-15 hours)
- Expert Path: Module 08 (8-10 hours)
- Edge AI Architecture: Design local-first AI systems with cloud integration
- Model Optimization: Quantize and compress models for edge deployment (85% speed boost, 75% size reduction)
- Multi-Platform Deployment: Windows, mobile, embedded, and cloud-edge hybrid systems
- Production Operations: Monitoring, scaling, and maintaining edge AI in production
- Foundry Local Chat Apps: Windows 11 native application with model switching
- Multi-Agent Systems: Coordinator with specialist agents for complex workflows
- RAG Applications: Local document processing with vector search
- Model Routers: Intelligent selection between models based on task analysis
- API Frameworks: Production-ready clients with streaming and health monitoring
- Cross-Platform Tools: LangChain/Semantic Kernel integration patterns
Manufacturing โข Healthcare โข Autonomous Vehicles โข Smart Cities โข Mobile Apps
Recommended Learning Path (20-30 hours total):
- ๐ Introduction (Introduction.md): EdgeAI foundation + industry context + learning framework
- ๐ Foundation (Modules 01-02): EdgeAI concepts + SLM model families
- โ๏ธ Optimization (Modules 03-04): Deployment + quantization frameworks
- ๐ Production (Modules 05-06): SLMOps + AI agents + function calling
- ๐ป Implementation (Modules 07-08): Platform samples + Foundry Local toolkit
Each module includes theory, hands-on exercises, and production-ready code samples.
Technical Roles: EdgeAI Solutions Architect โข ML Engineer (Edge) โข IoT AI Developer โข Mobile AI Developer
Industry Sectors: Manufacturing 4.0 โข Healthcare Tech โข Autonomous Systems โข FinTech โข Consumer Electronics
Portfolio Projects: Multi-agent systems โข Production RAG apps โข Cross-platform deployment โข Performance optimization
edgeai-for-beginners/
โโโ ๐ introduction.md # Foundation: EdgeAI Overview & Learning Framework
โโโ ๐ Module01-04/ # Fundamentals โ SLMs โ Deployment โ Optimization
โโโ ๐ง Module05-06/ # SLMOps โ AI Agents โ Function Calling
โโโ ๐ป Module07/ # Platform Samples (VS Code, Windows, Jetson, Mobile)
โโโ ๐ญ Module08/ # Foundry Local Toolkit + 10 Comprehensive Samples
โ โโโ samples/01-06/ # Foundation: REST, SDK, RAG, Agents, Routing
โ โโโ samples/07-10/ # Advanced: API Client, Windows App, Enterprise Agents, Tools
โโโ ๐ translations/ # Multi-language support (8+ languages)
โโโ ๐ STUDY_GUIDE.md # Structured learning paths & time allocation
โ
Progressive Learning: Theory โ Practice โ Production deployment
โ
Real Case Studies: Microsoft, Japan Airlines, enterprise implementations
โ
Hands-on Samples: 50+ examples, 10 comprehensive Foundry Local demos
โ
Performance Focus: 85% speed improvements, 75% size reductions
โ
Multi-Platform: Windows, mobile, embedded, cloud-edge hybrid
โ
Production Ready: Monitoring, scaling, security, compliance frameworks
๐ Study Guide Available: Structured 20-hour learning path with time allocation guidance and self-assessment tools.
EdgeAI represents the future of AI deployment: local-first, privacy-preserving, and efficient. Master these skills to build the next generation of intelligent applications.
Our team produces other courses! Check out:
If you get stuck or have any questions about building AI apps, join:
If you have product feedback or errors while building visit:
