All notable changes to EdgeAI for Beginners are documented here. This project uses date-based entries and the Keep a Changelog style (Added, Changed, Fixed, Removed, Docs, Moved).
-
Microsoft Agent Framework Integration (
Module06/01.IntroduceAgent.md):- Complete section on Microsoft Agent Framework for production-ready agent development
- Detailed integration patterns with Foundry Local for edge deployment
- Multi-agent orchestration examples with specialized SLM models
- Enterprise deployment patterns with resource management and monitoring
- Security and compliance features for edge agent systems
- Real-world implementation examples (retail, healthcare, customer service)
-
Production SLM Agent Deployment Strategies:
- Foundry Local: Complete enterprise-grade edge AI runtime documentation with installation, configuration, and production patterns
- Ollama: Enhanced community-focused deployment with comprehensive monitoring and model management
- VLLM: High-performance inference engine with advanced optimization techniques and enterprise features
- Production deployment checklists and comparison tables for all three platforms
-
Edge-Optimized SLM Frameworks Enhancement:
- ONNX Runtime: New comprehensive section for cross-platform SLM agent deployment
- Universal deployment patterns across Windows, Linux, macOS, iOS, and Android
- Hardware acceleration options (CPU, GPU, NPU) with automatic detection
- Production-ready features and agent-specific optimizations
- Complete implementation examples with Microsoft Agent Framework integration
-
References and Further Reading:
- Comprehensive resource library with 100+ authoritative sources
- Core research papers on AI agents and Small Language Models
- Official documentation for all major frameworks and tools
- Industry reports, market analysis, and technical benchmarks
- Educational resources, conferences, and community forums
- Standards, specifications, and compliance frameworks
- Enhanced Learning Objectives: Added Microsoft Agent Framework mastery and edge deployment capabilities
- Production Focus: Shifted from conceptual to implementation-ready guidance with production examples
- Code Examples: Updated all examples to use modern SDK patterns and best practices
- Architecture Patterns: Added hierarchical agent architectures and edge-to-cloud coordination
- Performance Optimization: Enhanced with resource management and auto-scaling recommendations
- Comprehensive Agent Framework Coverage: From basic concepts to enterprise deployment
- Production Deployment Strategies: Complete guides for Foundry Local, Ollama, and VLLM
- Cross-Platform Optimization: Added ONNX Runtime for universal deployment
- Resource Library: Extensive references for continued learning and implementation
-
MCP Introduction Modernization (
Module06/03.IntroduceMCP.md):- Updated with latest MCP specifications from modelcontextprotocol.io (2025-06-18 version)
- Added official USB-C analogy for standardized AI application connections
- Updated architecture section with official two-layer design (Data Layer + Transport Layer)
- Enhanced core primitives documentation with server primitives (Tools, Resources, Prompts) and client primitives (Sampling, Elicitation, Logging)
-
Comprehensive MCP References and Resources:
- Added MCP for Beginners link (https://aka.ms/mcp-for-beginners)
- Official MCP documentation and specifications (modelcontextprotocol.io)
- Development resources including MCP Inspector and reference implementations
- Technical standards (JSON-RPC 2.0, JSON Schema, OpenAPI, Server-Sent Events)
-
New Section 7: Qualcomm QNN Optimization Suite (
Module04/05.QualcommQNN.md):- Comprehensive 400+ line guide covering Qualcomm's unified AI inference framework
- Detailed coverage of heterogeneous computing (Hexagon NPU, Adreno GPU, Kryo CPU)
- Hardware-aware optimization for Snapdragon platforms with intelligent workload distribution
- Advanced quantization techniques (INT8, INT16, mixed-precision) for mobile deployment
- Power-efficient inference optimization for battery-powered devices and real-time applications
- Complete installation guide with QNN SDK setup and environment configuration
- Practical examples: PyTorch to QNN conversion, multi-backend optimization, context binary generation
- Advanced usage patterns: custom backend configuration, dynamic quantization, performance profiling
- Comprehensive troubleshooting section and community resources
-
Enhanced Module04 structure:
- Updated README.md to include 7 progressive sections (was 6)
- Added Qualcomm QNN to performance benchmarks table (5-15x speed improvement, 50-80% memory reduction)
- Comprehensive learning outcomes for mobile AI deployment and power optimization
-
Microsoft Olive documentation enhancement (
Module04/03.MicrosoftOlive.md):- Added comprehensive "Olive Recipes Repository" section covering 100+ pre-built optimization recipes
- Detailed coverage of supported model families (Phi, Llama, Qwen, Gemma, Mistral, DeepSeek)
- Practical usage examples for recipe customization and community contributions
- Enhanced with performance benchmarks and integration guidance
-
Section reordering in Module04:
- Apple MLX moved to Section 5 (was Section 6)
- Workflow Synthesis moved to Section 6 (was Section 7)
- Qualcomm QNN positioned as Section 7 (specialized mobile/edge focus)
- Updated all file references and navigation links accordingly
- chat_bootstrap.py validation and repair:
- Fixed corrupted import statement (
util.util.workshop_utils→util.workshop_utils) - Created missing
__init__.pyin util package for proper Python module resolution - Installed required dependencies (openai, foundry-local-sdk) in conda environment
- Successfully validated sample execution with both default and custom prompts
- Confirmed integration with Foundry Local service and model loading (phi-4-mini with CUDA optimization)
- Fixed corrupted import statement (
-
Module04 README.md complete restructure:
- Added Qualcomm QNN as major optimization framework alongside OpenVINO, Olive, MLX
- Updated chapter learning outcomes to include mobile AI deployment and power optimization
- Enhanced performance comparison table with QNN metrics and mobile/edge use cases
- Maintained logical progression from enterprise solutions to platform-specific optimizations
-
Cross-references and navigation:
- Updated all internal links and file references for new section numbering
- Enhanced workflow synthesis description to include mobile, desktop, and cloud environments
- Added comprehensive resource links for Qualcomm developer ecosystem
-
Workshop README.md complete rewrite:
- Added comprehensive introduction explaining Edge AI value proposition (privacy, performance, cost)
- Created 6 core learning objectives with detailed competencies
- Added learning outcomes table with deliverables and competency matrix
- Included career-ready skills section for industry relevance
- Added quick start guide with prerequisites and 3-step setup
- Created resource tables for Python samples (8 files with run times)
- Added Jupyter notebooks table (8 notebooks with difficulty ratings)
- Created documentation table (7 key docs with "Use When" guidance)
- Added learning path recommendations for different skill levels
-
Workshop validation and testing infrastructure:
- Created
scripts/validate_samples.py- Comprehensive validation tool for syntax, imports, and best practices - Created
scripts/test_samples.py- Smoke test runner for all Python samples - Added validation documentation to
scripts/README.md
- Created
-
Comprehensive documentation:
- Created
SAMPLES_UPDATE_SUMMARY.md- 400+ line detailed guide covering all improvements - Created
UPDATE_COMPLETE.md- Executive summary of update completion - Created
QUICK_REFERENCE.md- Quick reference card for Workshop
- Created
-
All 8 Python samples updated with best practices:
- Enhanced error handling with try-except blocks around all I/O operations
- Added type hints and comprehensive docstrings
- Implemented consistent [INFO]/[ERROR]/[RESULT] logging pattern
- Protected optional imports with installation hints
- Improved user feedback throughout all samples
-
session01/chat_bootstrap.py:
- Enhanced client initialization with comprehensive error messages
- Improved streaming error handling with chunk validation
- Added better exception handling for service unavailability
-
session02/rag_pipeline.py:
- Added import guards for sentence-transformers with installation hints
- Enhanced error handling for embedding and generation operations
- Improved output formatting with structured results
-
session02/rag_eval_ragas.py:
- Protected optional imports (ragas, datasets) with user-friendly error messages
- Added error handling for evaluation metrics
- Enhanced output formatting for evaluation results
-
session03/benchmark_oss_models.py:
- Implemented graceful degradation (continues on model failures)
- Added detailed progress reporting and per-model error handling
- Enhanced statistics calculation with comprehensive error recovery
-
session04/model_compare.py:
- Added type hints (Tuple return types)
- Enhanced output formatting with structured JSON results
- Implemented per-model error handling with recovery
-
session05/agents_orchestrator.py:
- Enhanced Agent.act() with comprehensive docstrings
- Added pipeline error handling with stage-by-stage logging
- Improved memory management and state tracking
-
session06/models_router.py:
- Enhanced function documentation for all routing components
- Added detailed logging in route() function
- Improved test output with structured results
-
session06/models_pipeline.py:
- Added error handling to chat() helper function
- Enhanced pipeline() with stage logging and progress reporting
- Improved main() with comprehensive error recovery
- Updated main README.md with Workshop section highlighting hands-on learning path
- Enhanced STUDY_GUIDE.md with comprehensive Workshop section including:
- Learning objectives and study focus areas
- Self-assessment questions
- Hands-on exercises with time estimates
- Time allocation for concentrated and part-time study
- Added Workshop to progress tracking template
- Updated time allocation guide from 20 hours to 30 hours (including Workshop)
- Added Workshop sample descriptions and learning outcomes to README
- Resolved inconsistent error handling patterns across Workshop samples
- Fixed optional dependency import errors with proper guards
- Corrected missing type hints in critical functions
- Addressed insufficient user feedback in error scenarios
- Fixed validation issues with comprehensive testing infrastructure
-
Comprehensive alignment with Microsoft Foundry-Local repository patterns
- Updated all code examples to use modern
FoundryLocalManagerand OpenAI SDK integration - Replaced deprecated manual
requestscalls with proper SDK usage - Aligned implementation patterns with official Microsoft documentation and samples
- Updated all code examples to use modern
-
05.AIPoweredAgents.md modernization:
- Updated multi-agent orchestration to use modern SDK patterns
- Enhanced coordinator implementation with advanced features (feedback loops, performance monitoring)
- Added comprehensive error handling and service health checking
- Integrated proper references to local samples (
samples/05/multi_agent_orchestration.ipynb) - Updated function calling examples to use modern
toolsparameter instead of deprecatedfunctions - Added production-ready patterns with monitoring and statistics tracking
-
06.ModelsAsTools.md complete rewrite:
- Replaced basic tool registry with intelligent model router implementation
- Added keyword-based model selection for different task types (general, reasoning, code, creative)
- Integrated environment-based configuration with flexible model assignment
- Enhanced with comprehensive service health monitoring and error handling
- Added production deployment patterns with request monitoring and performance tracking
- Aligned with local implementation in
samples/06/router.pyandsamples/06/model_router.ipynb
-
Documentation structure improvements:
- Added overview sections highlighting modernization and SDK alignment
- Enhanced with emojis and better formatting for improved readability
- Added proper references to local sample files throughout documentation
- Included production-ready implementation guidance and best practices
- Comprehensive overview sections in Module 08 files highlighting modern SDK integration
- Architecture highlights showcasing advanced features (multi-agent systems, intelligent routing)
- Direct references to local sample implementations for hands-on experience
- Production deployment guidance with monitoring and error handling patterns
- Interactive Jupyter notebook examples with advanced features and benchmarks
- Alignment discrepancies between documentation and actual sample implementations
- Outdated SDK usage patterns throughout Module 08
- Missing references to comprehensive local sample library
- Inconsistent implementation approaches across different sections
- Module 08: Microsoft Foundry Local – Complete Developer Toolkit
- Six sessions: setup, Azure AI Foundry integration, open-source models, cutting-edge demos, agents, and models-as-tools
- Runnable samples under
Module08/samples/01–06with Windows cmd instructions01REST quick chat (chat_quickstart.py)02SDK quickstart with OpenAI/Foundry Local and Azure OpenAI support (sdk_quickstart.py)03CLI list-and-bench (list_and_bench.cmd)04Chainlit demo (app.py)05Multi-agent orchestration (python -m samples.05.agents.coordinator)06Models-as-Tools router (router.py)
- Azure OpenAI support in Session 2 SDK sample with environment variable configuration
.vscode/settings.jsonto point toModule08/.venvand improve Python analysis resolution.envwithPYTHONPATHhint for VS Code/Pylance awareness
- Default model updated to
phi-4-miniacross Module 08 docs and samples; removed remainingphi-3.5mentions within Module 08 - Router (
Module08/samples/06/router.py) improvements:- Endpoint discovery via
foundry service statuswith regex parsing /v1/modelshealth check on startup- Env-configurable model registry (
GENERAL_MODEL,REASONING_MODEL,CODE_MODEL,TOOL_REGISTRYJSON)
- Endpoint discovery via
- Requirements updated:
Module08/requirements.txtnow includesopenai(alongsiderequests,chainlit) - Chainlit sample guidance clarified and troubleshooting added; import resolution via workspace settings
- Resolved import issues:
- Router no longer depends on a non-existent
utilsmodule; functions are inlined - Coordinator uses relative import (
from .specialists import ...) and is invoked via module path - VS Code/Pylance configuration to resolve
chainlitand package imports
- Router no longer depends on a non-existent
- Corrected minor typo in
STUDY_GUIDE.mdand added Module 08 coverage
- Deleted unused
Module08/infra/obs.pyand removed the emptyinfra/directory; observability patterns retained as optional in docs
- Consolidated Module 08 demos under
Module08/sampleswith session-numbered folders- Moved Chainlit app to
samples/04 - Moved agents to
samples/05and added__init__.pyfiles for package resolution
- Moved Chainlit app to
- Module 08 session docs and all sample READMEs enriched with Microsoft Learn and trusted vendor references
Module08/README.mdupdated with Samples Overview, router configuration, and validation tipsModule07/README.mdWindows Foundry Local section validated against Learn docsSTUDY_GUIDE.mdupdated:- Added Module 08 to overview, schedules, progress tracker
- Added comprehensive References section (Foundry Local, Azure AI, Olive, ONNX Runtime, OpenVINO, MLX, Llama.cpp, vLLM, Ollama, AI Toolkit, Windows ML)
- Course architecture and modules established (Modules 01–07)
- Iterative content modernization, formatting standardization, and added case studies
- Expanded optimization frameworks coverage (Llama.cpp, Olive, OpenVINO, Apple MLX)
- Optional per-sample smoke tests to validate Foundry Local availability
- Review translations to align model references (e.g.,
phi-4-mini) where appropriate - Add minimal pyright config if teams prefer workspace-wide strictness