A practical guide for real-world language-based AI applications for resource-constrained devices with industry standards in mind.
- Start with the Edge AI Engineering: a practical guide covering core concepts of the entire Edge AI MLOps stack with industry blueprints.
- Then read this: The Next AI Frontier is at the Edge
- Related work: Edge Audio | Edge Vision
The goal of this guide is to provide resources for building, optimizing, and deploying language-based AI applications at the edge, through hands-on examples including practical notebooks and real-world use cases across key industries.
Industry Blueprints
- Autonomous Systems
- Healthcare & Medical Imaging*
- Retail & Consumer Analytics
- Security & Surveillance
- Agriculture & Precision Farming
- Manufacturing & Quality Control
- Smart Cities & Urban Planning
Edge Optimization Lab: techniques and tools for maximizing performance and efficiency of language models on edge hardware
- Model Quantization
- Pruning Techniques
- Federated Learning
- Compiler Targets (ONNX, TVM)
- Hardware-Specific Optimizations (Jetson, Raspberry Pi, Microcontrollers)
Production Pipelines: guides and templates for robust, scalable edge language AI operations
- CI/CD for Language Models on the Edge
- Monitoring (Drift Detection, Edge Metrics Dashboard)
- OTA Updates
- Edge Security (Secure Boot, Data Encryption, Threat Detection, Privacy-Preserving language, Adversarial Robustness, Device Hardening, Compliance)
Reference Architectures: blueprints for edge language hardware and system design
- Microphone Array Setups
- Edge Server Specs
- IoT Connectivity
- Edge-Cloud Hybrid Models
Integration
- Notebooks (hands-on deep dives)
- Companion Resources
- Industry-Specific Stardards
βββ edge-ai-engineering/
β βββ introduction-to-edge-ai.md
β βββ edge-ai-architectures.md
β βββ model-optimization-techniques.md
β βββ hardware-acceleration.md
β βββ edge-deployment-strategies.md
β βββ real-time-processing.md
β βββ privacy-and-security.md
β βββ edge-ai-frameworks.md
β βββ benchmarking-and-performance.md
βββ industry-blueprints/
β βββ autonomous-systems/
β β βββ voice-command-recognition-tflite.md
β β βββ natural-language-vehicle-control.md
β β βββ edge-language-agents-drone.md
β βββ healthcare/
β β βββ clinical-text-analysis-edge.md
β β βββ patient-conversation-models.md
β β βββ medical-language-agents.md
β βββ retail-consumer-analytics/
β β βββ sentiment-analysis-edge.md
β β βββ chatbot-instore-assistance.md
β β βββ personalized-recommendation-edge.md
β βββ security-surveillance/
β β βββ voice-biometric-authentication.md
β β βββ call-center-language-intent-detection.md
β β βββ edge-threat-intelligence-llm.md
β βββ agriculture-precision-farming/
β β βββ voice-command-machinery-control.md
β β βββ farmer-assistant-chatbots.md
β β βββ natural-language-report-generation.md
β βββ manufacturing/
β β βββ voice-operated-maintenance.md
β β βββ defect-report-llm.md
β β βββ predictive-language-agents.md
β βββ smart-cities/
β βββ multilingual-public-announcements.md
β βββ emergency-alert-llm.md
β βββ citizen-feedback-analysis.md
βββ edge-optimization-lab/
β βββ model-quantization/
β β βββ int8-quantization-for-llm.md
β β βββ post-training-quantization-llm.md
β βββ pruning-techniques/
β β βββ structured-pruning-llms.md
β β βββ adaptive-pruning-edge.md
β βββ federated-learning/
β β βββ privacy-preserving-llm.md
β β βββ distributed-fine-tuning.md
β βββ compiler-targets/
β β βββ onnx-runtime-for-llms.md
β β βββ tvm-compiler-usage.md
β βββ hardware-specific-optimization/
β βββ nvidia-jetson-llm-optimization.md
β βββ intel-openvino-llm.md
β βββ raspberry-pi-llm.md
β βββ microcontroller-tinyml-language.md
βββ production-pipelines/
β βββ ci-cd-for-edge.md
β βββ monitoring/
β β βββ drift-detection-llm.md
β β βββ edge-llm-metrics-dashboard.md
β βββ ota-updates.md
β βββ edge-security/
β βββ secure-boot-implementation.md
β βββ data-encryption-edge.md
β βββ threat-detection/
β β βββ anomaly-detection-text.md
β β βββ adversarial-attack-defense.md
β βββ privacy-preserving-llm/
β β βββ federated-learning-techniques.md
β β βββ differential-privacy.md
β βββ model-security/
β β βββ adversarial-robustness.md
β βββ edge-device-hardening/
β β βββ secure-deployment.md
β β βββ secure-communication.md
β βββ industry-compliance/
β βββ regulatory-standards.md
β βββ ethical-ai-guidelines.md
βββ reference-architectures/
β βββ language-model-hardware.md
β βββ edge-server-specs.md
β βββ iot-connectivity.md
β βββ edge-cloud-hybrid-models.md
βββ integration/
βββ cs-notebook-redirects.md
βββ companion-resources.md
βββ industry-specific-regulations.md
Important
This project uses a submodule edge-ai-engineering located in lab/edge-ai-engineering.
Please initialize submodules after cloning the repository: git submodule update --init --recursive
- Clone this repository:
git clone https://github.com/afondiel/edge-language.git- Explore the Edge AI Engineering for foundational knowledge.
- Dive into Industry Blueprints for hands-on, sector-specific language AI guides.
- Use the Edge Optimization Lab and Production Pipeline for deployment and scaling.
See CONTRIBUTING.md for guidelines on how to contribute, report issues, or suggest new blueprints.
Distributed under the MIT License. See LICENSE for more information.
Technical Guides:
Curated Lists:
- awesomesmol - An awesome list of "small but mighty" models and resources.
- Edge-AI Model Zoo - A list of production-ready models for resource-constrained devices.
Books: