Skip to main content
Enterprise AI Analysis: Send Less, Save More: Energy-Efficiency Benchmark of Embedded CNN Inference vs. Data Transmission in IoT

Enterprise AI Analysis

Send Less, Save More: Energy-Efficiency Benchmark of Embedded CNN Inference vs. Data Transmission in IoT

The integration of IoT and AI offers significant opportunities for environmental monitoring. However, remote operation requires highly energy-efficient solutions. This research evaluates the energy consumption of on-device Convolutional Neural Network (CNN) inference compared to raw image data transmission over Low Power Wide Area Networks (LPWANs), demonstrating significant energy savings by processing data locally.

Benjamin Karic, Nina Herrmann, Jan Stenkamp, Fabian Gieseke, Angela Schwering (University of Münster, Germany)
Paula Scharf (re:edu GmbH & Co. KG, Münster, Germany)

Executive Impact at a Glance

This study quantifies the substantial energy and operational benefits of deploying embedded AI for IoT applications, crucial for sustainable remote environmental monitoring.

0x Energy Reduction (CNN Inference vs. Raw Image Tx)
0x Energy Reduction (Conditional Result Tx)
0% SqueezeNet Size vs. MobileNetV2 (Quantized)
0% Inference Energy Share of Total ML
50.51 µAh Lowest Energy for Class Index Transmission (LoRaWAN)

Deep Analysis & Enterprise Applications

Select a topic to dive deeper, then explore the specific findings from the research, rebuilt as interactive, enterprise-focused modules.

The Edge vs. Cloud Paradigm

The core challenge in remote IoT monitoring, especially with image data, is energy efficiency. Traditional "Cloud-ML" involves transmitting raw data for server-side processing. Our "Embedded-ML" approach emphasizes local inference on microcontrollers, significantly reducing data transmission volume and enabling the use of energy-efficient LPWANs.

This work provides a comprehensive benchmark comparing these paradigms using state-of-the-art CNNs (MobileNetV2, SqueezeNet) and various LPWAN protocols (LoRaWAN, LTE-M, NB-IoT) on ESP32-S3 microcontrollers.

Enterprise Process Flow: Embedded ML Deployment

PlantVillage / CUB Dataset
Resize to 224x224
Pretrained on ImageNet
MobileNet/SqueezeNet Training
Post Training Quantization (8-bit Integer)
Deployment on ESP32-S3 Microcontroller

Model Selection & Performance on Edge Devices

We evaluated MobileNetV2 and SqueezeNet, fine-tuning them on PlantVillage (plant disease) and CUB (bird species) datasets. Crucially, models underwent 8-bit post-training quantization to reduce memory footprint, a necessity for resource-constrained microcontrollers like the ESP32-S3.

Quantization led to minor accuracy losses (around 1% for PlantVillage, 3-6% for CUB) while significantly reducing model sizes. For instance, quantized SqueezeNet models were approximately 8% the size of unquantized MobileNetV2 models.

Inference consistently accounts for over 90% of the total ML-related energy, highlighting its intensive nature despite model compression. This contradicts simple FLOPs-based estimations, emphasizing the need for real-world energy measurements on target hardware.

Model Performance & Size Comparison (Quantized 8-bit Integer)

Model Dataset Size (MB) Top-1 Accuracy Top-5 Accuracy
MobileNetV2 (Int) CUB 2.59 48.96% 74.83%
MobileNetV2 (Int) PlantVillage 2.38 95.87% 99.94%
SqueezeNet (Int) CUB 0.87 53.04% 82.29%
SqueezeNet (Int) PlantVillage 0.79 96.47% 99.92%

Optimizing Data Transfer with LPWANs

Data transmission is often the most energy-intensive part of IoT operations. We benchmarked various LPWAN technologies and application layer protocols:

  • LoRaWAN: Ideal for transmitting small data (e.g., class indices) due to low connection energy, but with duty cycle limitations for frequency.
  • LTE-M & NB-IoT (MQTT/CoAP): Suitable for larger data, with LTE-M MQTT proving most efficient for transmitting full images. NB-IoT generally incurs higher latencies and energy costs.

Crucially, reducing message size from a full image to a small class index enables the use of highly energy-efficient protocols like LoRaWAN, which would otherwise be unfeasible.

Energy Footprint: Transmitting Class Label vs. Image

Data Type Protocol LPWAN Total Energy (µAh)
Class Index CoAP LTE-M 148.98
Class Index MQTT LTE-M 187.62
Class Index LoRaWAN LoRa 50.54
Image (32x32) CoAP LTE-M 148.97
Image (224x224) MQTT LTE-M 331.34
Image (224x224) MQTT NB-IoT 1247.95

Sending a raw 224x224 pixel image via NB-IoT MQTT costs nearly 1.25 mAh, while sending a class index via LoRaWAN costs only 50.54 µAh. This stark difference highlights the immense energy savings from on-device inference.

Achieving Maximum Energy Efficiency in Practice

By combining on-device CNN inference with optimized data transmission, overall energy consumption can be drastically reduced. Our experiments show that executing CNN inference on-device and transmitting only the results reduces energy consumption by a factor of up to five compared to sending raw image data.

Furthermore, implementing a "result-based sending" strategy, where data is only transmitted when an event of interest is detected (e.g., every 10th measurement), leads to an energy reduction factor of almost 14 times.

These findings empower developers to create IoT applications with extended operational lifespans for remote environmental monitoring, significantly lowering the carbon footprint and maintenance costs.

Case Study: Remote Biodiversity Monitoring

Challenge: Monitoring bird species in remote forest locations requires long-term, autonomous operation of devices with limited power and poor connectivity. Transmitting high-resolution images of birds for cloud-based classification is energy-prohibitive.

EmbeddedML Solution: Deploying a quantized SqueezeNet model on an ESP32-S3 microcontroller to classify bird species locally. Instead of sending the full image, only the identified class index (e.g., "European Robin") is transmitted via LoRaWAN.

Impact: This approach reduces data transmission energy by a factor of 5-14x depending on the sending frequency, extending the device's battery life from weeks to months or even years. This enables truly autonomous and scalable biodiversity monitoring without constant human intervention or costly infrastructure.

Considerations & Future Directions

While EmbeddedML offers significant advantages, it comes with certain trade-offs:

  • Loss of Raw Data: On-device processing means raw images are not stored or transmitted, limiting post-hoc analysis or re-evaluation.
  • Model Constraints: The current approach is limited to relatively small models compressed for specific microcontrollers. Larger or more complex models might require more powerful (and energy-intensive) hardware.
  • Hardware Specificity: The use of manufacturer-provided libraries (like ESP-DL) optimizes for specific hardware but reduces cross-platform compatibility.

Future work could explore Neural Architecture Search (NAS) to design energy-optimized models from scratch, investigate image batching for more efficient transmission of larger images, and evaluate protocol-specific energy-saving features like faster reconnection times.

Calculate Your Potential ROI

See how integrating energy-efficient EmbeddedML into your IoT operations can translate into tangible savings and increased operational capacity.

Estimate Your Annual Savings

Estimated Annual Savings $0
Operational Hours Reclaimed 0

Your AI Implementation Roadmap

Our proven process guides you from initial concept to a fully optimized and deployed energy-efficient IoT solution.

Phase 1: Discovery & Strategy

Assess current IoT infrastructure, identify environmental monitoring needs, and define key performance indicators for energy efficiency and data throughput.

Phase 2: Model Selection & Optimization

Evaluate suitable CNN architectures (e.g., SqueezeNet, MobileNetV2), perform transfer learning on domain-specific datasets, and apply quantization for embedded deployment.

Phase 3: Hardware Integration & Benchmarking

Deploy models on selected microcontrollers (e.g., ESP32-S3), integrate camera and communication modules, and conduct real-world energy consumption benchmarks.

Phase 4: Protocol & Deployment Strategy

Select optimal LPWAN protocols (LoRaWAN, LTE-M) based on data size and frequency, and implement smart sending strategies for maximum energy savings.

Phase 5: Monitoring & Iteration

Continuously monitor deployed devices, analyze performance data, and iterate on models or protocols for ongoing optimization and scalability.

Ready to Transform Your IoT Strategy?

Leverage the power of Embedded ML to reduce energy consumption, extend device lifespan, and achieve truly autonomous environmental monitoring. Book a free consultation to explore how these insights can be applied to your specific needs.

Ready to Get Started?

Book Your Free Consultation.

Let's Discuss Your AI Strategy!

Lets Discuss Your Needs


AI Consultation Booking