Enterprise AI Analysis
Artificial Intelligence as a Service (AIaaS) for Cloud, Fog and the Edge: State-of-the-Art Practices
Authors: Naeem Syed, Adnan Anwar, Zubair Baig, and Sherali Zeadally
Publication: ACM Comput. Surv., Vol. 57, No. 8, Article 211. Publication date: March 2025.
DOI: 10.1145/3712016
Executive Summary: Key Business Insights
This analysis highlights the transformative potential of Artificial Intelligence as a Service (AIaaS) across cloud, fog, and edge computing paradigms. It details current state-of-the-art practices, identifies critical capabilities for successful AIaaS deployment, and outlines challenges that must be addressed for widespread adoption in enterprise settings.
Deep Analysis & Enterprise Applications
AI in the Cloud: Scalability and Rich Analytics
AI technologies in the cloud are mature, offering extensive open-source and proprietary tools. Cloud-based AI facilitates digital business transformation through big-data analytics and automation. Examples such as Google Cloud AI and Amazon AI leverage the cloud's high computation and storage capacity for complex AI tasks. This deployment model offers lower costs, affordable data storage, and seamless integration of AI applications.
Cloud AI supports various analytics types—descriptive, predictive, prescriptive, and adaptive—critical for data-driven decision-making in applications like Cloud-assisted Smart Factory (CaSF) for Industry 4.0. AI-enabled intelligent scheduling in the cloud also contributes to energy efficiency and green computing by optimizing resource allocation and control processes.
AI in Fog: Bridging Cloud and Edge for IoT
Fog computing brings cloud services closer to IoT end devices, enabling distributed processing and context awareness, which is vital for real-time, low-latency applications where traditional cloud architectures face bandwidth and latency issues. Fog-IoT offers an effective solution for managing computation, bandwidth, and latency requirements, placing a decentralized computing layer between data sources and the central cloud.
The Elastic Intelligent Fog (EiF) architecture showcases AI-enabled fog services for dynamic management of smart things, supporting virtual AI/IoT functions, intelligent orchestration, and advanced optimization. This includes applications like forecasting air quality and processing video analytics at the fog layer, promoting cross-border AI services and improving quality of service (QoS) by addressing resource management, heterogeneity, and network reliability.
AI at the Edge: Real-time Decisions and Low Latency
Edge computing addresses the limitations of cloud-based architectures by bringing AI inference services directly to IoT devices, ensuring real-time or ultra-low latency decisions. This is crucial for applications like autonomous vehicles, industrial automation, and smart edge architectures that dynamically monitor and adjust network configurations.
Techniques such as edge-cloud collaboration and device-edge co-inference optimize AI services on resource-constrained IoT devices. Lightweight DL models, often trained in the cloud and optimized for edge deployment, enable simpler inference tasks. The Industrial Internet of Things (IIoT) benefits significantly, with AI at the edge improving QoS through multi-hop cooperative computation offloading and federated learning for privacy-preserving model training.
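To make the cloud-train, edge-deploy pattern concrete, the sketch below compresses a Keras model with post-training quantization via TensorFlow Lite. This is an illustrative sketch, not the surveyed paper's method: the toy model, file path, and settings are assumptions, and a real deployment would start from a production model and re-validate accuracy after quantization.

```python
# Minimal sketch: shrink a cloud-trained Keras model for edge deployment
# via post-training quantization (TensorFlow Lite). The model and path
# are illustrative placeholders, not taken from the surveyed paper.
import tensorflow as tf

# Stand-in for a classifier trained in the cloud.
model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(224, 224, 3)),
    tf.keras.layers.Conv2D(8, 3, activation="relu"),
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(10, activation="softmax"),
])

converter = tf.lite.TFLiteConverter.from_keras_model(model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]  # weight quantization
tflite_model = converter.convert()

with open("model_edge.tflite", "wb") as f:
    f.write(tflite_model)  # compact artifact for resource-constrained devices
```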
Core AIaaS Characteristics for Enterprise Adoption
AIaaS leverages the "as-a-Service" model, providing AI capabilities via cloud infrastructure on a subscription basis. Key characteristics, aligned with NIST's definition of cloud computing, include: On-demand self-service for provisioning computing capabilities without human interaction, Broad network access from any device using standard protocols, and Resource pooling, which lets multiple customers share CPU, GPU, FPGA, memory, storage, and bandwidth.
Additionally, AIaaS offers Rapid elasticity for quick allocation and release of computing capabilities based on demand, and Measured service to monitor and report resource utilization. Beyond these, AIaaS must also ensure abstraction for ease of use, integration for customization, security of customer data, and robust fault tolerance and scalability to provide a reliable service across diverse enterprise applications.
A comprehensive AIaaS offering should support a wide range of AI technologies, including ML/DL algorithms for Natural Language Processing (NLP), speech recognition, computer vision, and pattern recognition, along with tools for data handling, model training, optimization, explainable AI, and visualization. This enables automation, error reduction, and increased efficiency across enterprise workflows.
Addressing Key Challenges in AIaaS Deployment
Widespread AIaaS adoption faces several critical challenges. Privacy is paramount, requiring robust security measures for data at rest, in transit, and at the central server, including encryption, federated learning, differential privacy, and secure computing hardware, especially for sensitive data. Latency, particularly for real-time applications, is a major hurdle due to communication network congestion and intermediary hops between client devices and central AI systems. Efficient data compression and local processing are crucial here.
Resource Allocation and Optimization remains a complex problem, balancing costs, inference latency, accuracy, and energy consumption across cloud, fog, and edge layers. Data-driven solutions and protection against cyber attacks on allocation models are essential. Furthermore, Data Silos create barriers to information sharing and can degrade AI model accuracy, necessitating curated, structured data sets. Finally, maintaining Accuracy with decentralized AIaaS, heterogeneous data sources, and offloaded tasks requires innovative techniques like federated learning and continuous resource upgrades to ensure reliable outputs.
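Federated learning, cited above for both privacy and decentralized accuracy, can be illustrated with a minimal federated-averaging (FedAvg) loop: clients fit models locally on private data and share only weights, which a server averages. The sketch below uses synthetic linear-regression clients; the data, learning rate, and round counts are illustrative assumptions, not figures from the paper.

```python
# Minimal sketch of federated averaging (FedAvg): clients train locally on
# private data and share only model weights, never raw records.
# Toy linear-regression clients; all data here is synthetic.
import numpy as np

rng = np.random.default_rng(0)
true_w = np.array([2.0, -1.0])

def make_client(n=100):
    X = rng.normal(size=(n, 2))
    y = X @ true_w + rng.normal(scale=0.1, size=n)
    return X, y

clients = [make_client() for _ in range(5)]
global_w = np.zeros(2)

for round_num in range(20):
    local_ws = []
    for X, y in clients:
        w = global_w.copy()
        for _ in range(10):                      # local SGD steps
            grad = 2 * X.T @ (X @ w - y) / len(y)
            w -= 0.05 * grad
        local_ws.append(w)
    global_w = np.mean(local_ws, axis=0)         # server-side averaging

print("aggregated weights:", global_w)           # approaches [2, -1]
```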
[Flow diagrams from the original page, not reproduced here: "Automated Hardware Configuration Flow" and "Generative Chatbot Response Generation Flow".]
Factor | Generative Chatbot Requirements | On-Premises Deployment | AIaaS Deployment
---|---|---|---
Hardware Requirements | LLMs (e.g., GPT-3, GPT-4) require ~10^15 FLOPs, large memory (150-200 GB), and high bandwidth for low latency. | Requires significant up-front investment, ongoing upgrades, and tolerance for service disruptions. | Cloud-based AIaaS offers on-demand computing resources, GPUs, and optimal resource utilization based on computation needs.
Skill Requirement | ML/DL tasks, data preparation, speech/text processing, tuning, and encoding/decoding of user queries. | Developing and fine-tuning models in-house requires a range of skilled personnel to deliver low-latency service. | AIaaS offers pre-trained models for AI tasks with data-driven service optimization.
Privacy and Security of Data | Personally identifiable and sensitive information poses privacy challenges (e.g., for chatbot training and optimization). | Fewer privacy challenges if models are trained in-house, but at added data-security cost. | AIaaS provides privacy and data security via federated learning, synthetic data, anonymized data, or collaborative learning.
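The hardware figures in the first row follow from simple arithmetic: weight memory is roughly parameter count times bytes per parameter. The sketch below reproduces that back-of-envelope estimate for a GPT-3-scale model; it covers weights only (activations, KV cache, and runtime overhead add more), and the numbers are illustrative rather than taken from the paper.

```python
# Back-of-envelope memory estimate for serving a large language model:
# parameters x bytes-per-parameter (weights only; activations, KV cache,
# and runtime overhead add more). Figures are illustrative.
def weight_memory_gb(n_params: float, bytes_per_param: float) -> float:
    return n_params * bytes_per_param / 1e9

n = 175e9  # GPT-3-scale parameter count (illustrative)
print("fp32:", weight_memory_gb(n, 4), "GB")  # ~700 GB
print("fp16:", weight_memory_gb(n, 2), "GB")  # ~350 GB
print("int8:", weight_memory_gb(n, 1), "GB")  # ~175 GB, near the table's range
```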
Attribute | Cloud | Fog | Edge
---|---|---|---
Data analytics | Big-data inference using high-performance computing. | Data aggregation from multiple sources; distributed processing across multiple fog nodes. | Real-time analysis on edge devices, focused on a limited cluster of IoT devices; less distributed.
Data sensitivity | Less sensitive data. | Critical data that may or may not include sensitive data, depending on the application. | Sensitive data analytics.
Computing power | High | Limited | Limited
Latency | Highest (~20-100+ ms) | Medium (~5-20 ms) | Lowest (~1-5 ms or less)
Bandwidth | High (~10-100+ Gbps) | Medium (~1-10 Gbps) | Low (~10-1000 Mbps)
Proximity | Far from the edge | Nodes closer to the edge | Usually one hop from end devices
Scalability | High | Scalable within the network | Hard to scale
Resource requirement | Highly resource-intensive | Resource-intensive | Less resource-intensive
Storage | Stores large amounts of data for long periods. | Aggregates data from numerous IoT devices, either serving quick data-driven services or forwarding data to the cloud for complex processing. | Limited storage; usually collects data from fewer IoT devices for low-latency services.
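One practical use of this comparison is a first-pass placement heuristic: pick the lowest tier whose typical latency band meets the application's budget, keeping sensitive data at the edge. The helper below simply mirrors the table's bands; it is an illustrative sketch, and real placement decisions also weigh bandwidth, cost, and scalability.

```python
# Toy placement heuristic derived from the comparison table above:
# pick the lowest tier whose typical latency band meets the budget.
# Thresholds mirror the table; real deployments weigh many more factors.
def pick_tier(latency_budget_ms: float, data_is_sensitive: bool) -> str:
    if latency_budget_ms < 5 or data_is_sensitive:
        return "edge"        # ~1-5 ms; sensitive data stays local
    if latency_budget_ms < 20:
        return "fog"         # ~5-20 ms; aggregation near the source
    return "cloud"           # ~20-100+ ms; big-data analytics

print(pick_tier(2, False))    # edge
print(pick_tier(15, False))   # fog
print(pick_tier(80, False))   # cloud
```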
Case Study: Generative Chatbot on AIaaS
The article highlights generative chatbots as a key application for AIaaS, leveraging Natural Language Processing (NLP) and Machine Learning/Deep Learning (ML/DL) models. AIaaS provides a scalable, cost-effective platform to deploy and manage these complex AI tasks, supporting every stage from text pre-processing, tokenization, and model training to inference and continuous improvement via feedback loops. The architecture supports containerization (Docker, Kubernetes) and automated resource scaling (Horizontal/Vertical Pod Autoscaler) to meet demand efficiently. Compared with on-premises deployment, AIaaS offers significant benefits in resource optimization, cost-effectiveness, and privacy handling via methods such as federated learning, synthetic data, or anonymized data.
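The request path described above can be sketched as a small serving loop: pre-process, tokenize, infer, and log feedback for continuous improvement. The tokenizer and model below are stand-in stubs introduced for illustration, not a real LLM or the paper's architecture.

```python
# Skeleton of the chatbot serving loop described above: pre-process,
# tokenize, infer, post-process, and log feedback for later fine-tuning.
# The tokenizer and model are stand-in stubs, not a real LLM.
from typing import List

def preprocess(text: str) -> str:
    return " ".join(text.lower().split())        # normalize case/whitespace

def tokenize(text: str) -> List[str]:
    return text.split()                          # placeholder tokenizer

def infer(tokens: List[str]) -> str:
    return f"(response for {len(tokens)} tokens)"  # placeholder model call

feedback_log = []                                  # fed back into retraining

def handle_request(user_text: str) -> str:
    reply = infer(tokenize(preprocess(user_text)))
    feedback_log.append((user_text, reply))        # continuous-improvement loop
    return reply

print(handle_request("Hello, how do I reset my password?"))
```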
Case Study: AIaaS in Healthcare (Fog/Edge)
AIaaS is critical for healthcare, especially with Fog/Edge deployment, enabling low-latency, real-time decision-making for sensitive medical data. The article mentions AI-based Fog computing solutions for healthcare provisioning systems that collect patient data closer to the subject, apply AI techniques to classify health status, and synchronize with a central cloud database. This approach optimizes resource utilization and ensures data privacy closer to the source. Specific use cases from the research include IoT-based emotion detection in smart health systems to identify distress and provide medical care, and processing fuzzy-based decisions for healthcare provisioning, all benefiting from the proximity and low-latency capabilities of Fog and Edge AIaaS.
Case Study: AIaaS for Autonomous Physical Security
Leveraging smart surveillance cameras, AIaaS enables autonomous physical security systems to perform critical operations such as facial recognition, re-identification, and traffic monitoring. The article details the use of lightweight DL algorithms (like SqueezeNet and MobileNet) at the edge to pre-detect target objects and extract desired video segments. This local processing reduces data transfer to the cloud, enhancing privacy and lowering latency for critical real-time security responses. Complex processing can be offloaded to fog servers or cloud resources. AIaaS facilitates the fine-tuning of DL networks and improves detection accuracy, ensuring robust and efficient security operations.
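A minimal sketch of that edge pre-detection pattern appears below: a lightweight CNN (MobileNetV2 via torchvision, standing in for the paper's SqueezeNet/MobileNet examples) scores frames locally, and only segments above a confidence threshold are offloaded for heavier fog or cloud processing. The frames, threshold, and offload stub are illustrative assumptions.

```python
# Sketch of edge pre-detection: a lightweight model (MobileNetV2 here)
# scores each frame locally, and only "interesting" segments are offloaded
# to fog/cloud for heavier analysis. Frames and threshold are synthetic.
import torch
import torchvision.models as models

model = models.mobilenet_v2(weights=None)  # lightweight CNN (torchvision >= 0.13)
model.eval()

def frame_score(frame: torch.Tensor) -> float:
    with torch.no_grad():
        logits = model(frame.unsqueeze(0))
        return torch.softmax(logits, dim=1).max().item()  # top-class confidence

OFFLOAD_THRESHOLD = 0.02  # illustrative only; tune on real footage
frames = [torch.rand(3, 224, 224) for _ in range(4)]  # stand-in video feed

for i, frame in enumerate(frames):
    if frame_score(frame) > OFFLOAD_THRESHOLD:
        print(f"frame {i}: offload segment to fog/cloud")  # heavy processing
    else:
        print(f"frame {i}: discard locally")               # saves bandwidth
```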
Your AIaaS Implementation Roadmap
Embark on a structured journey to integrate AIaaS into your enterprise. Our phased approach ensures a smooth transition and maximizes your return on investment.
Phase 1: Discovery & Strategy Formulation
Conduct a comprehensive assessment of existing infrastructure, data sources, and business objectives. Define clear AIaaS use cases, identify key stakeholders, and establish success metrics. Develop a tailored strategy aligned with organizational goals and compliance requirements.
Phase 2: Pilot Program & Vendor Selection
Identify critical AI applications suitable for a pilot program, focusing on high-impact, low-risk areas. Evaluate AIaaS providers based on capabilities (ML/DL, NLP, vision), security features (federated learning, encryption), scalability, and support for hybrid cloud/edge deployments. Negotiate SLAs and initial resource allocations.
Phase 3: Data Integration & Model Development
Securely integrate enterprise data sources with the chosen AIaaS platform, addressing data privacy and silo challenges. Develop or adapt AI models for specific use cases, leveraging pre-trained models and custom training. Implement data preprocessing, feature engineering, and continuous model optimization techniques.
Phase 4: Deployment & Operationalization
Deploy AI models via APIs, ensuring seamless integration with existing applications. Establish dynamic resource allocation, monitoring, and metering for optimal performance and cost control. Operationalize AIaaS applications, train internal teams, and set up continuous feedback loops for performance improvement and concept drift management.
Phase 5: Scaling & Advanced Optimization
Expand AIaaS adoption across the enterprise, identifying new use cases and integrating advanced AI capabilities. Continuously monitor model accuracy, latency, and resource utilization. Explore advanced techniques like multi-hop computational offloading and hierarchical federated learning for further optimization and efficiency gains across cloud, fog, and edge environments.
Ready to Transform Your Enterprise with AIaaS?
The future of AI-driven operations is here. Don't miss out on the competitive advantages of scalable, secure, and efficient AI. Schedule a personalized consultation to explore how AIaaS can elevate your business.