Edit

Get started with AI architecture design

AI enables machines to analyze data, generate content, synthesize speech, make predictions, and support decision-making across industries. As AI capabilities expand through generative models, language models, and agent-based architectures, organizations need architectural guidance to design reliable, secure, cost-effective, and operationally stable AI workloads.

This article helps you get started with AI on Azure. It introduces AI services, reference architectures, best practices, and learning resources so that you can design and build AI solutions that meet your workload requirements.

Azure services for AI

Azure provides a range of services for building, deploying, and managing AI workloads. These services span development platforms, prebuilt AI capabilities, data platforms, and tools for creating custom models. Use the following services to integrate AI into your workload design.

AI development platforms

  • Microsoft Foundry: A unified platform as a service (PaaS) for developing and deploying generative AI applications and agents. Microsoft Foundry provides access to a model catalog, agent hosting through Foundry Agent Service, fine-tuning, evaluation tools, and responsible AI capabilities. Use the Foundry portal to experiment, build, and deploy AI models and agents.

  • Azure Machine Learning: A cloud service for building, training, and deploying machine learning models at scale. Azure Machine Learning supports open-source frameworks like PyTorch, TensorFlow, and scikit-learn. It also provides capabilities for AutoML, hyperparameter tuning, distributed training, machine learning operations, and responsible AI.

  • Microsoft Copilot Studio: A low-code platform for building, customizing, and deploying AI-powered agents. Use Microsoft Copilot Studio to create conversational agents for internal and external scenarios and to extend Microsoft 365 Copilot with enterprise data and custom workflows.

Prebuilt AI services

  • Foundry Tools: A suite of prebuilt and customizable APIs and models for adding intelligent features to applications. Foundry Tools includes capabilities for speech, translation, language understanding, document intelligence, content understanding, vision, content safety, and search.

  • Azure OpenAI: Provides access to OpenAI models, including GPT and DALL-E, through Azure-managed infrastructure with enterprise security, networking, and responsible AI controls.

Data platforms for AI

  • Microsoft Fabric: An end-to-end analytics and data platform that covers data ingestion, transformation, real-time event routing, and reporting. Microsoft Fabric provides OneLake as a unified data lake and includes embedded AI capabilities, Microsoft Copilot features, and integration with Foundry Tools.

  • Azure Databricks: A Spark-based analytics platform for data engineering, data science, and machine learning. Azure Databricks provides Databricks Runtime for Azure Machine Learning, MLflow integration, AutoML, foundation model fine-tuning, and Mosaic AI Vector Search for embedding-based retrieval.

  • Azure HDInsight: A managed Apache Spark service for big data processing and analytics. Azure HDInsight Spark clusters support machine learning workloads through MLlib, are compatible with Azure Storage and Azure Data Lake Storage, and support SynapseML for deep learning.

  • Azure Data Lake Storage: A scalable, centralized repository for storing structured and unstructured data. Azure Data Lake Storage provides file system semantics, file-level security, and tiered storage built on Azure Blob Storage.

Enterprise Intelligence Layers

AI architecture at Microsoft introduces three complementary intelligence layers or IQs that represent key sources of context for AI systems:

  • Work IQ captures intelligence about how work happens inside an organization. It uses signals from Microsoft 365 such as emails, chats, meetings, documents, and collaboration patterns.

    This layer is essential for grounding AI in human activity and organizational behavior rather than treating interactions as isolated prompts.

  • Fabric IQ provides intelligence derived from structured enterprise data managed in Microsoft Fabric, including analytics models, key performance indicators (KPIs), and business entities.

    Fabric IQ enables AI to answer complex analytical questions and interpret results in the context of business operations.

  • Foundry IQ provides a unified, multisource knowledge layer that allows AI agents to retrieve and ground responses in enterprise data.

    This layer is critical for implementing patterns like retrieval-augmented generation (RAG) to help ensure that AI outputs are accurate, current, and compliant.

Architecture

The following diagram shows a baseline end-to-end chat architecture that uses Microsoft Foundry. This reference architecture demonstrates how AI services comprise a production-ready solution on Azure. It includes identity, networking, monitoring, and governance layers.

Diagram that shows a baseline end-to-end chat architecture that uses Microsoft Foundry.

The diagram presents a detailed Azure architecture for deploying an AI solution. On the left, a user connects through an application gateway with a web application firewall (WAF), which is part of a virtual network. This gateway links to private Domain Name System (DNS) zones. Azure DDoS Protection protects the gateway. Below the gateway, private endpoints connect to services like Azure App Service, Azure Key Vault, and Azure Storage, which are used for client app deployment. Azure App Service is managed with identity and spans three zones. Application Insights and Azure Monitor provide monitoring, and Microsoft Entra ID handles authentication. To the right, the virtual network has several subnets: App Service integration, a private endpoint, Microsoft Foundry integration, Azure AI agent integration, Azure Bastion, a jump box, build agents, and Azure Firewall. Each subnet hosts specific endpoints or services, like storage, Microsoft Foundry, Azure AI Search, Azure Cosmos DB, and knowledge stores, that connect via private endpoints. Outbound traffic from the network passes through Azure Firewall to reach internet sources. To the far right, a separate box represents Microsoft Foundry, which includes an account and a project. Managed identities connect Foundry Agent Service to the Microsoft Foundry project, which in turn accesses Azure OpenAI. The diagram uses numbered circles to indicate the logical flow, which shows how user requests traverse the network, interact with different endpoints, and connect to Foundry Tools and storage.

Download a Visio file of this architecture.

The previous diagram demonstrates a typical baseline AI implementation. For real-world solutions that you can build in Azure, see AI architectures.

Explore AI architectures and guides

The articles in this section include guides and fully developed architectures that you can deploy in Azure and expand to production-grade solutions. Solution ideas demonstrate implementation patterns and possibilities to consider as you plan your AI proof-of-concept (POC) development. These articles can help you decide how to use AI technologies in Azure.

AI guides

The following article helps you evaluate and select the best AI technologies for your workload requirements:

  • Machine learning options: Compares Azure Machine Learning products and technologies to help you choose the right platform for model training and deployment.

AI agent design

RAG solution development and evaluation

Multitenant RAG solution

Machine learning operations

Proxy generative AI models

Foundation model life cycle

AI architectures

The following production-ready architectures demonstrate end-to-end AI solutions that you can deploy and customize.

Chat with data

Document processing

Video and image classification

Audio processing

Regulatory requirements

AI solution ideas

The following AI solution ideas demonstrate implementation patterns and possibilities to explore.

Audio processing

Image processing

Predictive analytics

Machine learning operations

Document processing and enrichment

Workflow automation

Organizational readiness

Organizations at the beginning of the cloud adoption process can use the Cloud Adoption Framework for Azure to access proven guidance that accelerates cloud adoption.

For AI-specific adoption guidance, see the following Cloud Adoption Framework resources:

  • AI adoption: Provides a structured process for adopting AI solutions in Azure, including strategy, planning, readiness, governance, security, and management.

  • AI ready: Outlines the organizational process for building AI workloads in Azure, including resource organization, networking, reliability, and governance.

To help ensure the quality of your AI solution on Azure, follow the guidance in the Azure Well-Architected Framework. The Well-Architected Framework provides prescriptive guidance for organizations that seek architectural excellence and describes how to design, provision, and monitor cost-optimized Azure solutions.

For AI-specific guidance, see the following Well-Architected Framework resource:

  • AI workloads on Azure: Addresses architectural challenges of designing AI workloads, including nondeterministic functionality, data and application design, and operations across the five Well-Architected Framework pillars.

For AI-related Well-Architected Framework service guides, see:

Best practices

Follow these best practices to improve the reliability, security, cost effectiveness, operational quality, and performance of your AI workloads on Azure:

Stay current with AI

Azure AI services evolve to address modern data challenges. Stay informed about the latest updates and features.

To stay current with key AI services, see the following articles:

Amazon Web Services (AWS) or Google Cloud professionals

To help you ramp up quickly, the following articles compare Azure AI options to other cloud services:

  • Data and AI: Compares AWS data and AI services, including machine learning, generative AI, and AI platform services, to their Azure counterparts.

  • AI and machine learning: Compares Google Cloud AI and machine learning services, including Vertex AI, Dialogflow, and Gemini, to their Azure counterparts.