Quick Setup
- Prerequisites - to setup AIFactory - Estimated setup time: 1-2h
- End-2-End SETUP tutorial - AIFactory + 1 ESMLProject - Estimated setup time: 4-8h
Governance related - relevant for central IT, networking team (CoreTeam: 10-29)
- 11) Infra:AIFactory: Static documentation (CoreTeam)
- 12) Infra:AIFactory: Roles & Permissions for users (CoreTeam)
- 13) Infra:AIFactory: Flow diagrams (CoreTeam)
- 14) Infra:AIFactory: Networking (CoreTeam)
- 15) Infra:AIFactory: Overview of services: Naming convention(CoreTeam)
- 21) Infra:AIFactory: How-to: Onboarding of CoreTeam users and ProjectMembers via Pipelines (CoreTeam)
- 22) Datalake template: How-to: Setup Datalake & Onboard ProjectTeam permissions (CoreTeam)
- 23) DataOps template: How-to: Setup DataOps via PIPELINE templates (CoreTeam)
Consumer related - relevant for developers, data scientists, data engineers (ProjectTeam: 30-39)
- 30) Usage: Dashboard, Available Tools & Services, DataOps, MLOps, Access options to AIFactory (ProjectTeam)
- 31) How-to guide: Get access to the AIFactory:RBAC & Networking (ProjectTeam)
- 32) Overview: Dashboards, Services & Acceleration in AIFactory (ProjectTeam, CoreTeam)
- 33) Setup: Install AzureML SDK v1+v2 and ESML accelerator library
- 34) Setup: Datalake: How-to onboard your own data, to project folder R&D purpose
- 35) Setup: ESML SDK accelerated Notebook templates
- 36) How-to guide: DataOps
- 37) How-to guide: MLOps
- 38) How-to guide: LLMOps & RAG Chat Agent
- 39) End-2-End config tutorial - ESML Project, ESGenAI Project - Estimated config time: 1-2h
FAQ - Trouble shooting - relevant for all
- FAQ - Core team & AFactory infra
- FAQ - Data scientist & Azure ML pipelines
- FAQ - Data engineering & Azure ML pipelines
This is the main page for all documentation, with links to underlying specifics. The docs pages is sorted within a number series, by role and by component.
Example: Series 10-29 is targeting the AIFactory CoreTeam, e.g. governance of the AIFactory. Series 30-39 is relevant for AIFactory ProjectTeams, e.g. the consumers of the AIFactory services & accelerators.
Doc series | Role | Focus | Details |
---|---|---|---|
10-19 | CoreTeam |
Governance |
Setup of AI Factory. Governance. Infrastructure, networking. Permissions |
20-29 | CoreTeam |
Usage |
User onboarding & AI Factory usage. DataOps for the CoreTeam's data ingestion team |
30-39 | ProjectTeam |
Usage |
Dashboard, Available Tools & Services, DataOps, MLOps, Access options to the private AIFactory |
40-49 | All |
FAQ |
Various frequently asked questions. Please look here, before contacting an ESML AIFactory mentor. |
All 4 can be used, or optionally cherry picked. The 1st component, Infra:AI Factory, is a pre-requsite. Below the 4 components are seen.
This table will be used in the documentation to clarify WHAT a section covers, and for WHOM/Role
Component | In section | Focus in section | Role | Doc series |
---|---|---|---|---|
1) Infra:AIFactory | Y | - | CoreTeam | 10-19 |
2) Datalake template | Y | - | All | 20-29,30-39 |
3) Templates for: DataOps, MLOps, *LLMOps | Y | - | All | 20-29,30-39 |
4) Accelerators: ESML SDK (Python, PySpark), RAG Chatbot, etc | Y | - | ProjectTeam | 30-39 |
Here you will see the definition of an AIFactory via lists and diagrams, and workflows how to add projects or members to an AIFactory.
- High-Level Diagram - AIFactory Capabilities
- High-Level Diagram - ESML project: Overview services
- High-Level Diagram - ESGenAI project: Overview services
- Mid-Level Diagram - Azure Services integration:ESML
- Mid-Level Diagram - Azure Services integration:ESGenAI
- Low-Level Diagram - Infrastructure & LLMOps
- Low-Level Diagram - Infrastructure & MLOps
- Networking Diagram - Hub-Spoke, ESLZ, Private DNS, FW
- Network Connectivity
- Network topology: Hub/Spoke | VirtualWan (Vwan Hub)
- Firewall: What ports needs to be opened?
- User access: Direct via corp network, VPN from home, Bastion "jumphost" for admins
- Network Connectivity
All Diagrams: Architecture & Services - High-Level & Low Level Diagrams
Detailed information about Roles and permission, such as Microsoft Entra ID
Service Principals (Automation & Ops purpose) & Permissions for Users, AD groups.
Permissions & Roles: Coreteam VS Project team
Flow diagram which can explains the workflows, how to utilize the complete solution(AI Factory, MLOps Accelerator) in different scenarios.
HOWTO - Networking: Hybrid access, Private DNS Zones, etc
Component | In section | Focus in section | Role in section | Index |
---|---|---|---|---|
1) Infra:AIFactory | Y | Usage & Onboard teams | CoreTeam:Infra | 21 |
2) Datalake template | Y | Setup | CoreTeam:Infra | 22 |
3) Templates for: DataOps, MLOps, *LLMOps | Y | DataOps | CoreTeam: DataIngestion | 23 |
4) Accelerators: ESML SDK (Python, PySpark), RAG Chatbot, etc | N | - | - | - |
User onboarding, permissions and usage howto. There are 2 roles in the AIFactory, here the different permissions roles have is explained.
- AIFactory CoreTeam:
- AIFactory ProjectTeam:
21) Infra:AIFactory
- How-to: Onboarding of CoreTeam users and ProjectMembers via PIPELINES (CoreTeam)
See Roles and permissions. See Flow Diagram - Add AIFactory project, Add users
- Workflows:
-
- Terraform+GithubActions
- Workflows:
-
- Terraform+AzureDevops
Here you can find HOWTO guides for the ESML CoreTeam, how to setup the Datalake structure, and how to provide a ProjectTeam access to their datalake projectfolder, by running a pipeline (ADO, GHA)
- What Permissions: This is the datalake access users will get
- How-to: Add users to AIFactory project, gets them correct Datalake folder access automatically
Here you can find HOWTO guides for the ESML CoreTeam, its Dataingestion team within the CoreTeam.
Prerequisites - to setup AIFactory
Here is an End-2-End setup tutorial, with an estimated ~4-8 hour setup time.
- Estimated setup time is 3-7 hours, to have the full AIFactory automation configured, and create "AIFactory Common DEV" + the 1st AIFactory project created (type:ESML)
- Estimated setup time 1 hour: Configure AIFactory Common DEV + the 1st ESMLProject (type: ESML)
After the setup, you can simply click on a pipeline to provision 1 or 250 AIFactory project architectures, of type ESML or ESGenAI.
30) Usage: Dashboard, Available Tools & Services, DataOps, MLOps, Access options to AIFactory (ProjectTeam)
Here you can find HOWTO guides for a ESML ProjectTeam, including its DataOps, MLOps, supported use cases, accelerated use cases. Also how to get access to the private AIFactory.
Component | In section | Focus in section | Role in section | Index |
---|---|---|---|---|
1) Infra:AIFactory | Y | Usage & Get Access, Dashboards, Services | ProjectTeam | 31,32 |
2) Datalake template | Y | Usage & Get Access | ProjectTeam | 34 |
3) Templates for: DataOps, MLOps, *LLMOps | Y | DataOps | ProjectTeam | 36,37 |
4) Accelerators: ESML SDK (Python, PySpark), RAG Chatbot, etc | Y | Setup SDK & Templates | ProjectTeam | 33,35 |
-
Talk to your AIFactory CoreTeam. Ask them to onboard you, as per section 20)
-
Depending on your AIFactory setup, you may need to take additional actions to get network access.
- A) AIFactory, isolated mode (not peered): HOWTO - Bastion acccess
- B) AIFactory, peered mode, corp network: - No action needed. Line of sight exists already.
- C) AIFactory, peered mode, corp VPN: - No action needed. Line of sight exists already
- Pre-req: Same as B)
There are multiple ESML AIFactory dashboards available, that the coreteam shared to you.
How-to guide: IMPORT AIFacotory dashboards or select them, clone them, to customize them further
There are Azure services packaged both for DataOps, MLOps, and Generative AI.
SERVICES LIST: Overview of the services, as a list, with naming convetions
SERVICES ARCHITECTIRE - architectural diagrams
Here you can find information about supported use cases, and accelerated use cases.
HOWTO - Supported use cases & Accelerated use cases
Here you see HOWTO install AzureML SDK v1+v2 and ESML accelerator library, locally, or at an Azure Machine Learning Compute Instance.
HOWTO - install AzureML SDK v1+v2 and ESML accelerator library
- Prerequisite: Datalake access HOW-TO: Setup Datalake & Onboard ProjectTeam permissions (CoreTeam)
- HOWTO - Quickstart: Onboard own data, to the datalake
The ESML AIFactory comes with Notebook templates, generic notebooks (not examples). See below: How to clone them, and configure them, to accelerate your projects. It also comes with DEMO data, per project team.
Pre-requisite: Get Datalake access - See section 34
HOWTO - Quickstart: Copy AI Factory notebook templates
HOWTO - Quickstart: Configure templates, and data settings - train model with own data
HOWTO - Quickstart: DataOps with Azure Datafactory ESML templates
-
START HERE - WHAT is accelerated & WHAT is vanilla Azure ML, Azure Databricks?
-
HOWTO - Quickstart: Accelerate MLOps - with ESML boost on Azure Machine Learning and Databricks
-
HOWTO - Quickstart: MLOps with Azure Machine Learning and Databricks
!WIP! - HOWTO - Quickstart: LLMOps & RAG Chat Agent
ESML Project: Here is an end-2-end setup tutorial, for DataOps and MLOps
- Estimated time to setup is 1-2 hours
- ESML project configured with both DataOps and MLOps - retrained on "data changed" and on "code changed"
- Retraining DEMO-model on "data changed" and on "code changed" (CI/CD trigger)
- Deployed as online endpoint, and batch endpoint
- !WIP! - Quickstart: MLOps with Azure Machine Learning and Databricks
ESGenAI Project: Here is an end-2-end setup tutorial, for ESML GenAI project
- Estimated time to setup is 1-2 hours
- ESGenAI project configured with "on your data" with Azure AI Search, Promptflow with multiple indexed.
- Re-indexed on "code changed" (CI/CD trigger on promptflow) and redeployed endpoint
- Deployed as an online endpoint, with possibility to scale with multiple backends (round-robin)
- Batch Evaluation & Live Monitoring
- !WIP! HOWTO - Quickstart: MLOps with Azure Machine Learning and Databricks
[TODO - Link]
Here you can browse some Q's, and jump into relevant section. We tried to group the FAQ's into different roles & services
Example quetions:
- Q: How-to clone repo with submodule to local computer? the folder
azure-enterprise-scale-ml
is empty? - Q: Why can't it find the path in my submodule?
- Q: DataOps: How to work with Azure DataFactory, and branching?
- Q: MLOps: Azure Devops and GIT Branching stragey - DEV, TEST, PROD, many models branches?
Example quetions:
- Q: How-to clone repo with submodule to local computer? the folder
azure-enterprise-scale-ml
is empty? - Q: Why can't it find the path in my submodule?
- Q: DataOps: How to work with Azure DataFactory, and branching?
- Q: MLOps: Azure Devops and GIT Branching stragey - DEV, TEST, PROD, many models branches?
Example questions:
- Q: Why does Azure Datafactory not trigger DataMesh copy to MASTER from PROJECt, when project pipeline is finished?
- Q: How to setup eventdriven