Explore Azure AI Services: Curated list of prebuilt models and demos

by info.odysseyx@gmail.com · September 3, 2024

Azure AI services provide a comprehensive suite of pre-built models and demos designed to address a wide range of use cases. These models are easily accessible and enable seamless implementation of AI-based solutions. WWe’ve curated and cataloged pre-built demos available across Azure AI services to help you seamlessly infuse AI into your products and services.

Voice Recognition

Scenario for converting speech to text

script	explanation	link
Convert real-time speech to text	Quickly test audio on your speech recognition endpoint without writing any code.	Explore the Demo
Whisper model on Azure OpenAI service	Transcribe and translate audio content from 57 languages into English using the OpenAI Whisper v2-large model.	Explore the Demo
Convert bulk speech to text	Asynchronously transcribe large amounts of audio from a repository.	Explore the Demo
Custom voice	Improve speech recognition accuracy by leveraging domain-specific vocabulary and data.	Explore the Demo
Pronunciation Assessment	Evaluate the accuracy and fluency of your spoken pronunciation and get feedback.	Explore the Demo
Voice Translation	Translate your speech into another language in real time with low latency.	Explore the Demo

Text to speech

Text to speech scenario

script	explanation	link
Voice Gallery	Create natural-sounding voices by choosing from 486 voices across 148 languages and language variants.	Explore the Demo
Custom neural voice	Create natural-sounding synthetic voices based on human voice recordings.	Explore the Demo
personal voice	Generate AI voices from human voice samples for personalized voice experiences.	Explore the Demo
Audio content creation	Build highly natural audio content for a variety of scenarios, including audiobooks, video narration, and more.	Explore the Demo
Text to speech avatar	Turn text into video using AI-generated avatars and realistic voices.	Explore the Demo

Another scenario


Captions that convert speech to text	Learn how to use Azure Speech to transcribe audio from movies, videos, live events, and more, and automatically caption your content in real time and offline using a sample application. Display the resulting text on screen to provide an accessible experience. This example converts speech to text and leverages features like phrase lists.
After the call Transcription and Analysis	Batch transcribe call center recordings and extract valuable information such as personally identifiable information (PII), sentiment, and call summaries. This shows how to analyze call center conversations using speech and language services.
	Have natural conversations with an avatar that recognizes your voice input and responds fluently with a realistic AI voice.
	Get instant feedback on your pronunciation accuracy, fluency, intonation, grammar, vocabulary, and more via chat.
	Automatically and seamlessly translate and generate videos in multiple languages. Powerful features help you efficiently localize your video content for diverse audiences around the world.

Vision Studio

Vision-based scenarios

script	explanation	link
Video Search and Summary	Quickly summarize key points in a video and search for specific moments.	Explore the Demo
Customize your model with images	Locate specific objects within images for use cases such as product placement and assembly line inspection.	Explore the Demo
Add dense captions to images	Generate human-readable captions for all important objects detected in an image.	Explore the Demo
Remove background from image	Easily remove backgrounds and preserve foreground elements.	Explore the Demo
Add a caption to an image	Generate human-readable sentences that describe the content of an image.	Explore the Demo
Detect common objects in images	Detect and extract bounding boxes of recognizable objects and living things.	Explore the Demo
Extract text from images	Extract printed or handwritten text from images, PDFs, and TIFF files using OCR.	Explore the Demo
Extract common tags from images	Extract tags based on recognizable objects, scenes, and actions.	Explore the Demo
Create smart cropped images	Automatically crops images to highlight the most important areas.	Explore the Demo
Detect faces in images	Detects the location and properties of human faces in images.	Explore the Demo
Counting people in an area	Analyze video to count the number of people in a specified area.	Explore the Demo
Detect when people cross the line	Detects when a person crosses a line in the camera’s field of view.	Explore the Demo

Language Studio

Language processing scenario

script	explanation	link
Extract PII	Identify sensitive personally identifiable information (PII) in text.	Explore the Demo
Extract key phrases	Quickly identify key points in unstructured text.	Explore the Demo
Finding connected entities	Link to a knowledge base to clarify the identity of entities found in text.	Explore the Demo
Extract named entities	Identify and classify entities in text using Named Entity Recognition (NER).	Explore the Demo
Extract health information	Extract and label medical information from unstructured text.	Explore the Demo
Analysis of emotions and opinions	Provides sentiment labels and confidence scores at sentence and document level.	Explore the Demo
Language detection	Determines the language used in the input document and returns a confidence score.	Explore the Demo
Custom Text Classification	Create a custom text classification project using labeled data and a trained model.	Explore the Demo
Answer the question	Extracts answers to questions from provided text.	Explore the Demo
Conversational Language Understanding Project	Build your project using labeled data and trained models to understand conversational language.	Explore the Demo
Orchestration Project	Build and manage projects that integrate multiple language services.	Explore the Demo
Summary of information	Use the Summary API to generate summaries of conversations or documents.	Explore the Demo
	Bulk translate documents into one or more languages from local storage or Azure Blob storage.

Document Intelligence

Document Analysis Scenario

script	explanation	link
read	Extract printed and handwritten text, barcodes, and formulas from documents.	Explore the Demo
Listed carefully	Extract tables, checkboxes, and text from forms and images.	Explore the Demo
General Documents	Extract key-value pairs and structures from any form or document.	Explore the Demo

Pre-built model scenarios


	Extract invoice details including customer and supplier details, total amount, and line items.
	Extract transaction details from receipts, including date, seller information, and total amount.
	Extract details from passports and ID cards.
US Health Insurance Card	Extract details from your US health insurance card.
	Classify and extract information from documents including W2, 1040, 1098, and 1099.
	Extracting information from various mortgages
	Extract employee information and payroll information including income, deductions, and net pay.

	Extracts amount, date, on-demand MICR number, player’s name and address, etc.
	Extract details from marriage certificate.
	Extract credit card details including card number and cardholder name.
	Extracts rights holder and signatory information from contracts.
	Extract contact information from business cards.

Gen-AI Safety Solutions

safeYour protection Image content


	A tool to evaluate various content moderation scenarios. Consider a variety of factors, including content type, platform policies, and potential impact to users. Run moderation tests on sample content. Rerun and refine the test results using Configure filters. Add specific terms to the blocklist that you want to detect and take action on.
Intermediate multimodal content	Run moderation tests on content that combines images and text. Evaluate test results based on the severity detected.

Protect youR Text


	Run moderation tests on text content. Evaluate test results by detected severity.
	Grounding detection detects ungrounding generated from large-scale language models (LLMs).
Detect protected data	Detect and protect third-party text materials in LLM modules.
	Prompt Shields provides a unified API to handle the following types of attacks: jailbreak attacks and indirect attacks.

Real-time safety measures


	This shows API usage, review results, and category distribution. You can customize the severity threshold for each category to see updated results and distribute new thresholds to you. You can also edit the block list on this page to respond to all incidents.

Voice Recognition

Scenario for converting speech to text

Text to speech

Text to speech scenario

Another scenario

Vision Studio

Vision-based scenarios

Language Studio

Language processing scenario

Document Intelligence

Document Analysis Scenario

Gen-AI Safety Solutions

safeYour protection Image content

Protect youR Text

Real-time safety measures

Our Company

About Links

Useful Links

Newsletter

Laest News

Explore Azure AI Services: Curated list of prebuilt models and demos

Voice Recognition

Scenario for converting speech to text

Text to speech

Text to speech scenario

Another scenario

Vision Studio

Vision-based scenarios

Language Studio

Language processing scenario

Document Intelligence

Document Analysis Scenario

Gen-AI Safety Solutions

safeYour protection Image content

Protect youR Text

Real-time safety measures

Restoring Soft-Deleted Blobs with multithreading in Azure Storage Using C#

Microsoft Security Exposure Management Graph: Prioritization is the king

You may also like

Leave a Comment Cancel Reply

Our Company

About Links

Useful Links

Newsletter

Laest News