Explore Azure AI Services: Curated list of prebuilt models and demos

by info.odysseyx@gmail.com


Azure AI services provide a comprehensive suite of prebuilt models and demos that address a wide range of use cases. These models are easily accessible and make it straightforward to implement AI-based solutions. We’ve curated and cataloged the prebuilt demos available across Azure AI services to help you infuse AI into your products and services.

Speech to text

Speech to text scenarios

Scenario | Description | Link
Real-time speech to text | Quickly test audio on your speech recognition endpoint without writing any code. | Explore the Demo
Whisper model in Azure OpenAI Service | Transcribe and translate audio content from 57 languages into English using the OpenAI Whisper large-v2 model. | Explore the Demo
Batch speech to text | Asynchronously transcribe large amounts of audio in storage. | Explore the Demo
Custom speech | Improve speech recognition accuracy by leveraging domain-specific vocabulary and data. | Explore the Demo
Pronunciation assessment | Evaluate the accuracy and fluency of your spoken pronunciation and get feedback. | Explore the Demo
Speech translation | Translate your speech into another language in real time with low latency. | Explore the Demo
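
Beyond the no-code demo, real-time recognition can also be reached over REST. The following is a minimal sketch rather than part of the original catalog. It assumes the Speech service REST API for short audio, a placeholder region and key, and a 16 kHz mono PCM WAV file. The helper names `build_stt_request` and `extract_text` are hypothetical.

```python
import json
import urllib.parse
import urllib.request
from typing import Optional


def build_stt_request(region: str, key: str, wav_bytes: bytes,
                      language: str = "en-US") -> urllib.request.Request:
    """Build (but do not send) a short-audio speech to text request."""
    query = urllib.parse.urlencode({"language": language, "format": "simple"})
    url = (
        f"https://{region}.stt.speech.microsoft.com"
        f"/speech/recognition/conversation/cognitiveservices/v1?{query}"
    )
    headers = {
        "Ocp-Apim-Subscription-Key": key,  # assumption: key-based auth
        "Content-Type": "audio/wav; codecs=audio/pcm; samplerate=16000",
        "Accept": "application/json",
    }
    return urllib.request.Request(url, data=wav_bytes, headers=headers, method="POST")


def extract_text(response_body: str) -> Optional[str]:
    """Return the recognized text, or None if recognition did not succeed."""
    result = json.loads(response_body)
    if result.get("RecognitionStatus") != "Success":
        return None
    return result.get("DisplayText")
```

To send the request, pass it to `urllib.request.urlopen` and hand the decoded body to `extract_text`. Longer recordings are a better fit for the batch scenario listed above.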

Text to speech

Text to speech scenarios

Scenario | Description | Link
Voice Gallery | Create natural-sounding speech by choosing from 486 voices across 148 languages and language variants. | Explore the Demo
Custom neural voice | Create natural-sounding synthetic voices based on human voice recordings. | Explore the Demo
Personal voice | Generate AI voices from human voice samples for personalized voice experiences. | Explore the Demo
Audio content creation | Build highly natural audio content for a variety of scenarios, including audiobooks, video narration, and more. | Explore the Demo
Text to speech avatar | Turn text into video using AI-generated avatars and realistic voices. | Explore the Demo
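
Text to speech requests are typically expressed in SSML, which selects a voice and carries the text to speak. The following is a minimal sketch of building such a document. The function name `build_ssml` is hypothetical, and `en-US-JennyNeural` is used only as an example voice name; pick any voice from the gallery.

```python
from xml.sax.saxutils import escape, quoteattr


def build_ssml(text: str, voice: str = "en-US-JennyNeural", lang: str = "en-US") -> str:
    """Wrap plain text in a minimal SSML document for a single voice."""
    return (
        '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
        f"xml:lang={quoteattr(lang)}>"
        # escape() keeps characters such as & and < from breaking the XML
        f"<voice name={quoteattr(voice)}>{escape(text)}</voice>"
        "</speak>"
    )
```

Escaping the input matters because user-supplied text often contains characters that are not valid inside XML.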

Other scenarios

Captioning with speech to text

Learn how to use Azure Speech to transcribe audio from movies, videos, live events, and more, and automatically caption your content in real time and offline using a sample application. Display the resulting text on screen to provide an accessible experience. This example converts speech to text and leverages features like phrase lists.
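
Displaying recognized text as captions mostly comes down to converting each result's timing into a caption format. The sketch below assumes that recognition results report offset and duration in 100-nanosecond ticks, and it uses the SubRip (SRT) format as an example target. The function names are hypothetical, and this is not the sample application's own code.

```python
TICKS_PER_MS = 10_000  # one tick is 100 nanoseconds


def ticks_to_timestamp(ticks: int) -> str:
    """Convert 100-ns ticks to an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = ticks // TICKS_PER_MS
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    seconds, millis = divmod(rest, 1_000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d},{millis:03d}"


def to_srt_cue(index: int, offset_ticks: int, duration_ticks: int, text: str) -> str:
    """Format one recognized phrase as a numbered SRT cue."""
    start = ticks_to_timestamp(offset_ticks)
    end = ticks_to_timestamp(offset_ticks + duration_ticks)
    return f"{index}\n{start} --> {end}\n{text}\n"
```

Joining the cues with blank lines between them yields a file that most video players can load as offline captions.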

Post-call transcription and analytics

Batch transcribe call center recordings and extract valuable information such as personally identifiable information (PII), sentiment, and call summaries. This sample shows how to analyze call center conversations using the Speech and Language services.

Have natural conversations with an avatar that recognizes your voice input and responds fluently with a realistic AI voice.

Get instant feedback on your pronunciation accuracy, fluency, intonation, grammar, vocabulary, and more via chat.

Automatically and seamlessly translate and generate videos in multiple languages. Powerful features help you efficiently localize your video content for diverse audiences around the world.

Vision Studio

Vision-based scenarios

Scenario | Description | Link
Video search and summary | Quickly summarize key points in a video and search for specific moments. | Explore the Demo
Customize models with images | Locate specific objects within images for use cases such as product placement and assembly line inspection. | Explore the Demo
Add dense captions to images | Generate human-readable captions for all important objects detected in an image. | Explore the Demo
Remove backgrounds from images | Easily remove backgrounds and preserve foreground elements. | Explore the Demo
Add captions to images | Generate human-readable sentences that describe the content of an image. | Explore the Demo
Detect common objects in images | Detect and extract bounding boxes of recognizable objects and living things. | Explore the Demo
Extract text from images | Extract printed or handwritten text from images, PDFs, and TIFF files using OCR. | Explore the Demo
Extract common tags from images | Extract tags based on recognizable objects, scenes, and actions. | Explore the Demo
Create smart-cropped images | Automatically crop images to highlight the most important areas. | Explore the Demo
Detect faces in images | Detect the location and properties of human faces in images. | Explore the Demo
Count people in an area | Analyze video to count the number of people in a specified area. | Explore the Demo
Detect when people cross a line | Detect when a person crosses a line in the camera’s field of view. | Explore the Demo
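
Several of the image scenarios above (captions, tags, objects, OCR, smart crops) can be requested together in one call. The following is a hedged sketch that assumes the Image Analysis REST API with api-version `2024-02-01` and a placeholder endpoint and key. The helper name `build_image_analysis_request` is hypothetical, and feature availability can vary by region.

```python
import json
import urllib.parse
import urllib.request
from typing import Sequence


def build_image_analysis_request(endpoint: str, key: str, image_url: str,
                                 features: Sequence[str] = ("caption", "tags")
                                 ) -> urllib.request.Request:
    """Build (but do not send) an image analysis request for a public image URL."""
    query = urllib.parse.urlencode(
        {"api-version": "2024-02-01", "features": ",".join(features)},
        safe=",",  # keep the comma-separated feature list readable
    )
    url = f"{endpoint.rstrip('/')}/computervision/imageanalysis:analyze?{query}"
    body = json.dumps({"url": image_url}).encode("utf-8")
    headers = {
        "Ocp-Apim-Subscription-Key": key,  # assumption: key-based auth
        "Content-Type": "application/json",
    }
    return urllib.request.Request(url, data=body, headers=headers, method="POST")
```

Other feature names assumed here include `denseCaptions`, `objects`, `read`, `smartCrops`, and `people`. Scenarios such as video summary and people counting use different services and are not covered by this request.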

Language Studio

Language processing scenarios

Scenario | Description | Link
Extract PII | Identify sensitive personally identifiable information (PII) in text. | Explore the Demo
Extract key phrases | Quickly identify key points in unstructured text. | Explore the Demo
Find linked entities | Link entities found in text to a knowledge base to disambiguate their identity. | Explore the Demo
Extract named entities | Identify and classify entities in text using Named Entity Recognition (NER). | Explore the Demo
Extract health information | Extract and label medical information from unstructured text. | Explore the Demo
Analyze sentiment and opinions | Get sentiment labels and confidence scores at the sentence and document level. | Explore the Demo
Detect language | Determine the language of the input document and get a confidence score. | Explore the Demo
Custom text classification | Create a custom text classification project using labeled data and a trained model. | Explore the Demo
Answer questions | Extract answers to questions from provided text. | Explore the Demo
Conversational language understanding | Build a project that uses labeled data and trained models to understand conversational language. | Explore the Demo
Orchestration project | Build and manage projects that integrate multiple language services. | Explore the Demo
Summarize information | Use the summarization API to generate summaries of conversations or documents. | Explore the Demo
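
Many of the prebuilt text scenarios above share one request shape and differ mainly in the task kind. The sketch below assumes the Language service `analyze-text` REST operation (`{endpoint}/language/:analyze-text?api-version=2023-04-01`) and its PII response format. The helper names are hypothetical and the sample data in the usage note is made up.

```python
from typing import Dict, List


def build_analyze_text_body(kind: str, texts: List[str], language: str = "en") -> dict:
    """Build the JSON body for one analyze-text task over several documents."""
    documents = [
        {"id": str(i), "language": language, "text": text}
        for i, text in enumerate(texts, start=1)
    ]
    return {
        "kind": kind,  # e.g. "PiiEntityRecognition" or "KeyPhraseExtraction"
        "parameters": {"modelVersion": "latest"},
        "analysisInput": {"documents": documents},
    }


def redacted_texts(response: dict) -> Dict[str, str]:
    """Map each document id to its redacted text in a PII response."""
    return {
        doc["id"]: doc["redactedText"]
        for doc in response["results"]["documents"]
    }
```

For example, `build_analyze_text_body("PiiEntityRecognition", ["Call me at 555-0100."])` produces a body with a single document whose id is `"1"`. Custom projects, question answering, and summarization use different operations.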


Batch translate documents into one or more languages from local storage or Azure Blob Storage.

Document Intelligence

Document analysis scenarios

Scenario | Description | Link
Read | Extract printed and handwritten text, barcodes, and formulas from documents. | Explore the Demo
Layout | Extract tables, checkboxes, and text from forms and images. | Explore the Demo
General documents | Extract key-value pairs and structures from any form or document. | Explore the Demo
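
Each document analysis model is addressed by a model ID in the analyze URL. The following sketch assumes the Document Intelligence v3.1 REST API (api-version `2023-07-31`, `formrecognizer` path) and a placeholder endpoint. The dictionary and function names are hypothetical.

```python
import urllib.parse

# Assumed model IDs for the three document analysis models in this section.
DOCUMENT_ANALYSIS_MODELS = {
    "read": "prebuilt-read",          # printed and handwritten text
    "layout": "prebuilt-layout",      # tables, checkboxes, and text
    "general": "prebuilt-document",   # key-value pairs and structure
}


def build_analyze_url(endpoint: str, model_id: str,
                      api_version: str = "2023-07-31") -> str:
    """Return the URL that starts an analyze operation for the given model."""
    query = urllib.parse.urlencode({"api-version": api_version})
    return (
        f"{endpoint.rstrip('/')}/formrecognizer/documentModels/"
        f"{model_id}:analyze?{query}"
    )
```

Analysis is asynchronous under this assumption: the initial POST returns an `Operation-Location` header, and the result is fetched by polling that URL.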

Prebuilt model scenarios

Extract invoice details including customer and supplier details, total amount, and line items.

Extract transaction details from receipts, including date, seller information, and total amount.

Extract details from passports and ID cards.

US Health Insurance Card

Extract details from your US health insurance card.

Classify and extract information from documents including W2, 1040, 1098, and 1099.

Extract information from a variety of mortgage documents.

Extract employee information and payroll information including income, deductions, and net pay.

Extract the amount, date, pay-to-order line, MICR number, payer’s name and address, and more from checks.

Extract details from marriage certificates.

Extract credit card details including card number and cardholder name.

Extract title and signatory party information from contracts.

Extract contact information from business cards.

Gen-AI Safety Solutions

Safeguard your image content

A tool for evaluating content moderation scenarios, taking into account factors such as content type, platform policies, and potential impact on users. Run moderation tests on sample content, use Configure filters to rerun and refine the test results, and add specific terms that you want to detect and act on to the blocklist.

Moderate multimodal content

Run moderation tests on content that combines images and text. Evaluate test results based on the severity detected.

Safeguard your text content

Run moderation tests on text content. Evaluate test results by detected severity.
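
Evaluating results by severity usually means comparing each category's score against a threshold you choose. The sketch below assumes the Content Safety `text:analyze` REST operation, its four categories, and a response that carries a `categoriesAnalysis` list. The helper names and the thresholds in the usage note are hypothetical.

```python
from typing import Dict, Iterable


def build_text_analyze_body(text: str, blocklist_names: Iterable[str] = ()) -> dict:
    """Build the JSON body for a text moderation request."""
    return {
        "text": text,
        "categories": ["Hate", "SelfHarm", "Sexual", "Violence"],
        "blocklistNames": list(blocklist_names),
        "outputType": "FourSeverityLevels",  # severities 0, 2, 4, 6
    }


def violations(response: dict, thresholds: Dict[str, int]) -> Dict[str, int]:
    """Return categories whose severity meets or exceeds its threshold."""
    flagged = {}
    for item in response.get("categoriesAnalysis", []):
        limit = thresholds.get(item["category"])
        if limit is not None and item["severity"] >= limit:
            flagged[item["category"]] = item["severity"]
    return flagged
```

With thresholds such as `{"Hate": 2, "Violence": 4}`, a response that reports Hate at severity 2 and Violence at severity 2 flags only Hate. Categories without a configured threshold are ignored.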

Groundedness detection detects ungrounded content generated by large language models (LLMs).

Protected material detection

Detect and protect third-party text material in LLM output.

Prompt Shields provides a unified API to handle the following types of attacks: jailbreak attacks and indirect attacks.
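
The two attack types map onto the two inputs of a request: the user prompt (jailbreak attacks) and any attached documents (indirect attacks). The following sketch assumes the Content Safety `text:shieldPrompt` REST operation and its `attackDetected` response flags. The helper names are hypothetical.

```python
from typing import Iterable


def build_shield_body(user_prompt: str, documents: Iterable[str] = ()) -> dict:
    """Build the JSON body: one user prompt plus optional grounding documents."""
    return {"userPrompt": user_prompt, "documents": list(documents)}


def attack_detected(response: dict) -> bool:
    """True if the prompt or any document was flagged as an attack."""
    in_prompt = response.get("userPromptAnalysis", {}).get("attackDetected", False)
    in_documents = any(
        doc.get("attackDetected", False)
        for doc in response.get("documentsAnalysis", [])
    )
    return in_prompt or in_documents
```

A caller would typically refuse or sanitize the request when `attack_detected` returns `True`, before the prompt reaches the model.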

Real-time safety measures

This view shows API usage, moderation results, and their distribution by category. You can customize the severity threshold for each category to see updated results, and then deploy the new thresholds. You can also edit the blocklist on this page to respond to incidents.




