Analyze an image
This feature returns information about visual content found in an image. Use tagging, descriptions, and domain-specific models to identify content and label it with confidence. Apply the adult/racy settings to enable automated restriction of adult content. Identify image types and color schemes in pictures.
See it in action
Want to build this?
Read text in images
Optical character recognition (OCR) detects text in an image and extract the recognized words into a machine-readable character stream. Analyze images to detect embedded text, generate character streams, and enable searching. Take photos of text instead of copying to save time and effort.
See it in action
By uploading data for this demo, you agree that Microsoft may store it and use it to improve Microsoft services, including this API. To help protect your privacy, we take steps to de-identify your data and keep it secure. We won’t publish your data or let other people use it.
Want to build this?
Preview: Read handwritten text from images
This technology (handwritten OCR) allows you to detect and extract handwritten text from notes, letters, essays, whiteboards, forms, etc. It works with different surfaces and backgrounds, such as white paper, yellow sticky notes, and whiteboards.
Handwritten text recognition saves time and effort and can make you more productive by allowing you to take images of text, rather than having to transcribe it. It makes it possible to digitize notes, which then allows you to implement quick and easy search. It also reduces paper clutter.
Note: this technology is currently in preview and is only available for English text.
To try this optical character recognition demo, upload a locally stored image or provide an image URL. We don’t store the images you supply for this demo unless you give us permission.
See it in action
Want to build this?
Recognize celebrities and landmarks
The Celebrity and Landmark Models are examples of Domain Specific Models. Our celebrity recognition model recognizes 200K celebrities from business, politics, sports and entertainment. Our landmark recognition model recognizes 9000 natural and man-made landmarks from around the world. Domain Specific Models is a continuously evolving feature within Computer Vision API.
See it in action
Want to build this?
Analyze video in near real-time
Analyze video in near real-time Use any of the Computer Vision APIs with you video files by extracting frames of the video from your device and then sending those frames to the API calls of your choice. Get results from your videos faster.
Use our sample on GitHub to get started and build your own app.
Learn moreSee it in action
Want to build this?
Generate a thumbnail
Generate a high quality storage-efficient thumbnail based on any input image. Use thumbnail generation to modify images to best suit your needs for size, shape, and style. Apply smart cropping to generate thumbnails that differ from the aspect ratio of your original image, yet preserve the region of interest.
See it in action
By uploading data for this demo, you agree that Microsoft may store it and use it to improve Microsoft services, including this API. To help protect your privacy, we take steps to de-identify your data and keep it secure. We won’t publish your data or let other people use it.
Want to build this?
"We can use the Computer Vision API to prove to our clients the reliability of the data, so they can be confident making important business decisions based on that information"
Leendert de Voogd: CEO | Vigiglobe
"It didn’t take us long to realize Microsoft Cognitive Services had handed us a powerful set of computer-vision and artificial-intelligence tools that we could use to create great apps and new features for our customers in just a few hours"
John Fan: Cofounder and CEO | Cardinal Blue Software
"Because the Cognitive Services APIs harness the power of machine learning, we were able to bring advanced intelligence into our product without the need to have a team of data scientists on hand"
Aaron Edell: Chief Product Owner | GrayMeta
"We found Cognitive Services to be the missing piece in the equation, the one that we needed to bring this solution to market and really revolutionize the way people look at video"
Katie McCann: Vice President of Product and Engineering | Prism Skylabs
"Microsoft Cognitive Services gives us a huge range of opportunities. It’s a perfect match for us now, and in the future when we want to add more features to our app"
Jaan Apajalahti: CEO | Blucup
"Using the Cognitive Services APIs, it took us three months to develop a test pair of glasses that can translate text and images into speech, identify emotions, and describe scenery. If we had been working full time, we could have done it in two weeks"
Benoit Chirouter: R&D Director | Pivothead
Check out the other Cognitive Services APIs
Computer Vision API
Distill actionable information from images
Content Moderator
Automated image, text, and video moderation
Video API PREVIEW
Intelligent video processing
Video Indexer PREVIEW
Unlock video insights
Face API
Detect, analyze, organize, and tag faces in photos
Emotion API PREVIEW
Personalize user experiences with emotion recognition
Custom Vision Service PREVIEW
Easily customize your own state-of-the-art computer vision models for your unique use case.
Language Understanding Intelligent Service PREVIEW
Teach your apps to understand commands from your users
Bing Spell Check API
Detect and correct spelling mistakes in your app
Web Language Model API PREVIEW
Use the power of predictive language models trained on web-scale data
Text Analytics API PREVIEW
Easily evaluate sentiment and topics to understand what users want
Translator Text API
Easily conduct machine translation with a simple REST API call
Linguistic Analysis API PREVIEW
Simplify complex language concepts and parse text with the Linguistic Analysis API.
Translator Speech API
Easily conduct real-time speech translation with a simple REST API call
Bing Speech API
Convert speech to text and back again to understand user intent
Speaker Recognition API PREVIEW
Use speech to identify and authenticate individual speakers
Custom Speech Service PREVIEW
Overcome speech recognition barriers like speaking style, background noise, and vocabulary
Bing Autosuggest API
Give your app intelligent autosuggest options for searches
Bing News Search API
Search for news and get comprehensive results
Bing Web Search API
Get enhanced search details from billions of web documents
Bing Entity Search API PREVIEW
Enrich your experiences by identifying and augmenting entity information from the web.
Bing Image Search API
Search for images and get comprehensive results
Bing Video Search API
Search for videos and get comprehensive results
Bing Custom Search PREVIEW
An easy-to-use, ad-free, commercial-grade search tool that lets you deliver the results you want.
Recommendations API PREVIEW
Predict and recommend items your customers want
Knowledge Exploration Service PREVIEW
Enable interactive search experiences over structured data via natural language inputs
Entity Linking Intelligence Service API PREVIEW
Power your app's data links with named entity recognition and disambiguation.
Academic Knowledge API PREVIEW
Tap into the wealth of academic content in the Microsoft Academic Graph
QnA Maker API PREVIEW
Distill information into conversational, easy-to-navigate answers.
Custom Decision Service PREVIEW
A cloud-based, contextual decision-making API that sharpens with experience
Project Prague
Gesture based controls
Project Nanjing
Isochrones calculations
Project Johannesburg
Route logistics
Project Cuzco
Event associated with Wikipedia entries
Project Abu Dhabi
Distance matrix
Project Wollongong
Location insights