We like to think about artificial intelligence as a set of technologies that allows machines and systems to sense, comprehend, act, and learn. Let's talk in a little bit more detail about these capabilities and the technologies behind them. Let's start with the first capability, sense. When we talk about sense, we mean that AI lets a machine perceive the world around it by gathering and processing images, sounds, speech, and text. Examples of this include facial recognition, such as when your phone unlocks with a simple glance, Image categorization-- is it a dog or a cat? Sound pattern recognition-- is it fireworks or a bomb going off?
Translating speech to texts-- creating subtitles for movies. Some of the technologies that are behind it are computer vision. This allows machines such as computers of mobile phones to see their surroundings. Computer vision has already made its way to our mobile phones via different e-commerce or camera apps. Audio processing-- this has to do with detecting and translating audio signals.
I will give you two examples related to the sense capability that you might find easy to identify with. The first, Google Cloud Vision, which classifies images into thousands of categories such as sailboats and detects objects and faces within images. The second, Amazon Echo, which acts as a personal DJ that you can control through your voice.
Image Categorisation- The process of putting images into different categories for use within a training model.
Sound Pattern Recognition - The process of classifying sounds into different categories and recognising patterns in the sounds.
Translating Speech to Text - When technology turns language/voice that is spoken into a textual transcript.
Computer Vision - The ability to allow computers to see, recognise and process images in the way that humans can through video or image analytics.
Audio Processing - The analysing of audio signals.
Tomorrow we'll discuss about the second capability comprehend
Translating speech to texts-- creating subtitles for movies. Some of the technologies that are behind it are computer vision. This allows machines such as computers of mobile phones to see their surroundings. Computer vision has already made its way to our mobile phones via different e-commerce or camera apps. Audio processing-- this has to do with detecting and translating audio signals.
I will give you two examples related to the sense capability that you might find easy to identify with. The first, Google Cloud Vision, which classifies images into thousands of categories such as sailboats and detects objects and faces within images. The second, Amazon Echo, which acts as a personal DJ that you can control through your voice.
Key Words:
Facial Recognition- Technology that can identify a person from an image or video using facial characteristics.Image Categorisation- The process of putting images into different categories for use within a training model.
Sound Pattern Recognition - The process of classifying sounds into different categories and recognising patterns in the sounds.
Translating Speech to Text - When technology turns language/voice that is spoken into a textual transcript.
Computer Vision - The ability to allow computers to see, recognise and process images in the way that humans can through video or image analytics.
Audio Processing - The analysing of audio signals.
Tomorrow we'll discuss about the second capability comprehend
0 comments:
Post a Comment