Using the Vision AI Developer Kit for Audio

Overview

This repo demonstrates how to use the Vision AI Developer Kit (VAI DevKit) to develop a Neural Network model to process audio sounds. For information on using Vision on the VAI DevKit, refer to Vision AI DevKit main page.

Solution Videos

Background

Processing video or images through a Neural Network involves converting images, most commonly JPEG, into a NumPy array where features can be extracted and calculated. At the hightest level, most Vision AI projects include this with added capabilities.

A few of the challenges with Vision include requiring a camera and the camera only has a limited field of view. To detect images in a complete circle, you often need 4+ cameras, which has a higher cost and require specialized hardware to process so much data and networks.

Audio, using just a microphone, is a much more cost effective approach for lots of use cases. The advantages of Audio include:

Lower Price
Full 360° coverage
No dependency on light

While audio is not the answer to all use cases, it can be used in many. With your eyes closed, listen to all the sounds around you and think about how you were "trained" to recognize the sound.

Resources

Github Repository - this site
Azure Subscription -- We will use Azure IoT Edge and Azure Machine Learning Workspace in the sample
Visual Studio Code -- the IDE for this sample
Audio Documentation - all documentation for using the VAI DevKit for audio processing
Sample - a sample solution for audio processing on the VAI DevKit
Qualcomm QCS603 - learn more about the chipset powering the Vision AI Developer Kit hardware

Get a kit

You can purchase the DevKit from Arrow Electronics.

Name		Name	Last commit message	Last commit date
Latest commit History 79 Commits
AMLNotebook		AMLNotebook
documentation		documentation
samples		samples
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AMLNotebook

AMLNotebook

documentation

documentation

samples

samples

README.md

README.md

Repository files navigation

Using the Vision AI Developer Kit for Audio

Overview

Solution Videos

Background

Resources

Get a kit

About

Releases

Packages

Languages

ksaye/vision-ai-developer-kit-audio

Folders and files

Latest commit

History

Repository files navigation

Using the Vision AI Developer Kit for Audio

Overview

Solution Videos

Background

Resources

Get a kit

About

Topics

Resources

Stars

Watchers

Forks

Languages