mishranilesh012/Natural_Language_Processing_Techniques

Natural Language Processing Techniques

This project implements speaker diarization, speech sentiment analysis and optical character recognition (OCR).

Topics

  1. Getting Started
  2. Prerequisites
  3. Installation
  4. Angular Installation
  5. Dataset
  6. Screenshots

Getting Started

For speaker diarization, package used:

  • pyAudioAnalysis - A Python library for audio feature extraction, classification, segmentation and applications.
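
As a rough illustration of how pyAudioAnalysis might be called for diarization, the sketch below wraps its segmentation module and turns the per-window speaker labels into time segments. It is not taken from this repository. The helper names `labels_to_segments` and `diarize` are hypothetical, and the `speaker_diarization` call with its `mid_step` keyword is an assumption based on recent pyAudioAnalysis releases (older releases used different function and argument names, and the return value differs between versions).

```python
from itertools import groupby


def labels_to_segments(labels, step):
    """Collapse per-window speaker labels into (start_s, end_s, speaker) runs."""
    segments = []
    index = 0
    for speaker, run in groupby(labels):
        length = len(list(run))
        start = round(index * step, 3)
        end = round((index + length) * step, 3)
        segments.append((start, end, int(speaker)))
        index += length
    return segments


def diarize(wav_path, n_speakers, mid_step=0.1):
    """Run pyAudioAnalysis diarization and return speaker segments.

    Assumption: the installed release exposes speaker_diarization() with a
    mid_step keyword; check the signature of your installed version.
    """
    # Imported lazily so labels_to_segments() works without the library.
    from pyAudioAnalysis import audioSegmentation as aS

    result = aS.speaker_diarization(wav_path, n_speakers, mid_step=mid_step)
    # Some releases return only the labels, others a tuple starting with them.
    labels = result[0] if isinstance(result, tuple) else result
    return labels_to_segments(labels, mid_step)
```

With a hypothetical two-speaker recording, `diarize("meeting.wav", n_speakers=2)` would return a list such as `[(start, end, speaker_id), ...]`, one entry per uninterrupted speaker turn.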

For OCR, package used:

  • Tesseract - An optical character recognition engine for various operating systems, used from Python through the pytesseract wrapper.
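
A minimal sketch of an OCR step built on the packages listed under Prerequisites (opencv-python and pytesseract) might look like the following. It is an illustration under stated assumptions, not the repository's own code: the function names `ocr_image` and `clean_ocr_text` are hypothetical, and the grayscale plus Otsu threshold preprocessing is one common choice rather than a confirmed part of this project. Note that pytesseract only wraps the Tesseract engine, so the Tesseract binary itself must be installed and reachable on the PATH.

```python
def clean_ocr_text(raw):
    """Drop form feeds, surrounding whitespace and empty lines from OCR output."""
    lines = [line.strip() for line in raw.replace("\x0c", "").splitlines()]
    return "\n".join(line for line in lines if line)


def ocr_image(image_path):
    """Read an image, binarise it and return the recognised text."""
    # Imported lazily so clean_ocr_text() works without these packages.
    import cv2
    import pytesseract

    image = cv2.imread(image_path)
    if image is None:
        raise FileNotFoundError(f"Could not read image: {image_path}")
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    # Otsu's method picks the binarisation threshold automatically.
    _, binary = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
    return clean_ocr_text(pytesseract.image_to_string(binary))
```

Calling `ocr_image("scan.png")` on a hypothetical scanned page would return the recognised text as a single string with blank lines removed.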

Prerequisites

  • Dependencies for OCR
$ pip install Flask flask-restful SpeechRecognition opencv-python pytesseract

$ pip install -U textblob
  • Dependencies for pyAudioAnalysis
$ pip install numpy matplotlib scipy scikit-learn hmmlearn simplejson eyed3 pydub
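
To confirm that the dependencies above installed correctly, a short check along these lines could be run. It is a sketch using only the Python standard library. The function name `missing_modules` is hypothetical, and the list holds the import names that correspond to the pip packages (for example, opencv-python is imported as `cv2` and SpeechRecognition as `speech_recognition`).

```python
from importlib.util import find_spec

# Import names for the pip packages listed in the Prerequisites section.
REQUIRED_MODULES = [
    "flask", "flask_restful", "speech_recognition", "cv2", "pytesseract",
    "textblob", "numpy", "matplotlib", "scipy", "sklearn", "hmmlearn",
    "simplejson", "eyed3", "pydub",
]


def missing_modules(names):
    """Return the module names that cannot be found in this environment."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All dependencies found.")
```

The check only looks the modules up without importing them, so it stays fast and does not fail if a package is installed but misconfigured.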

Installation

  • Clone the source of this library:
$ git clone https://github.com/mishranilesh012/Natural_Language_Processing_Techniques.git

Angular Installation

  • Installing Angular CLI version 6
$ npm install -g @angular/cli@6
  • Install the dependencies from within the Angular directory Natural_Language_Processing_Techniques\Angular\NLP
$ npm install
  • Running Angular
$ npm start
  • Open the browser at
http://localhost:4200

Dataset

Tested Dataset

Screenshots

OCR

[Screenshot: ocr11]

Output

[Screenshot: ocroutput]

Speaker Diarization and Speech Sentiment Analysis

[Screenshot: dia111]

Output

[Screenshot: diares]
