Skip to content

ma4ank/TimeSide

 
 

Repository files navigation

TimeSide : open audio processing framework

Version Downloads travis_master coveralls_master

TimeSide is a python framework enabling low and high level audio analysis, imaging, transcoding, streaming and labelling. Its high-level API is designed to enable complex processing on very large datasets of any audio or video assets with a plug-in architecture, a secure scalable backend and an extensible dynamic frontend.

Use cases

  • Scaled audio computing
  • Pseudo-realtime streaming
  • Plugin prototyping
  • Audio metadata management and visualization
  • Automatic segmentation and labelling synchronized with audio events.

Goals

  • Do asynchronous and fast audio processing with Python,
  • Decode audio frames from any audio or video media format into numpy arrays,
  • Analyze audio content with some state-of-the-art audio feature extraction libraries like Aubio, Yaafe and VAMP as well as some pure python processors
  • Visualize sounds with various fancy waveforms, spectrograms and other cool graphers,
  • Transcode audio data in various media formats and stream them through web apps,
  • Serialize feature analysis data through various portable formats,
  • Playback and interact on demand through a smart high-level HTML5 extensible player,
  • Index, tag and annotate audio archives with semantic metadata (see Telemeta which embed TimeSide).

Funding and support

To fund the project and continue our fast development process, we need your explicit support. So if you use TimeSide in production or even in development, please let us know:

Thanks for your help!

Architecture

The streaming architecture of TimeSide relies on 2 main parts: a processing engine including various plugin processors written in pure Python and a user interface providing some web based visualization and playback tools in pure HTML5.

image

Dive in

Let's produce a really simple audio analysis of an audio file. First, list all available plugins:

>>> import timeside.core >>> timeside.core.list_processors()

Define some processors:

>>> from timeside.core import get_processor >>> from timeside.core.tools.test_samples import samples >>> wavfile = samples['sweep.wav'] >>> decoder = get_processor('file_decoder')(wavfile) >>> grapher = get_processor('waveform_simple')() >>> analyzer = get_processor('level')() >>> encoder = get_processor('vorbis_encoder')('sweep.ogg')

Then run the magic pipeline:

>>> (decoder | grapher | analyzer | encoder).run()

Render the grapher results:

>>> grapher.render(output='waveform.png')

Show the analyzer results:

Level: {'level.max': AnalyzerResult(...)}

So, in only one pass, the audio file has been decoded, analyzed, graphed and transcoded.

For more extensive examples, please see the full documentation.

News

0.7.1

  • fix django version to 1.6.10 (sync with Telemeta 1.5)

0.7

  • Code refactoring:

    • Create a new module timeside.plugins and move processors therein: timeside.plugins.decoder,analyzer, timeside.plugins.encoder, timeside.plugins.fx
    • WARNING: to properly manage the namespace packages structure, the TimeSide main module is now timeside.core and code should now be initialized with import timeside.core
    • timeside.plugins is now a namespace package enabling external plugins to be automatically plugged into TimeSide (see for example timeside-diadems). This now makes TimeSide a real plugin host, yeah!
    • A dummy timeside plugin will soon be provided for easy development start.
  • Move all analyzers developped by the partners of the Diadems project to a new repository: timeside-diadems
  • Many fixes for a better processing by Travis-CI
  • Add a dox file to test the docker building continously on various distributions

0.6.2

  • Bugfix release for #63 #64 #68

0.6.1

0.6

  • WARNING! some processor ids have changed. Please see the full list below.
  • NEW analyzers: IRIT Monopoly (see Processors)
  • NEW graphers: IRIT Start/Session segmentation
  • Add extensible buffering thanks to pytables (NEW dependency)
  • Add typed parameters in processors and server thanks to traits (NEW dependency)
  • Add a graph model to the pipe thanks to networkx (NEW dependency)
  • Add test sample generators based on GStreamer
  • Add a background image option for rendering analyzers
  • Add on-the-fly filtering decorators
  • Add a Docker development image and a Dockerfile
  • Add a Vagrant development box
  • Update the Debian package installation procedure
  • Results are now stored in pipe.results as as dictionnary of AnalyzerResults
  • Update various processors
  • Prevent duplication of processor in the pipe (i.e. processors sharing the same class and parameters). This also fix #60.
  • Update of Travis CI scripts https://travis-ci.org/Parisson/TimeSide/

0.5.6

  • Bugfix release
  • Fix analyzer instanciation as parent for some graphers
  • Store analyzer's results in pipe.results by uuid instead of id (fix #24)

For older news, please visit: https://github.com/Parisson/TimeSide/blob/master/NEWS.rst

Processors

IEncoder

  • live_encoder : Gstreamer-based Audio Sink
  • flac_encoder : FLAC encoder based on Gstreamer
  • aac_encoder : AAC encoder based on Gstreamer
  • mp3_encoder : MP3 encoder based on Gstreamer
  • vorbis_encoder : OGG Vorbis encoder based on Gstreamer
  • opus_encoder : Opus encoder based on Gstreamer
  • wav_encoder : WAV encoder based on Gstreamer
  • webm_encoder : WebM encoder based on Gstreamer

IDecoder

  • array_decoder : Decoder taking Numpy array as input
  • file_decoder : File Decoder based on Gstreamer
  • live_decoder : Live source Decoder based on Gstreamer

IGrapher

  • grapher_aubio_pitch : Image representing Aubio Pitch
  • grapher_onset_detection_function : Image representing Onset detection function
  • grapher_waveform : Image representing Waveform from Analyzer
  • spectrogram_log : Logarithmic scaled spectrogram (level vs. frequency vs. time).
  • spectrogram_lin : Linear scaled spectrogram (level vs. frequency vs. time).
  • waveform_simple : Simple monochrome waveform image.
  • waveform_centroid : Waveform where peaks are colored relatively to the spectral centroids of each frame buffer.
  • waveform_contour_black : Black amplitude contour waveform.
  • waveform_contour_white : an white amplitude contour wavform.
  • waveform_transparent : Transparent waveform.

IAnalyzer

  • mean_dc_shift : Mean DC shift analyzer
  • level : Audio level analyzer
  • aubio_melenergy : Aubio Mel Energy analyzer
  • aubio_mfcc : Aubio MFCC analyzer
  • aubio_pitch : Aubio Pitch estimation analyzer
  • aubio_specdesc : Aubio Spectral Descriptors collection analyzer
  • aubio_temporal : Aubio Temporal analyzer
  • yaafe : Yaafe feature extraction library interface analyzer
  • spectrogram_analyzer : Spectrogram image builder with an extensible buffer based on tables
  • onset_detection_function : Onset Detection Function analyzer
  • spectrogram_analyzer_buffer : Spectrogram image builder with an extensible buffer based on tables
  • waveform_analyzer : Waveform analyzer

IEffect

  • fx_gain : Gain effect processor

API / Documentation

Install

Any platform

Thanks to Docker, TimeSide is now fully available as a docker image ready to work. The image includes all the necessary applications, modules and volumes to start your project in a few minutes.

First install Git, Docker and docker-compose, then copy these commands in a terminal and hit ENTER:

git clone https://github.com/Parisson/TimeSide.git
cd TimeSide
docker-compose up

That's it! You can now browse the TimeSide API: http://localhost:8000/api/

and the admin: http://localhost:8000/admin (admin/admin)

To process some data by hand, you can also start a python shell session into the sandbox:

docker-compose run app python examples/sandbox/manage.py shell

To build your own audido project on top of TimeSide, just pull our latest master image:

docker pull parisson/timeside:latest

WARNING: our docker composition already bundles some powerfull containers and bleeding edge frameworks (Nginx, MySQL, RabbitMQ, Celery, Python, Django) that can be scaled from development to massive production environments very easily. But you must modify all the passwords and secret keys of the sandbox before any serious usecase.

More infos about the TimeSide docker image: https://registry.hub.docker.com/u/parisson/timeside/

Debian, Ubuntu

For Debian based distributions, we provide a safe repository giving additional dependencies that are not included in Debian yet. Please follow the instructions on this page.

Some of the needed dependencies

python (2.7.x) python-setuptools python-numpy python-scipy python-h5py python-matplotlib python-imaging python-simplejson python-yaml python-mutagen libhdf5-serial-dev python-tables python-gst0.10 gstreamer0.10-gnonlin gstreamer0.10-plugins-good gstreamer0.10-plugins-bad gstreamer0.10-plugins-ugly aubio yaafe python-aubio python-yaafe vamp-examples django (1.6.x) django-south djangorestframework django-extensions

User Interfaces

Python

Of course all the TimeSide are available in our beloved python envionment. As IPython is really great for discovering objects with completion, writing notebooks, we strongly advise to install and use it:

sudo apt-get install ipython
ipython
>>> import timeside

Shell

Of course, TimeSide can be used in any python environment. But, a shell script is also provided to enable preset based and recursive processing through your command line interface:

timeside-launch -h
Usage: scripts/timeside-launch [options] -c file.conf file1.wav [file2.wav ...]
 help: scripts/timeside-launch -h

Options:
 -h, --help            show this help message and exit
 -v, --verbose         be verbose
 -q, --quiet           be quiet
 -C <config_file>, --conf=<config_file>
                       configuration file
 -s <samplerate>, --samplerate=<samplerate>
                       samplerate at which to run the pipeline
 -c <channels>, --channels=<channels>
                       number of channels to run the pipeline with
 -b <blocksize>, --blocksize=<blocksize>
                       blocksize at which to run the pipeline
 -a <analyzers>, --analyzers=<analyzers>
                       analyzers in the pipeline
 -g <graphers>, --graphers=<graphers>
                       graphers in the pipeline
 -e <encoders>, --encoders=<encoders>
                       encoders in the pipeline
 -R <formats>, --results-formats=<formats>
                       list of results output formats for the analyzers
                       results
 -I <formats>, --images-formats=<formats>
                       list of graph output formats for the analyzers results
 -o <outputdir>, --ouput-directory=<outputdir>
                       output directory

Find some preset examples in examples/presets/

Web player

TimeSide comes with a smart and pure HTML5 audio player.

Features: * embed it in any audio web application * stream, playback and download various audio formats on the fly * synchronize sound with text, bitmap and vectorial events * seek through various semantic, analytic and time synced data * fully skinnable with CSS style

Screenshot:

image

Examples of the player embeded in the Telemeta open web audio CMS: * http://parisson.telemeta.org/archives/items/PRS_07_01_03/ * http://archives.crem-cnrs.fr/items/CNRSMH_I_1956_002_001_01/

Development documentation: * https://github.com/Parisson/TimeSide/wiki/Ui-Guide

TODO list: * zoom * layers

Web server

An EXPERIMENTAL web server based on Django has been added to the package from version 0.5.5. The goal is to provide a full REST API to TimeSide to enable new kinds of audio processing web services.

A sandbox is provided in timeside/server/sandbox and you can initialize it and test it like this:

cd examples/sandbox
./manage.py syncdb
./manage.py migrate
./manage.py runserver

and browse http://localhost:8000/api/

At the moment, this server is NOT connected to the player using TimeSide alone. Please use Telemeta.

Development

travis_dev coveralls_dev

Docker is a great tool for developing and deploying processing environments. We provide a docker image which contains TimeSide and all the necessary packages (nginx, uwsgi, etc) to run it either in development or in production stages.

First, install Docker: https://docs.docker.com/installation/

Then, simply pull the image and run it:

docker pull parisson/timeside
docker run -p 9000:80 parisson/timeside

You can now browse the TimeSide API: http://localhost:9000/api/

or get a shell session:

docker run -ti parisson/timeside bash

To start a new development, it is advised to checkout the dev branch in the container:

cd /opt/TimeSide
git checkout dev

or get our latest-dev image:

docker pull parisson/timeside:latest-dev

More infos: https://registry.hub.docker.com/u/parisson/timeside/

Native

First, install TimeSide (see Install).

Then:

sudo apt-get build-dep python-timeside
sudo apt-get install git
git clone https://github.com/Parisson/TimeSide.git
cd TimeSide
git checkout dev
sudo pip install -e .
echo "export PYTHONPATH=$PYTHONPATH:`pwd`" >> ~/.bashrc
source ~/.bashrc
tests/run_all_tests

Sponsors and Partners

  • Parisson
  • CNRS (National Center of Science Research, France)
  • Huma-Num (big data equipment for digital humanities, ex TGE Adonis)
  • CREM (french National Center of Ethomusicology Research, France)
  • Université Pierre et Marie Curie (UPMC Paris, France)
  • ANR (CONTINT 2012 project : DIADEMS)
  • MNHN : Museum National d'Histoire Naturelle (Paris, France)

Copyrights

  • Copyright (c) 2006, 2015 Parisson Sarl
  • Copyright (c) 2006, 2015 Guillaume Pellerin
  • Copyright (c) 2013, 2015 Thomas Fillon
  • Copyright (c) 2010, 2014 Paul Brossier
  • Copyright (c) 2013, 2014 Maxime Lecoz
  • Copyright (c) 2013, 2014 David Doukhan
  • Copyright (c) 2006, 2010 Olivier Guilyardi

License

TimeSide is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 2 of the License, or (at your option) any later version.

TimeSide is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

See LICENSE for more details.

About

Audio processing framework for the web

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 57.4%
  • JavaScript 41.3%
  • Other 1.3%