- Click nodes to play music
- Please click again on song node to stop playing
Cepstra
Technology
Cepstra algorithm analyzes the top terms that describe a particular song or artist. Each artist and song can have thousands of characteristics representing them. Each one has an associated score, which describes how significant that description is for a song/artist. These tags or words are added to the model of each song and artist, which is then used for modeling what songs to recommend to a user.
With the help of sound processing technique and artificial intelligence, Cepstra Algorithm reads and understands music as if a human understands (and enjoy it! not sure if server enjoys it 😉 ). Cepstra’s Algorithms learns a massive number of important observations that humans make while listening to a song like the right pattern of modulation, the right combination of instruments playing at the right time of a song. Cepstra not only make hundreds of observation about the song also applied intelligence that a human would apply to, or create a likable song. The journey of understanding music starts from the number of instruments, voices played inside the mp3 file, modulation, and angles at which sounds are modulated, to put the information in Cepstra’s proprietary algorithm which not only learns the music patterns but also separates the voice and instruments.
The understanding starts from various frequency-based learning algorithms based on Fourier transforms and spectrograms. To learn the human likeliness of music, Cepstra’s Web crawlers read and understand the music from world wide web feed them back to our system. The system then applies unsupervised learning to structurize the information for our predictions. Analyzing RAW audio, we process the tracks through various Machine learning Models at scale, it transforms the raw sound and produces numerous characteristics like time signature, key, mode, tempo, and loudness. After being processed by our pipeline, it provides metrics to understand and define songs and music vectors. This understandingallows Cepstra to compare songs based on those key metrics. For example, someone who likes heavy metal might like songs that are more “loud.“ By combining these two models, Cepstra analyzes the similarity of different songs and artists and can recommend new songs to users every week.
Cepstra’s algorithm analyses the audio of a song to see the similarities between other songs. So, when a likeness of the new song is found to be similar to other songs user like, Cepstra can add it to a User’s playlist. All this at a large scale of processing, we recommend next likely music at O(1) complexity and speed. The algorithm is not only capable of defining the music but suggests music with milliseconds speed.
Why the name Cepstra?
A cepstrum (/ˈkɛpstrʌm, ˈsɛp-, -strəm/) is the result of taking the inverse Fourier transform (IFT) of the logarithm of the estimated spectrum of a signal. It may be pronounced in the two ways given, the second having the advantage of avoiding confusion with “kepstrum”, which also exists (see below). There is a complex cepstrum, a real cepstrum, a power cepstrum, and a phase cepstrum. The power cepstrum, in particular, has applications in the analysis of human speech.
The name “cepstrum” was derived by reversing the first four letters of “spectrum”. Operations on cepstra are labeled quefrency analysis (aka quefrency analysis[1]), liftering, or cepstral analysis.
But why only Music?
AMusic is an integral part of our Lives. It is personal to us, and many of us are passionate about it. Music is encompassed across genres and geographies, ages and formats – from radio to streaming and beyond.
Following are some insights of IFPI MUSIC CONSUMER INSIGHT REPORT 2018:
GLOBAL
- On average consumers spend 17.8 hrs listening to music each week globally which translates to around 2.5 hrs/day.
- 75% of consumers use smartphones to listen to music.
- Younger consumers (16-24s) are more likely to listen to music during any activity and much more likely to listen on their way to work or education or while at work or education.
- 57% of 16-24-year-olds use a paid audio streaming service.
INDIA
- 96% of consumers are listening to music on smartphones – the highest rate in the world
- 96% of consumers in India listen to licensed music
- INDIA’S FAVOURITE GENRES: Bollywood new, Bollywood Old, Indian classical music, Pop, Rock
So what is Cepstra working on? How is it disrupting the Music industry?
We are developing an engine that uses following recommendation models:
- Audio modeling uses a song’s raw signals to understand the tune of the song and compares it to other songs.
- Signal processing to identify the sound.
- Natural Language Processing (NLP) analyzes the text in each song.