Author: Ketan Doshi
-
A Gentle Guide to Feature Engineering and Visualization with Geospatial data, in Plain English
11 min read -
A Gentle Guide to fundamental techniques used by gradient descent optimizers like SGD, Momentum, RMSProp,…
12 min read -
A Gentle Guide to how Beam Search enhances predictions, in Plain English
10 min read -
Audio Deep Learning Made Simple: Automatic Speech Recognition (ASR), How it Works
Artificial IntelligenceSpeech-to-Text algorithm and architecture, including Mel Spectrograms, MFCCs, CTC Loss and Decoder, in Plain English
17 min read -
An end-to-end example and architecture for Audio Deep Learning’s foundational application scenario, in Plain English.
14 min read -
A Gentle Guide to enhancing Spectrogram features for optimal performance. Also Data Augmentation, in Plain…
8 min read -
Audio Deep Learning Made Simple (Part 2): Why Mel Spectrograms perform better
Artificial IntelligenceA Gentle Guide to processing audio in Python. What are Mel Spectrograms and how to…
8 min read -
A Gentle Guide to the world of disruptive deep learning audio applications and architectures. And…
14 min read -
A Gentle Guide to the inner workings of Self-Attention, Encoder-Decoder Attention, Attention Score and Masking,…
12 min read