ADVANCEMENTS IN AI AND BIOACOUSTIC SIGNAL PROCESSING - ATAL FDP Presentation 15.02.2025

JohnAmose 46 views 49 slides Mar 05, 2025
Slide 1
Slide 1 of 49
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30
Slide 31
31
Slide 32
32
Slide 33
33
Slide 34
34
Slide 35
35
Slide 36
36
Slide 37
37
Slide 38
38
Slide 39
39
Slide 40
40
Slide 41
41
Slide 42
42
Slide 43
43
Slide 44
44
Slide 45
45
Slide 46
46
Slide 47
47
Slide 48
48
Slide 49
49

About This Presentation

I am grateful to be a Resource Person in the AICTE ATAL Academy-sponsored Faculty Development Program (FDP) on Advancements in AI and Bioacoustic Signal Processing on 15.02.2025. In the session, the evolution of Natural Language Processing (NLP) through the Transformer architecture, inspired by the ...


Slide Content

Dr. John Amose
Assistant Professor (Sr.G)
Department of CSE (AI & ML)
Sri Krishna College of Technology (SKCT), Coimbatore
Mob: +91 9042944206
linkedin.com/in/johnamose/
AICTE Training and Learning (ATAL) Academy sponsored
Faculty Development Program on
Transforming Healthcare through Wearable Technologies and AI
ADVANCEMENTS IN AI AND
BIOACOUSTIC SIGNAL PROCESSING
15.02.2025 at 3:30pm

Advancements in AI -NVIDIA1.
Perception AIa.
Generative AIb.
Agentic AIc.
Physical AId.
Roboticse.
Project Digitsf.
Foundational Technology2.
Natural Language Processinga.
Transformer Architectureb.
Bioacoustic Signal Processing3.
Case Study 1 - Lung Acoustica.
Case Study 2 - Speech Acousticb.
Case Study 3 - Heart Acousticc.
Wearable Technology & Responsible AI4.

Advancements in AI -
NVIDIA
SECTION 1

Consumer Electronics Show (CES) - Las Vegas 2025 - 07 Jan 2025
Highlights - NVIDIA CEO Jensen Huang Keynote at CES 2025

Consumer Electronics Show (CES) - Las Vegas 2025
Highlights - NVIDIA - Generative AI
DENTAL REPORT

Consumer Electronics Show (CES) - Las Vegas 2025
Highlights - NVIDIA - Agentic AI

Consumer Electronics Show (CES) - Las Vegas 2025
Highlights - NVIDIA - Physical AI

Consumer Electronics Show (CES) - Las Vegas 2025
Highlights - NVIDIA - Autonomous Vehicles

Consumer Electronics Show (CES) - Las Vegas 2025
Highlights - NVIDIA - Robotics

Consumer Electronics Show (CES) - Las Vegas 2025
Highlights - NVIDIA - Project Digits

Foundational
Technologies
SECTION 2

NATURAL LANGUAGE PROCESSING (NLP)
Text Pre-processing
Tokenization
Stemming
Lamnatization
Bag of Words
Vector Embedding
Word2vec
Cosine similarity (Semantic)
Large Language Models (LLMs)
Transformer

This is mathematically implemented using three main components:
Query (Q) – Represents the current processing element (e.g., a
word in a sentence).
1.
Key (K) – Represents all elements in the input that can be
attended to.
2.
Value (V) – Contains the actual information from the input.3.
Scaled Dot-Product Attention (Used in Transformers)
Each Query-Q computes a similarity score with all Keys-K,
determining which inputs are important. This is done as:
How Does Attention Work?
Attention assigns different weights to different
input elements, prioritizing the most important
ones.
TRANSFORMERS – ATTENTION IS ALL YOU NEED

Your paragraph text
TRANSFORMER
ARCHITECTURE

Bioacoustic
Signal
Processing
SECTION 3

BIOACOUSTIC SIGNAL
PROCESSING
Case Study (Pulmonologist)
Lung Acoustics for Respiratory Care
Implementation of
Perception AI
Generative AI
Agentic AI

CASE STUDY 1
LUNG ACOUSTICS - COPD DIAGNOSIS
Analog Stethoscopes
Digitization

Digital Stethoscopes

Indian - Digital Stethoscopes

Multi-Feature
Medical Devices

Wearable Respiratory Monitor - Strados Lab

Wearable
Respiratory
Monitor
Strados Lab

LUNG ACOUSTICS

Chest Sounds
Lung Sounds
Spectrogram MFCC
Time-Frequency Representation
Denoised Signal
Bandpass Filter [20-2000 Hz]
Downsampling to 4000Hz

•MFCC 0 [Energy Parameter] has a high correlation and redundancy, with average absolute correlations
around 0.49.
•Wheeze - Spectral shape are better captured by the higher-order MFCCs (spectral details of the sound)
•COPD tends to produce more consistent sound patterns due to chronic obstruction,overall energy level
might not fluctuate much over time
MFCC Coefficient Analysis - Correlation

Hyperparameter Tuning - Heatmap of GridSeachCV
Mean Test Scores

Precision Score
Recall Score
F1 Score

Langflow is a low-code tool for developers that makes it easier to build powerful
AI agents and workflows that can use any API, model, or database.

GENERATIVE AI
Data Augmentation for Lung Sound Classification
AI-Powered Lung Sound Interpretation
Explainable AI (XAI) for Lung Sound Predictions
AI-Generated Personalized Reports
Generative AI Chatbot for Medical Queries
Implementation Plan: Agentic AI for Bio-Acoustic Analysis
We will create an AI agent that can:
Analyze lung sound spectrograms using a trained CNN model.
Answer medical queries using LLMs.
Retrieve medical research papers based on user input.
Provide diagnostic insights based on lung sound classification.
AGENTIC AI

BIOACOUSTIC SIGNAL
PROCESSING
Case Study (Neurologist & Speech Pathologist)
Speech Acoustics- Dysarthria
Implementation of
Perception AI
Generative AI
Agentic AI

Thoppil MG, Kumar CS, Kumar A, Amose J. Speech signal analysis and pattern recognition in
diagnosis of dysarthria. Ann Indian Acad Neurol 2017;20:352-7 [SCI]

Comparison of pitch of normal speech with
spastic speech. Demonstrates F0 jitter
Comparison of pitch of normal speech with ataxic speech.
Demonstrates F0 break
Comparison of pitch of normal speech with
extrapyramidal speech. Demonstrates F0
monotonicity
Demonstrates that the formant range (F1 and F2) decreases as severity
of speech increases. Comparison of formant range of normal speech
with severe dysarthria

GENERATIVE AI
AI-Based Speech Reconstruction
Personalized Voice Cloning
AI-Powered Speech Prediction
Implementation Plan: Agentic AI for Bio-Acoustic Analysis
We will create an AI agent that can:
Autonomous Speech Severity Analysis Agent
Self-Improving Speech Therapy Agent with feedback on exercise
Intelligent Speech Prediction Agent for speech correction.
AGENTIC AI

BIOACOUSTIC SIGNAL
PROCESSING
Case Study (Cardiologist)
Heart Acoustics
Implementation of
Perception AI
Generative AI
Agentic AI

Acoustics Lab Research Associate
Andrew McDonald

GENERATIVE AI
Data Augmentation for Heart Sound Classification
AI-Powered Heart Sound Interpretation
Explainable AI (XAI) for Heart Sound Predictions
AI-Generated Personalized Reports
Generative AI Chatbot for Medical Queries
Implementation Plan: Agentic AI for Bio-Acoustic Analysis
We will create an AI agent that can:
Analyze Heart sound spectrograms using a trained CNN model.
Answer medical queries using LLMs.
Retrieve medical research papers based on user input.
Provide diagnostic insights based on heart sound classification.
AGENTIC AI

Wearable
Technologies
&
Responsible AI
SECTION 4

RESPONSIBLE AI FOR
WEARABLE HEALTHCARE APPLICATIONS
DATA & BIASES
Demographic
Unbalanced
Language
Political bias
PRIVACY & DATA SECURITY
HUMAN-AI COLLABORATION
TRANSPARENCY & EXPLAINABILITY
ROBUSTNESS & RELIABILITY
Data Augmentation and Generative
Challenges in Acceptance

What has changed?
Technology
Rise of Open-Source AI Challenging
Proprietary Models
Lower AI Deployment Costs Increasing
Accessibility
Specialized & Multimodal AI Reducing
Dependence on General Models
Wearable Technologies
Edge AI & On-Device Processing
Lower Costs & Faster Innovation
Enhanced Multimodal Integration
Geopolitical
Data is the new oil, a war on global dominance

CONTACT ME
+91 9042944206
[email protected]
linkedin.com/in/johnamose
SKCT, Coimbatore

FOR YOUR ‘ATTENTION’
linkedin.com/in/johnamose
THANK YOU!