Ppt presentation on voice morphing:two identities in one voice
Size: 759.69 KB
Language: en
Added: Oct 08, 2024
Slides: 19 pages
Slide Content
PresentedBy
NikamShreya Rajendra.
Third Year of Artificial Intelligence and Data Science
Engineering
Department of Artificial Intelligence and Data Science Engineering
JaihindCollege of Engineering, Kuran
SAVITRIBAI PHULE PUNE UNIVERSITY
2023-2024
Presentation on
“ Voice Morphing : Two Identities In One Voice”
PROBLEM STATEMENT :
Invariousapplicationssuchasentertainment,security,andtelecommunications,thereisagrowing
demandforcreatinghybridvoicesthatcombinecharacteristicsoftwodifferentindividuals.This
process,knownasvoicemorphing,aimstoblenduniquevocaltraitssuchaspitch,tone,accent,and
speechpatternsoftwospeakersintoasingle,coherentoutput.
MOTIVATION :
1.Creative Applications in Entertainment
2. Personalization in Virtual Assistants
3. Medical and Therapeutic Applications
4. Security and Identity Protection
5. Psychological and Emotional Impact
ALGORITHMIC SURVEY :
1.Voice Preprocessing:
Input: Two distinct voice samples (Voice A and Voice B).
Noise Removal: Apply noise reduction techniques to clean the input voices for higher quality processing.
2. Feature Mapping:
Normalize Audio Features: Normalize the extracted features from both voices to ensure they are in a
compatible format for blending.
3. Voice Morphing Process:WeightingFactor Selection: Choose weighting factors (α, β) to control
the blending ratio between Voice A and Voice B. The formula for the morphed voice (Voice C) can be
represented as:
\text{Voice C} = \alpha \cdot\text{Voice A} + \beta \cdot\text{Voice B}
4. Emotion and Expression Blending:EmotionMapping: Analyze and map the emotional states in
both voices (e.g., happy, sad, angry) to preserve and blend emotional content in the morphed voice.Smooth
Transitions: Ensure smooth transitions between phonemes, prosody, and emotional expressions to avoid
abrupt changes in the final voice output.
FIGURE . overview of voice conversion on the basis of voice morphing.
RESULT ANALYSIS :
Analysis Parameter Description Observation
1.Voice Quality Resemblanceof the morphed
voice to the original identities.
Degree of similarity to original
voices;naturalnessscore.
2.Smoothness of TransitionSeamlessblending of vocal
characteristics.
No noticeable breaks or
artifactsbetween transition.
3.Pitch Average pitch and pitch
variation in morphed voices.
Pitch lies between two voices
4.Emotion Transfer Ability of morphed voicesto
convey emotions from original
voices.
Retaionsemotional tone from
both identities.
5.Word Recognization
Accuracy
Percentage of correctly
recognizedwords from the
morphed voice.
High accuracy in word
recognizationon by listeners.
ADVANTAGES & APPLICATIONS :
Applications:
1. Entertainment and Media : Animated Movies and TV Shows.
2. Film Post-Production : Dubbing and ADR.
3.Gaming and Virtual Reality.
4.Personalized Voices for Assistants.
5. Marketing and Branding.
6. Language Learning and Translation.