0% found this document useful (0 votes)

12 views

speech processing pbl

The project report focuses on pitch period estimation using the autocorrelation function, highlighting its importance in speech processing applications. It introduces a multi-line cut method to improve peak position estimation, enhancing accuracy over traditional methods. The report includes methods, MATLAB code, results, and concludes that the autocorrelation function is a practical tool for analyzing audio signals.

Uploaded by

ECE02 Abhishek Patel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views

speech processing pbl

Uploaded by

ECE02 Abhishek Patel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

A Project Report on

“Pitch Period Estimation using

Autocorrelation Function”

Submitted in Partial Fulfilment of the

Requirements for the Degree of
Bachelor of Technology
In
Electronics & Communication Engineering

Under Guidance of
Dr. Satish Kumar Singh

Head of Department
Prof. Deepak Nagaria

Submitted By
Aashi Diwakar(2004331001)
Abhishek Patel(2004331001)
Aditya Kumar(1904331006)

Department of Electronics & Communication Engineering

Bundelkhand Institute of Engineering & Technology
(An Autonomous Institute)

Jhansi (U.P.) India - 284128

Session 2023-24

1
CERTIFICATE

This is to certify that this Project Based Learning Report on “Pitch Period Estimation
using Autocorrelation Function” has been successfully deliveredby “Abhishek Patel,
Aditya Kumar and Aashi Diwakar (B.Tech. Final Year)” under the guidance of Dr.
Satish Kumar Singh for fulfilment of Bachelor of Technology degree from Bundelkhand
Institute of Engineering and Technology, Jhansi during academic year 2023-2024.

2
ACKNOWLEDGEMENT

This project would not have been possible without the kind support and help of many
individuals and organizations. We are indebted to our respected Head of Department, Dr.
Deepak Nagaria Sir for guiding us. We are also grateful to our project guide, Dr. Satish
Kumar Singh Sir, for his indomitable contribution and guidance without which this
project would have been impossible to complete. Our sincerest gratitude to all the
teachers, seniors and colleagues whose help and guidance brought this project to a
successful completion.

3
CONTENTS

S.No. Topic Page No.

1. Abstract 5

2. Introduction 6-7

3. Methods 8
4. Code 9-10

5. Results 10-11

6. Conclusion 12

7. References 13

4
1.ABSTRACT

In speech recognition and speech synthesis, accurate estimation of the pitch period is an important
part of speech processing. The traditional direct peak estimation method and the autocorrelation
function method are both effective time domain estimation algorithms. The autocorrelation method
is a pitch period estimation algorithm suitable for low SNR. Both algorithms need to get accurate
peak position estimation. In this paper, a multi-line cut method which is a method for judging the
position of the peak point is proposed. The multi-line cut method is used to intercept the sampled
data of the waveform by using multiple cut lines. The median value is calculated by the starting
and ending points of the cut line position, and the peak position is indirectly evaluated. By
minimizing the impact of interference on the peak estimate, the likelihood of falling into local
extreme points is reduced, therefore a more accurate peak point estimate than the direct search for
peak points can be obtained. The simulation results show that compared with the traditional direct
peak estimation method, the performance of peak estimation by the multi-line cut method can be
greatly improved, and the multi-line cut method can be used to estimate the peak value in the
autocorrelation method, and also achieve a certain performance improvement. In addition, the
number of cut lines is directly related to performance, and the more the number is, the better the
performance is. The complexity of this method is not high and easy to implement.

5
2.INTRODUCTION

In speech signal processing technology, the estimation of the pitch period is a very important link.
Pitch detection is widely used in speech analysis, speech synthesis, speech compression coding,
speech recognition and speech segmentation. For many years, researchers have proposed various
pitch detection algorithms, such as Autocorrelation Function method(ACF), Average Magnitude
Difference Function (AMDF), wavelet transform method, Cepstrum method, etc. In general, pitch
period extraction methods are mainly time domain estimation methods and transform domain
estimation methods. The time domain estimation method is to estimate the pitch period directly
from the waveform of the speech signal, and it has been applied very early, and it is widely used
because of its simple implementation and low computational complexity. The peak direct
estimation method is one of the time domain estimation methods and is still widely used at present.
The autocorrelation function method is also a time domain estimation method, which is suitable
for the pitch period extraction in the case of low SNR. The autocorrelation function method needs
to estimate the peak position when performing the pitch extraction. When the peak position is
inaccurate due to the local minimum value, the performance is affected. In this paper, a peak point
position estimation method will be described, which can make the judgment of the peak point
position more accurate, and relatively accurately estimate the pitch period of the speech signal.
The following is a description of the traditional peak direct estimation method and the short-term
autocorrelation function estimation method to estimate the pitch period. Then, the multi line cut
method proposed in this paper is introduced, and then the four methods are verified and evaluated.

Two time domain pitch period estimation methods In the time domain pitch period estimation
method, the traditional direct peak estimation method and the autocorrelation function method are
both effective algorithms. Among them, the autocorrelation method is a pitch period estimation
algorithm suitable for low SNR. Both algorithms require an accurate estimate of the peak position.
The following is a brief introduction to the two algorithms. Traditional peak direct estimation The
peak direct estimation method is to directly find two adjacent peak points of the periodic signal,
and calculate the interval time T between the two peak points, that is, the period of the signal.
However, due to the influence of noise and interference, this method may lead to inaccurate peak
point estimation, which results in inaccurate period estimation. However, this method is simple
and intuitive, and has low complexity. There are still many application scenarios.

Short-term autocorrelation function method The autocorrelation function method belongs to the
time domain estimation algorithm. Compared with other time domain algorithms, it has better anti-
noise interference characteristics. The extracted pitch contour features are obvious, the accuracy
is good, the implementation is simple, and it is also a widely used algorithm in the field of speech

6
signal processing. The principle of the algorithm is that the autocorrelation function value of the
speech signal will peak at an integer multiple of the pitch period, and the pitch period can be
extracted to estimate the pitch period. Autocorrelation calculation is performed for each frame
by the calculation formula of the short-time autocorrelation function.

7
3.Methods

Pitch period estimation is a crucial step in many audio and speech processing applications, such
as speech recognition and pitch-based musical analysis. The autocorrelation function is commonly
used for pitch period estimation. Here's a basic method using autocorrelation:

1.Preprocessing:
Begin by pre-processing the audio signal. Typically, this involves windowing the signal to reduce
spectral leakage. A commonly used window function is the Hamming window.

Normalize the signal to ensure consistent amplitude.

2.Peak Picking:
Identify peaks in the autocorrelation function. Peaks correspond to potential pitch periods. One
way to do this is to find local maxima in the autocorrelation function.

3.Pitch Period Estimation:

Once peaks are identified, the pitch period(P) can be estimated by selecting the lag corresponding
to the highest peak. The pitch period is related to the lag() by:

/Fs.

Where Fs is the sampling rate of the signal.

Basic Steps:

Preprocess the signal by applying a window function (e.g., Hamming) to mitigate spectral leakage
and normalize amplitude.

Compute the autocorrelation function using the “xcorr” function.

Identify peaks in the autocorrelation function using the “findpeaks” function.

Estimate the pitch period based on the lag corresponding to the highest peak.

8
4. Matlab Code

close all;
clc;
[x1,fs]=audioread('test1.wav');
T=1/fs;
plot(x1)
ylabel('Amplitude')
xlabel('Time (s)');
title('Synthesis Signal');
x=x1 (700:800);
[rxx lags] = xcorr (x,x);
figure
plot (lags, rxx)
xlabel('lag');
ylabel('Correlation Measurement');
title('Auto-correlation Function')
first_peak_loc= length (x)+1;
min_period_in_samples=30;
half_min = min_period_in_samples/2;
seq(first_peak_loc-half_min:first_peak_loc+half_min) = min(seq);
plot(rxx, 'rx');
hold on
plot (seq)
[max_val second_peak_loc] = max(seq);
period_in_samples = abs (second_peak_loc-first_peak_loc)
period_in_samples = period_in_samples*T
9
Fundamental_frequency=1/period_in_samples
sound(x1)

5.Results

MATLAB 7.0 is used for our calculations. We chose MATLAB as our programming environment
as it offers many advantages. It contains a variety of signal processing and statistical tools, which
help users in generating a variety of signals and plotting them. MATLAB excels at numerical
computations, especially when dealing with vectors or matrices of data. One of the speech signal
used in this study is given with Fig.02 algorithm pitch periods estimation using autocorrelation
function..

Figure 1-sample audio signal

10
Figure 2-Autocorrelation vs Lag in Discrete domain.

11
6.Conclusion

In conclusion, pitch period estimation using the autocorrelation function in MATLAB is a practical
and accessible method for analyzing periodicities in audio signals. MATLAB provides a
convenient environment for signal processing tasks, including pitch period estimation using the
autocorrelation function. The xcorr function is employed to compute the autocorrelation, and the
findpeaks function helps identify peaks in the autocorrelation function. Basic Steps involves
preprocess the signal by applying a window function (e.g., Hamming) to mitigate spectral leakage
and normalize amplitude. Compute the autocorrelation function using the xcorr function. Identify
peaks in the autocorrelation function using the findpeaks function. Estimate the pitch period based
on the lag corresponding to the highest peak.

In summary, the theoretical foundation of pitch period estimation using autocorrelation in

MATLAB rests on sound principles of signal processing, and its practical implementation involves
careful parameter tuning and consideration of the characteristics of the analyzed signals.
Continuous refinement and adaptation make this method a valuable tool in various audio
processing applications.

12
7.References

• https://www.researchgate.net/publication/259823741_VoicedUnvoiced_Decision_for_Sp
eech_Signals_Based_on_Zero-Crossing_Rate_and_Energy
• https://www.mathworks.com/help/signal/ref/zerocrossrate.html
• "Fundamentals of Speech Recognition" by Lawrence Rabiner and Biing-Hwang Juang.
• https://www.youtube.com/watch?v=q9nki9ksHHs&t=214s

The SSD Solution Composition and SSD Chemical Formula
82% (11)
The SSD Solution Composition and SSD Chemical Formula
2 pages
Digital Modulations using Matlab
From Everand
Digital Modulations using Matlab
Mathuranathan Viswanathan
4/5 (6)
A Tutorial To Extract The Pitch in Speech Signals Using Autocorrelation
No ratings yet
A Tutorial To Extract The Pitch in Speech Signals Using Autocorrelation
11 pages
2-A Comparative Study of Time-Delay Estimation Techniques Using
No ratings yet
2-A Comparative Study of Time-Delay Estimation Techniques Using
57 pages
The Troll Pits of Hextor (9160671)
86% (7)
The Troll Pits of Hextor (9160671)
45 pages
Scala Dynamic Cone Penetrometer Test: DOP DOP DOP DOP DOP
No ratings yet
Scala Dynamic Cone Penetrometer Test: DOP DOP DOP DOP DOP
12 pages
Eup - C - 08-Pitch Tracking
No ratings yet
Eup - C - 08-Pitch Tracking
10 pages
Pitch
No ratings yet
Pitch
6 pages
Chapter 4: Pitch Estimation For Music Signal Processing: KH Wong
No ratings yet
Chapter 4: Pitch Estimation For Music Signal Processing: KH Wong
33 pages
D2 Report 2022JTM2399
No ratings yet
D2 Report 2022JTM2399
18 pages
A Pitch Detection Method Based On Continuous Wavelet Transform For Harmonic Signal
No ratings yet
A Pitch Detection Method Based On Continuous Wavelet Transform For Harmonic Signal
10 pages
Pitch Estimation Explanation
No ratings yet
Pitch Estimation Explanation
15 pages
Period Tracking Using Autocorrelation - Dadorran
No ratings yet
Period Tracking Using Autocorrelation - Dadorran
7 pages
Pitch Detection of Speech Signals (Project Report)
No ratings yet
Pitch Detection of Speech Signals (Project Report)
9 pages
A Practical Handbook of Speech Coders
No ratings yet
A Practical Handbook of Speech Coders
15 pages
Pitch Estimation Using A Full/Multi-Band Approaches: Mikhail Tadjikov, Arya Ahmadi
No ratings yet
Pitch Estimation Using A Full/Multi-Band Approaches: Mikhail Tadjikov, Arya Ahmadi
5 pages
Activity4 Uzair
No ratings yet
Activity4 Uzair
6 pages
Image Enhancement
No ratings yet
Image Enhancement
14 pages
DSP Lab Report
No ratings yet
DSP Lab Report
19 pages
Gender Recognition by Speech Analysis
No ratings yet
Gender Recognition by Speech Analysis
24 pages
Lecture 23
No ratings yet
Lecture 23
10 pages
Chapter 3 Nonparameteric Power Spectrum Estimation
No ratings yet
Chapter 3 Nonparameteric Power Spectrum Estimation
61 pages
Pitch Detection Algorithms
No ratings yet
Pitch Detection Algorithms
21 pages
Instantaneous Pitch Estimation Algorithm Based On Multirate Sampling
No ratings yet
Instantaneous Pitch Estimation Algorithm Based On Multirate Sampling
5 pages
Research and Design of Snow Hydrology Sensors and Instrumentation: Selected Research Papers
From Everand
Research and Design of Snow Hydrology Sensors and Instrumentation: Selected Research Papers
Raman K. Attri
No ratings yet
Applied Digital Signal Processing and Applications
From Everand
Applied Digital Signal Processing and Applications
Othman Omran Khalifa
No ratings yet
Fast and Accurate Generalized Harmonic Analysis Using Newton's Method
No ratings yet
Fast and Accurate Generalized Harmonic Analysis Using Newton's Method
10 pages
lec36
No ratings yet
lec36
13 pages
Spectral Estimation
No ratings yet
Spectral Estimation
79 pages
Spectrum Estimation
No ratings yet
Spectrum Estimation
39 pages
Robust Pitch Detection Using DCT Based Spectral Autocorrelation
No ratings yet
Robust Pitch Detection Using DCT Based Spectral Autocorrelation
20 pages
Pitch Tracking: 1. Pitch Tracking 2. Spectral Approaches 3. Time Domain 4. Example Algorithms
No ratings yet
Pitch Tracking: 1. Pitch Tracking 2. Spectral Approaches 3. Time Domain 4. Example Algorithms
18 pages
Barlett Method
No ratings yet
Barlett Method
15 pages
Vocal Pitch Detection For Musical Transcription PDF
No ratings yet
Vocal Pitch Detection For Musical Transcription PDF
3 pages
Analysisof Speech Signal 29 TH October 2018
No ratings yet
Analysisof Speech Signal 29 TH October 2018
16 pages
Intelligent Technologies for Research and Engineering
From Everand
Intelligent Technologies for Research and Engineering
S. Kannadhasan
No ratings yet
Advanced Signal Integrity for High-Speed Digital Designs
From Everand
Advanced Signal Integrity for High-Speed Digital Designs
Stephen H. Hall
No ratings yet
Automatic Target Recognition: Advances in Computer Vision Techniques for Target Recognition
From Everand
Automatic Target Recognition: Advances in Computer Vision Techniques for Target Recognition
Fouad Sabry
No ratings yet
Pitch Tracking - ACF - BABU ARUN KR.
No ratings yet
Pitch Tracking - ACF - BABU ARUN KR.
6 pages
Developing A MATLAB Code For Fundamental Frequency and Pitch Estimation From Audio Signal
No ratings yet
Developing A MATLAB Code For Fundamental Frequency and Pitch Estimation From Audio Signal
16 pages
Automatic Target Recognition: Fundamentals and Applications
From Everand
Automatic Target Recognition: Fundamentals and Applications
Fouad Sabry
No ratings yet
The Columbine Massacre - Barack Obama - Zionist Wolf in Sheep's (PDFDrive)
No ratings yet
The Columbine Massacre - Barack Obama - Zionist Wolf in Sheep's (PDFDrive)
18 pages
Program Exp 5
No ratings yet
Program Exp 5
2 pages
Investigation of Simultaneous Audio Sources Localization: XXX, IEEE Member, XXX, IEEE Member and XXX
No ratings yet
Investigation of Simultaneous Audio Sources Localization: XXX, IEEE Member, XXX, IEEE Member and XXX
4 pages
Ee4015 Matlab3
No ratings yet
Ee4015 Matlab3
4 pages
Real-Time Critical Systems
From Everand
Real-Time Critical Systems
Jordan Lee Mauro-Buhagiar
3/5 (1)
Seismic Instrumentation Design: Selected Research Papers on Basic Concepts
From Everand
Seismic Instrumentation Design: Selected Research Papers on Basic Concepts
Raman K. Attri
No ratings yet
Hugging Face Transformers Essentials: From Fine-Tuning to Deployment
From Everand
Hugging Face Transformers Essentials: From Fine-Tuning to Deployment
Robert Johnson
No ratings yet
Analysis of Power Spectrum Estimation Using Welch Method For Various Window Techniques
No ratings yet
Analysis of Power Spectrum Estimation Using Welch Method For Various Window Techniques
4 pages
Welch PSD Estimator Using Winows: SL - No. Title
No ratings yet
Welch PSD Estimator Using Winows: SL - No. Title
11 pages
Activity 4
No ratings yet
Activity 4
2 pages
Advanced Backend Code Optimization
From Everand
Advanced Backend Code Optimization
Sid Touati
No ratings yet
Abstract 3. Power Spectral Density 4. Welch Method 5. Result 6. Conclusion
No ratings yet
Abstract 3. Power Spectral Density 4. Welch Method 5. Result 6. Conclusion
10 pages
LPC Vocoder: 1-Introduction
No ratings yet
LPC Vocoder: 1-Introduction
12 pages
Spectrum Estimation: Presentation by Dr. K.Muthumeenakshi Asso - Prof / ECE SSN College of Engineering
No ratings yet
Spectrum Estimation: Presentation by Dr. K.Muthumeenakshi Asso - Prof / ECE SSN College of Engineering
51 pages
Rahim Karim J 201410 PHD
No ratings yet
Rahim Karim J 201410 PHD
210 pages
Ramaiah University of Applied Sciences: Faculty of Engineering & Technology Lab Exam Question Paper - M. Tech
No ratings yet
Ramaiah University of Applied Sciences: Faculty of Engineering & Technology Lab Exam Question Paper - M. Tech
7 pages
Digital Filters Design for Signal and Image Processing
From Everand
Digital Filters Design for Signal and Image Processing
Mohamed Najim
No ratings yet
ICSC_FreqEst_2016
No ratings yet
ICSC_FreqEst_2016
4 pages
Frequency-Domain Techniques For High-Quality Voice Modification
No ratings yet
Frequency-Domain Techniques For High-Quality Voice Modification
5 pages
Digital Measurement of Phase Difference - A Comparative Study of DSP Algorithms
No ratings yet
Digital Measurement of Phase Difference - A Comparative Study of DSP Algorithms
15 pages
A Practitioner's Approach for Problem-Solving using AI
From Everand
A Practitioner's Approach for Problem-Solving using AI
Satvik Vats
No ratings yet
Fast Nearly ML Estimation of The Parameters of Real or Complex Single Tones or Resolved Multiple Tones
No ratings yet
Fast Nearly ML Estimation of The Parameters of Real or Complex Single Tones or Resolved Multiple Tones
9 pages
Creo Advanced Framework Extension Data Sheet (English)
No ratings yet
Creo Advanced Framework Extension Data Sheet (English)
3 pages
Middle East Map Study Guide 2017
No ratings yet
Middle East Map Study Guide 2017
2 pages
Appraisal of Griddle Cakes and Waffles
0% (2)
Appraisal of Griddle Cakes and Waffles
9 pages
Baker Line
50% (2)
Baker Line
65 pages
A Partial Decipherment of The Indus Vall
No ratings yet
A Partial Decipherment of The Indus Vall
259 pages
HP Deskjet F2100 All-in-One Series: Basics Guide
No ratings yet
HP Deskjet F2100 All-in-One Series: Basics Guide
17 pages
Types of High Voltage Generators
No ratings yet
Types of High Voltage Generators
7 pages
Strain Tensor-1
No ratings yet
Strain Tensor-1
14 pages
Research Method
No ratings yet
Research Method
37 pages
Barudak Jawi_menu Book_feb 2025 + Barja Mubarak
No ratings yet
Barudak Jawi_menu Book_feb 2025 + Barja Mubarak
42 pages
Doorpost Summary
No ratings yet
Doorpost Summary
2 pages
Modalloy : Product Datasheet Non Ferrous Metal Treatment
0% (1)
Modalloy : Product Datasheet Non Ferrous Metal Treatment
4 pages
CAT 797F Off-Highway Truck: Order Today - Only 2,500 Will Be Produced!
No ratings yet
CAT 797F Off-Highway Truck: Order Today - Only 2,500 Will Be Produced!
2 pages
Shooting Star
No ratings yet
Shooting Star
2 pages
(GED109) Chapter 10 Gene Therapy
No ratings yet
(GED109) Chapter 10 Gene Therapy
23 pages
2018 The Hidden Dangers of Fast and Processed Food PDF
100% (1)
2018 The Hidden Dangers of Fast and Processed Food PDF
7 pages
Bio Circle L
No ratings yet
Bio Circle L
9 pages
Essence of Character Assimilation
No ratings yet
Essence of Character Assimilation
2 pages
Performance Measurement of Mining Equipment: International Journal of Emerging Technology and Advanced Engineering
No ratings yet
Performance Measurement of Mining Equipment: International Journal of Emerging Technology and Advanced Engineering
9 pages
Batch Distillation: Group 7 Errynne Yanza Hosleck Galasinao Aron Balines
No ratings yet
Batch Distillation: Group 7 Errynne Yanza Hosleck Galasinao Aron Balines
18 pages
11 KV 6 3 KV Trans - Epc
No ratings yet
11 KV 6 3 KV Trans - Epc
16 pages
Artifact Cyoa Final r2
No ratings yet
Artifact Cyoa Final r2
15 pages
Antifungal Breakpoints V 9.0 180212
No ratings yet
Antifungal Breakpoints V 9.0 180212
5 pages
Thanongsak Nochaiya, Watcharapong Wongkeo, Arnon Chaipanich: Sciencedirect
No ratings yet
Thanongsak Nochaiya, Watcharapong Wongkeo, Arnon Chaipanich: Sciencedirect
7 pages
Profit Improvement Plan: - Cut & Sew
No ratings yet
Profit Improvement Plan: - Cut & Sew
9 pages
EEE2203 PHYSICAL ELECTRONICS II Lesson1
No ratings yet
EEE2203 PHYSICAL ELECTRONICS II Lesson1
11 pages
Self WT of Post 4.0kg /M X 2.0m 8.0 KG 0.10 KN Self WT of Chain Link Mesh 3.1 KG/M X 3.0 X 2.75 25.6 KG 0.26 KN Other Accessories 10 KG 0.1 KN Total
No ratings yet
Self WT of Post 4.0kg /M X 2.0m 8.0 KG 0.10 KN Self WT of Chain Link Mesh 3.1 KG/M X 3.0 X 2.75 25.6 KG 0.26 KN Other Accessories 10 KG 0.1 KN Total
1 page

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

speech processing pbl

Uploaded by

speech processing pbl

Uploaded by

A Project Report on

“Pitch Period Estimation using

Submitted in Partial Fulfilment of the

Department of Electronics & Communication Engineering

Jhansi (U.P.) India - 284128

S.No. Topic Page No.

Normalize the signal to ensure consistent amplitude.

3.Pitch Period Estimation:

Where Fs is the sampling rate of the signal.

Compute the autocorrelation function using the “xcorr” function.

Identify peaks in the autocorrelation function using the “findpeaks” function.

Figure 1-sample audio signal

In summary, the theoretical foundation of pitch period estimation using autocorrelation in

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.