Voice acoustic test analysis

　　The purpose is to introduce the classification of voice acoustic testing and analysis methods and their clinical significance. Methods Voice acoustic detection analysis is mainly divided into two types: subjective psychoacoustic assessment and objective acoustic detection analysis. The most used subjective psychoacoustic assessment method is GRBAS, which includes: G(ove trap¨g trap de degree), R(∞ugh), B(breath), A(asthenic), S(strained); objective testing and analysis methods include: voice acoustic analysis, vocal fold morphology and motion test, laryngeal aerodynamic test, and laryngeal muscle test. The objective testing methods included: voice acoustic analysis, vocal fold morphology and movement, laryngeal aerodynamic analysis, and laryngeal electromyography. The results of voice acoustic analysis can check the effect of laryngeal disease treatment and the improvement of laryngeal function. It can monitor the articulatory status of laryngeal users and provide an auxiliary diagnosis of laryngeal disease. Conclusion Voice acoustic test analysis is one of the main methods to assess the laryngeal function, and the use of multi-parameter integrated test analysis will give a more comprehensive picture of the physiological function of the larynx.　　Speech is an important tool for human information communication, and voice is the basis of speech formation, which is formed by vocal organs under the control of central nervous system. The human vocal organs include the dynamic part (lungs, diaphragm), the vibratory part (vocal folds, pseudo-vocal folds), the resonant part (pharyngeal cavity, laryngeal cavity), and the constructive part (lips, teeth). The vibration of vocal folds is the key to produce voice. When the airflow from the lungs reaches the closed vocal folds, a kind of subglottal pressure is generated, and when the pressure reaches the level where the closed vocal folds can be blown open, the aerodynamic energy is converted into acoustic energy and the vocal folds vibrate to produce sound. Voice includes pitch, intensity and timbre. Tone is related to the frequency of vocal cord vibration, and intensity is related to the amplitude of vocal cord vibration; if the frequency is fast, the tone is high, and if the amplitude is large, the voice is strong. Tone is related to the length, thickness and tension of the vocal cords, which are short, thin and tense, and vibrate at a fast frequency and high pitch. Tone is the characteristic that distinguishes two voices of the same loudness and pitch from each other in hearing. In short, because the length, quality, tension, resonance cavity shape, air flow through the vocal cords, and pressure under the vocal cords of each person are somewhat different, it is impossible for everyone’s voice to be exactly the same, and the difference in voice between different sexes and different ages is more obvious.　　With the widespread implementation of partial laryngectomy and microlaryngeal surgery to preserve the function of laryngeal articulation, more and more attention is being paid to voice analysis in clinical practice. Although there is no unified international standard for evaluation, quantitative analysis of various parameters in voice signals with the help of microcomputers and various instruments has undoubtedly become a non-invasive and painless objective method for studying laryngeal vocal function and laryngeal diseases. The International Association of Logopedics and Phoniatr.cs (IALP) was established in Vienna in 1924. The Society organizes regular academic conferences to exchange the latest achievements in the field of voice and speech medicine, publishes international journals on voice and speech, and sponsors scientific research projects on voice and speech medicine. In most countries, voice medicine is a branch of otolaryngology, but in a few countries it is a separate specialty from other medical specialties.　　The development process of voice medicine research is gradually deepening with the emergence of electronics, optics and computer technology, and it is closely related to basic sciences such as physics, chemistry and biochemistry, and closely related to basic medicine such as physiology, pathology, anatomy, linguistics and psychology. Abnormal voice as an acoustic signal reflects pathological changes of the larynx, which is often caused by abnormal airflow in the respiratory tract and abnormal vibration of the vocal cords p”. In the past, most otolaryngologists have been estimating the effectiveness of laryngeal diseases by the degree of mute voice based on auditory impressions and experience; however, this classification is subjective and there is no common assessment standard in otolaryngology. Yanagihara proposed a method of classifying hoarseness on a speech instrument based on this experiment, which has been applied to check the effectiveness of laryngeal disease treatment and improvement of functional recovery. The laryngeal condition of singers, actors and professional laryngectomees can be monitored. The disadvantage of this method is that it cannot quantitatively represent the changes in frequency and resonance peak structure and hoarseness; it is judged visually and is inevitably influenced by subjective factors.’…. Since the emergence of electronic computers in the 1960s, the scope of voice analysis has been broadened, and acoustic instruments such as spectrum analyzers have been used to extract various common and unique parameters of the voice to objectively assess the acoustic characteristics of normal and pathological voices, providing clinicians with an objective and non-invasive method of supporting diagnosis.　　Voice acoustic testing and analysis is divided into two main types: subjective psycho-auditory assessment and objective acoustic testing and analysis.　　1. Subjective psycho-auditory assessment method The most used internationally is the GRBAS assessment standard developed by the Japanese Society of Phonetic and Speech Medicine in 1979. It includes: total deafness G (overaIlgrade degree); rough type R (rough), which is caused by irregular vibration of the vocal folds due to various etiologies, and patients with vocal fold polyps are prone to this type: breath type B (breath), which is caused by incomplete closure of the vocal folds during articulation and increased exhaled airflow, and is more often seen in people with sulcus vocal fold and vocal fold paralysis; weak type A (asthenic), when the vocal folds become thinner, the quality is reduced, and the tension is reduced, it is easy to have an A-type voice. Each type is further divided into four grades, with normal being grade 0, mild being grade 1, moderate being grade 2, and severe being grade 3. Since GRBAS assessment is a subjective assessment method, there are inevitably some differences among the assessors. Therefore, in clinical application, the assessment must be performed by an experienced voice specialist or speech pathologist or voice therapist, and it is better for three people to perform their own independent assessment of the same voice sample, and then take the average of the three people as the assessment result. There are not many experimental reports on the application of subjective psychological auditory assessment in domestic journals. pJ It is believed that with the advancement of voice medicine research, subjective psychological auditory assessment will be widely used as an important part of laryngeal articulatory function assessment.　　Objective testing and analysis methods Objective testing of laryngeal function mainly consists of three parts: (1) voice acoustic analysis; (2) vocal fold morphology and motion testing; (3) laryngeal aerodynamic testing and laryngeal electromyography analysis.　　2.1 Acoustic analysis of voice. Voice acoustic analysis is now mostly performed by electronic computers equipped with analysis software for various parameters, and various types of analyzers can detect more than twenty parameters, but the most valuable parameters for clinical purposes are: fundamental frequency, sound domain, resonance peak, maximum articulation time, perturbation value, harmonic-to-noise ratio and standardized noise energy detection value.　　Before doing voice acoustic test, we should take a detailed medical history, perform a routine physical examination, do laryngeal shape, respiratory status, neck palpation, laryngeal body mobility, laryngoscopy (indirect or fiberoptic laryngoscopy), and necessary laboratory tests. A testing protocol is developed after the initial clinical diagnosis is made on the subject, and the testers are required to be trained professionals to ensure consistency of testing between samples or at different times for the same sample. The experimental instruments are regularly calibrated to ensure accurate and reliable test results. The ambient noise should be less than 45 dB, preferably in a soundproof room, and the vowels “e”, “i” and “a”, which are least disturbed by the mouth and tongue, should be selected for the test. Before the test, the subject was asked to do a short pronunciation practice to ensure a smooth pronunciation. The subject’s mouth was placed 1 5 cm from the microphone and the smooth middle part of the voice sample was taken for analysis. The main parameters of clinical voice analysis are: ① frequency (FREQUENCY). Frequency is the intrinsic frequency of vocal fold vibration, expressed in Hz, which is the number of vocal fold vibrations per second. The most representative frequency parameter is the fundamental frequency (Fo), and F0 is affected by age and gender. range). Sound range refers to the frequency range between the highest and lowest sound, because the highest sound is divided into the highest true sound and the highest false sound, so the human voice range is also divided into the true sound range and false sound range, sound range value in octaves or semitones (two frequencies for the twelfth root of the frequency range between the two) as a unit. Older women’s true and false voice range becomes narrower with age, and older people’s true voice range becomes narrower and false voice range becomes wider with age; ③ Intensity. Intensity is a physical unit related to the loudness of the voice, expressed in decibels (dB). The greater the pressure under the sound gate, the greater the amplitude of the vocal cord vibration, and the greater the resulting sound intensity; ④ resonance peak (formant). The resonance peak of the voice is generated by the resonance cavity between the vocal folds and the lips, and the position of the lips, teeth and tongue can control the size of the resonance cavity. The frequency location of the resonance peak envelope and the overtones in the resonance peak envelope determine the voice quality and timbre, so the resonance peak is of greater use in artistic noise research o “1; ⑤ perturbation (perturbat. on). Jitter represents the small variation of the voice signal period with time, which is also expressed by the frequency perturbation percentage (JR) in the United States, and the frequency perturbation quotient (pitch perturbationquotient) PPQ and amplitude perturbation quotient (amplitude perturbation quotient) in Japan. amplitude perturbation quotient) APQ to express perturbation. Since Lieberman (1961) first proposed the method of detecting perturbation, frequency perturbation and amplitude perturbation have become an important parameter for clinical analysis of pathological voice. In China (1 989), after applying this detection technique to clinical practice, it was found that voice perturbation was positively correlated with the pattern of vocal fold vibration, amplitude, mucosal wave, and vocal gate closure status; (6) Voice noise test. o…1 by Kitajlma et al. in the early 1980s, mmoto et al. (iv) used harmonic to noise ratio H, N (harmonIcto noise) to analyze voice In the mid-1980s, Kasuya, Kikuchi et al. o…1 proposed the standardized noise energy level NNEa (normalized noise energies a) and NNEb (normalized noise energies b) test methods. nnea and nneb NNEa and NNEb represent the noise from 0 to 4 kHz and 1 to 4 kHz, respectively. Since the clinical application in China in the 1980s, it has been found that H, N and NNE values are one of the most important parameters for objective voice detection and analysis, as well as muscleter, Shimmer, APQ and PPQ, and can be used as an objective quantitative assessment index for comparison and analysis of laryngeal disease before and after treatment in clinical practice.　　2.2 Vocal fold motion detection. Using stroboscopy (Videostroboscopy) to check the vocal fold vibration in ① symmetrical (SYM); ② regularity (REG); ③ amplitude (AMP…ude, AMP): ④ glottal cIosure (GLO); ⑤ mucosal wave (MUC); ⑥vocal fold vibration homogeneity (nonVibrating portion, NON).　　2.3 Aerodynamics of the larynx (aerodynamics) test. The test items are ①mean a.rflow rate (MFR) o..1; ②maximum phonation time (MPT); ③subgIollic pressu re (SP); ④glottaI resIstance (GR) resIstance, GR). If necessary, laryngeal electromyogram (eIectromyog raphy, EMG) examinations”””1 and electroglottography (eIectrogIottography, EGG) examinations can also be performed’.211 Voice acoustics The test analysis is one of the main methods of laryngeal articulatory function assessment, but its analysis results are not yet able to fully reflect the functional status of the larynx. Only by using multi-parameter comprehensive analysis, conducting test analysis from different subjective and objective perspectives, and comparing the test results with controls of the same age and gender can we more accurately reflect the true physiological functional status of the larynx.