CN105632523A - Method and device for regulating sound volume output value of audio data, and terminal - Google Patents

Method and device for regulating sound volume output value of audio data, and terminal Download PDF

Info

Publication number
CN105632523A
CN105632523A CN201511000762.2A CN201511000762A CN105632523A CN 105632523 A CN105632523 A CN 105632523A CN 201511000762 A CN201511000762 A CN 201511000762A CN 105632523 A CN105632523 A CN 105632523A
Authority
CN
China
Prior art keywords
audio frequency
data
audio
played
frequency data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201511000762.2A
Other languages
Chinese (zh)
Other versions
CN105632523B (en
Inventor
王文熹
郑美玲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Meizu Technology Co Ltd
Original Assignee
Meizu Technology China Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Meizu Technology China Co Ltd filed Critical Meizu Technology China Co Ltd
Priority to CN201511000762.2A priority Critical patent/CN105632523B/en
Publication of CN105632523A publication Critical patent/CN105632523A/en
Application granted granted Critical
Publication of CN105632523B publication Critical patent/CN105632523B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B19/00Driving, starting, stopping record carriers not specifically of filamentary or web form, or of supports therefor; Control thereof; Control of operating function ; Driving both disc and head
    • G11B19/02Control of operating function, e.g. switching from recording to reproducing

Landscapes

  • Circuit For Audible Band Transducer (AREA)

Abstract

The invention provides a method and a device for regulating a sound volume output value of audio data. The method includes the steps of: reading audio data to be played and currently-played audio data; extracting music style feature information of the audio data to be played and the currently-played audio data; comparing the music style feature information of the audio data to be played with that of the currently-played audio data; determining a music style type of the audio data to be played relative to the currently-played audio data according to a comparison result; calculating a sound volume gain value relative to a sound volume output value of the currently-played audio data according to the determined music style type; and regulating the sound volume output value of the audio data to be played according to the sound volume gain value after the playing of the currently-played audio data is stopped. According to the invention, how to automatically regulate the sound volume output of different types of audio files is solved, and the output sound volume of the audio files can be regulated according to a practical situation of currently-played music.

Description

Regulate the method for volume output value of audio frequency data and device and terminal
Technical field
The present invention relates to a kind of sound-volume management method and system, particularly relate to the method for the volume output value regulating audio frequency data after a kind of and current audio plays compares and device and terminal.
Background technology
Existing mobile terminal or multimedia player are when playing music, user generally uses earphone or loud speaker when listening to, but owing to the type of each music is different, such as user listens to music, the bent wind of the first melodies in front and back two is different, and a front one first soft then head is loud and sonorous, if the volume remained unchanged, then the hearing of user can be caused damage, bring not good experience to user.
Prior art have employed following method when solving this kind of problem: by obtaining label data corresponding to each audio file, the label of such as mp3 file is called as ID3, storing the title of this audio documents in this label, sing the information such as (playing) person, composition (word) person, the lyrics, audio types, above-mentioned audio types comprises rock and roll, jazz, rural area, allusion etc. The output signal of the audio documents (such as rock and roll, rural area) of different types has different amplitudes, tone and frequency, by reading this label (such as ID3), know the audio types that this music is corresponding, thus search the volume output value that default different audio types are corresponding, thus reach the object automatically regulating music output volume. But, if this kind of method run into audio file label do not represent audio types or audio file actual cannot Perfect Matchings preset volume output value time, the volume output value that just cannot make to preset mates audio file accurately.
Summary of the invention
Based on this, it is necessary to export problem for how automatically regulating the volume of dissimilar audio file, it provides a kind of method of volume output value regulating audio frequency data and device.
Regulate a method for the volume output value of audio frequency data, comprising:
Read audio frequency data to be played and current playing audio-fequency data;
Extract the bent wind characteristic information of described audio frequency data to be played and current playing audio-fequency data and compare, obtain comparison result;
According to comparison result, calculate the volume gain value of the volume output value relative to described current audio plays, after terminating to play current playing audio-fequency data, regulate the volume output value of described audio frequency data to be played according to described volume gain value.
Wherein in an embodiment, described bent wind characteristic information comprises: characterize the frequency values of low-and high-frequency switching times in audio frequency data, the center of mass values characterizing basic frequency band in the strong and weak audio amplitude of note vibrations, reflection audio frequency frame signal, reflection sound signal frequency spectrum data, beginning or ending with particular model in one or more combination.
Wherein in an embodiment, described audio frequency data to be played and current playing audio-fequency data are the audio frequency data extracted in the audio frequency data extracted in the next files in stream media taking from streaming media playing list respectively and the stream medium data currently play; Or take from same files in stream media next section of part to be played respectively and currently playing audio frequency data corresponding to part.
Wherein in an embodiment, the step of audio frequency data to be played described in described comparison and the bent wind characteristic information of current playing audio-fequency data comprises:
First the bent wind characteristic information extracted based on single audio frequency frame, the similarity of audio frequency data to be played and current playing audio-fequency data described in comparison frame by frame; Then, based on the parameter value of the bent wind characteristic information of the sign extracted in described default sample region, compare described audio frequency data to be played and the size of current playing audio-fequency data interior corresponding parametric values in described default sample region;
Or, based on the parameter value of the bent wind characteristic information of the sign extracted in described default sample region, compare described audio frequency data to be played and the size of current playing audio-fequency data interior corresponding parametric values in described default sample region.
Wherein in an embodiment, it is the time interval at the maximum portion of audio data place of the time range, the time interval at audio frequency data sound intermediate frequency amplitude highest portion point place or the low-and high-frequency switching times that pre-set between described default sample region.
Wherein in an embodiment, described based on single audio frequency frame extract bent wind characteristic information, described in comparison, the determination step of audio frequency data to be played and current playing audio-fequency data similarity comprises frame by frame:
Based on from described audio frequency data to be played and current playing audio-fequency data, the parameter value characterizing described bent wind characteristic parameter information that each audio frequency frame of extracting is corresponding, utilize weighted euclidean distance calculation formula to calculate the Euclidean distance of parameter value corresponding to each audio frequency frame respectively;
To the corresponding Euclidean distance value sequence of each audio frequency frame calculated;
Find out minimum Euclidean distance value, judge whether described minimum Euclidean distance value is less than a threshold value, if then exporting the result of determination that described audio frequency data to be played are similar to current playing audio-fequency data; Otherwise, then the result of determination that described audio frequency data to be played are not similar to current playing audio-fequency data is exported.
Wherein in an embodiment, the described method comparing described audio frequency data to be played and current playing audio-fequency data interior corresponding parametric values size in described default sample region comprises:
Based on from described audio frequency data to be played and current playing audio-fequency data, the parameter value characterizing described bent wind characteristic parameter information that each audio frequency frame of extracting is corresponding, described parameter value corresponding to audio frequency frame of selecting to be positioned between described default sample region is as sample;
The sample mean estimation technique is adopted to obtain the trend estimation value of introversion wind characteristic parameter information between described default sample region;
The relatively size of the trend estimation value that described audio frequency data to be played are corresponding with current playing audio-fequency data.
Based on aforesaid method, present invention also offers the device of a kind of volume output value regulating audio frequency data, comprising:
Audio frequency data extraction module, for reading audio frequency data to be played and current playing audio-fequency data;
Characteristic information extracting module, for extracting the bent wind characteristic information of described audio frequency data to be played and current playing audio-fequency data;
Characteristic information comparing module, for the bent wind characteristic information of audio frequency data to be played described in comparison and current playing audio-fequency data, obtains comparison result; And
Output module, for according to described comparison result, exporting the volume gain value exported relative to current volume, regulates current volume output value when playing audio frequency data to be played according to described volume gain value.
Wherein in an embodiment, described characteristic information comparing module comprises:
Euclidean distance calculates unit, for based on from described audio frequency data to be played and current playing audio-fequency data, the parameter value characterizing described bent wind characteristic parameter information that each audio frequency frame of extracting is corresponding, utilize weighted euclidean distance calculation formula to calculate the Euclidean distance of parameter value corresponding to each audio frequency frame respectively;
Sequencing unit, for the corresponding Euclidean distance value sequence of each audio frequency frame calculated; And
Judging unit, for finding out minimum Euclidean distance value, judges whether described minimum Euclidean distance value is less than a threshold value, if then exporting the result of determination that described audio frequency data to be played are similar to current playing audio-fequency data; Otherwise, then the result of determination that described audio frequency data to be played are not similar to current playing audio-fequency data is exported.
Wherein in an embodiment, described characteristic information comparing module comprises:
Sample extraction unit, for based on from described audio frequency data to be played and current playing audio-fequency data, the parameter value characterizing described bent wind characteristic parameter information that each audio frequency frame of extracting is corresponding, described parameter value corresponding to audio frequency frame of selecting to be positioned between described default sample region is as sample;
Estimation unit, for adopting the sample mean estimation technique to obtain the trend estimation value of introversion wind characteristic parameter information between described default sample region; And
Relatively unit, for comparing the size of the described audio frequency data to be played trend estimation value corresponding with current playing audio-fequency data.
A kind of terminal, comprising:
The communication module of audio frequency data can be obtained from outside network;
For the storing device of stores audio data;
Audio playing module, for the audio frequency data acquisition electrical signal play in described storing device, and read audio frequency data to be played and current playing audio-fequency data, extract the bent wind characteristic information of described audio frequency data to be played and current playing audio-fequency data and compare, obtain comparison result, according to described comparison result, calculate the volume gain value of the volume output value relative to described current audio plays, regulate the volume output value of described audio frequency data to be played according to described volume gain value after terminating to play current playing audio-fequency data, described electrical signal is adjusted according to this volume output value, and,
Dio Output Modules, comprises the interface for exporting described electrical signal or described electrical signal is converted into the transverter of acoustical signal.
The method of the volume output value of above-mentioned adjustment audio frequency data and device and terminal, compare based on by the bent wind characteristic information of audio frequency data to be played and current playing audio-fequency data, in the hope of obtaining one based on the bent wind type evaluation result corresponding to current playing audio-fequency data, then according to the bent wind type evaluation result that this obtains in real time, determine the yield value that volume exports, do like this and can evaluate bent wind type more accurately, basis for current playing audio-fequency data output volume output value provides the volume output value of a different user audiovisual demand of more fitting.
Accompanying drawing explanation
Fig. 1 is the method flow schematic diagram of the volume output value regulating audio frequency data;
Fig. 2 is the schematic flow sheet of one embodiment of the invention;
Fig. 3 is the schematic flow sheet of another embodiment of the present invention;
Fig. 4 is the schematic flow sheet of volume yield value acquisition methods in Fig. 1;
Fig. 5 is the schematic flow sheet of similarity decision method in Fig. 2;
Fig. 6 is that Euclidean distance calculates schematic diagram;
Fig. 7 is the structural representation that the present invention regulates the device of the volume output value of audio frequency data.
Embodiment
Based on the bent wind characteristic information comparison technology by audio frequency data to be played and current playing audio-fequency data, the present invention provides method and the device of a kind of volume output value regulating audio frequency data. Utilizing method and the device of the present invention, it is possible to use time listening to the music in family, if the current song play belongs to soft bent wind, the volume that user uses is bigger; Then current song play to close on ending time, the bent wind that preliminary examination measures next music being about to broadcasting is louder and more sonorous, then after terminating current music, before next music, automatically the volume of volume to suitable people's ear is turned down, to prevent the impaired hearing of user. The technology contents of the present invention is described in detail below in conjunction with each embodiment.
As shown in Figure 1, the method for a kind of volume output value regulating audio frequency data that the present embodiment provides, comprising:
Step 110: read audio frequency data to be played and current playing audio-fequency data;
Step 120: the bent wind characteristic information extracting above-mentioned audio frequency data to be played and current playing audio-fequency data;
Step 130: the bent wind characteristic information of the above-mentioned audio frequency data to be played of comparison and current playing audio-fequency data;
Step 140: according to the result of above-mentioned comparison, it is possible to for determining the bent wind type of above-mentioned audio frequency data to be played relative to above-mentioned current playing audio-fequency data. Based on the comparison result of bent wind, calculate the volume gain value of the volume output value relative to current playing audio-fequency data, after terminating to play current playing audio-fequency data, regulate the volume output value of audio frequency data to be played according to above-mentioned volume gain value. Here volume gain value can be weighed by decibel value, it is also possible to represents with per-cent.
In aforesaid method step, it is possible to increase the step of the current volume output value obtaining current playing audio-fequency data. If when current displaying audio file, user have adjusted volume output value voluntarily, namely can export in the volume that user is adjusted after then utilizing aforesaid method and be worth on basis to regulate according to obtained volume gain value, and without the need to redefining the volume decibel value output area of respective type. Therefore utilizing the aforesaid method of the present invention can according to the ear of actual user ability pleasant to hear, such as somebody likes loud, and somebody likes little sound, then can export according to the adjustment volume of the practical situation of user human nature more.
In aforesaid method step 140, regulate the concrete grammar of current volume output value according to above-mentioned volume gain value when playing audio frequency data to be played, it is possible to shown in following steps:
Successively increase on the basis of the current volume output value obtained or reduce above-mentioned volume gain value, here can be increase the volume gain value of 1��5 decibel every time or reduce by the volume gain value of 1��5 decibel, it is also possible to be increase by the volume gain value of 1��5% every time or reduce by the volume gain value of 1��5%;
Judging whether to receive outside confirmation instruction, if then stopping regulating, otherwise continuing increase on the basis of current volume output value or reduce above-mentioned volume gain value.
Can successively be gone forward one by one by above method regulates volume to export, and relaxes the situation that volume becomes suddenly greatly or diminishes suddenly, and the audio providing best for user exports experience.
Audio frequency data to be played and current playing audio-fequency data in above-mentioned steps 110 can be the audio frequency data extracted in the audio frequency data extracted in the next files in stream media taking from streaming media playing list respectively and the stream medium data currently play; Or can be take from same files in stream media next section of part to be played respectively and currently playing audio frequency data corresponding to part. Here Streaming Media comprises absolute music compression data, video compression data etc. If absolute music data, then next music that above-mentioned audio frequency data to be played and current playing audio-fequency data can be respectively in music playlist and the music currently play; Or the content that can be next section of content to be played in the song currently play and currently playing. The method of the present embodiment is utilized can automatically to regulate the broadcast sound volume of two songs or two video files before and after in same playlist.
Bent wind characteristic information in above-mentioned steps 120 comprises: the frequency values (it can represent with by the switching times of low-and high-frequency in sampling time period) characterizing low-and high-frequency switching times in audio frequency data, characterize the audio amplitude that note vibrations are strong and weak, the center of mass values of basic frequency band in reflection audio frequency frame signal, characterize the sound dynamicrange (referring to the intensity difference of most forte and the most weak sound) of sound pressure level change, the frequency spectrum data of reflection sound signal, beginning or ending with particular model in one or more combination. such as reflect that the center of mass values of basic frequency band in audio frequency frame signal adopts following account form: set the sample frequency of an audio frequency sequence as fs, sampling channel number is Sd, sampling figure place is Ws, the data amount check of every height band is k (=fsSdWs* 20/ (8000*32)), the data amount check of each audio frequency frame is Num (Num=k*32), x (n) (n �� [1...Num]) is frame audio data set, siJ () (j �� [1...32]) is the audio data set of the i-th frame jth son band, the bent wind characteristic parameter calculating the i-th audio frequency frame is as follows:
The center of mass values C of basic frequency band in the reflection audio frequency frame signal that each audio frequency frame is correspondingiFor:
C i = Σ j 32 jM j Σ j 32 M j , Wherein, M j = Σ j 32 s i 2 ( j ) k
The audio amplitude R that the sign note vibrations that each audio frequency frame is corresponding are strong and weaki, represent with rootmean-square, it be specially:
R i = Σ j 32 M j 2 32
On the basis of above-described embodiment, in above-mentioned steps 130, the step of the above-mentioned audio frequency data to be played of comparison and the bent wind characteristic information of current playing audio-fequency data comprises: based on the parameter value for characterizing bent wind characteristic information extracted in above-mentioned default sample region, the relatively size of above-mentioned audio frequency data to be played and current playing audio-fequency data interior corresponding parametric values in above-mentioned default sample region, then in step 140 based on the size comparative result of parameter value, judge the bent wind type of above-mentioned audio frequency data to be played relative to above-mentioned current playing audio-fequency data, obtain comparison result. such as, as shown in Figure 3, if above-mentioned audio frequency data to be played are greater than the corresponding parametric values of current playing audio-fequency data, then perform step 142 and judge that bent wind is comparatively loud and sonorous, if above-mentioned audio frequency data to be played are less than the corresponding parametric values of current playing audio-fequency data, then perform step 143 and judge that bent wind is comparatively soft, if above-mentioned audio frequency data to be played equal the corresponding parametric values of current playing audio-fequency data, then perform step 141 and it is judged to that bent wind is similar.
Certainly, it is also possible to carry out two times and judge, to reduce complexity during word audio data processing, and accurate bent wind similarity is provided to detect. Such as, first, based on the bent wind characteristic information that single audio frequency frame extracts, the similarity of the above-mentioned audio frequency data to be played of comparison and current playing audio-fequency data, carries out the first judgement of similarity frame by frame; Then, above-mentioned audio frequency data to be played non-to current playing audio-fequency data similar time, again based on the parameter value for characterizing bent wind characteristic information extracted in above-mentioned default sample region, compare above-mentioned audio frequency data to be played and the size of current playing audio-fequency data interior corresponding parametric values in above-mentioned default sample region.
The present embodiment relatively carrys out jointly to determine the bent wind type of audio frequency data to be played relative to current playing audio-fequency data by similarity and/or value size, such as audio frequency data to be played are loud and sonorous, soft or similar comparative results relative to current playing audio-fequency data, relative to prior art only utilizing audio file label to judge the bent wind characteristic of audio frequency data, the present embodiment mode is more accurate, more meets the practical situation of the audio file that user plays. It it is the time interval at the maximum portion of audio data place of the time range, the time interval at audio frequency data sound intermediate frequency amplitude highest portion point place or the low-and high-frequency switching times that pre-set between above-mentioned default sample region. Bent wind type herein is that whether the data of two audio frequency are similar, loud and sonorous or soft.
Above-mentioned steps 130 specifically can adopt following two kinds of modes to realize.
First kind of way: as shown in Figure 2, based on above-described embodiment after execution of step 120, first, perform step 131: the bent wind characteristic information extracted based on single audio frequency frame, the similarity of the above-mentioned audio frequency data to be played of comparison and current playing audio-fequency data frame by frame; If similar, then perform step 141 and it is judged to that bent wind is similar; If not similar, then perform step 132 again: based on the parameter value of the bent wind characteristic information of the sign extracted in above-mentioned default sample region, compare above-mentioned audio frequency data to be played and the size of current playing audio-fequency data interior corresponding parametric values in above-mentioned default sample region; If above-mentioned audio frequency data to be played are greater than the corresponding parametric values of current playing audio-fequency data, then perform step 142 and judge that bent wind is comparatively loud and sonorous; If above-mentioned audio frequency data to be played are less than the corresponding parametric values of current playing audio-fequency data, then perform step 143 and judge that bent wind is comparatively soft.
2nd kind of mode, as shown in Figure 3, based on above-described embodiment after execution of step 120, only perform step 132: based on the parameter value of the bent wind characteristic information of the sign extracted in above-mentioned default sample region, compare above-mentioned audio frequency data to be played and the size of current playing audio-fequency data interior corresponding parametric values in above-mentioned default sample region; If above-mentioned audio frequency data to be played are greater than the corresponding parametric values of current playing audio-fequency data, then perform step 142 and judge that bent wind is comparatively loud and sonorous; If above-mentioned audio frequency data to be played are less than the corresponding parametric values of current playing audio-fequency data, then perform step 143 and judge that bent wind is comparatively soft; If above-mentioned audio frequency data to be played equal the corresponding parametric values of current playing audio-fequency data, then perform step 141 and it is judged to that bent wind is similar.
When after the comparison result providing step 130 according to each step above-mentioned, correspondingly just can determine the bent wind type of audio frequency data to be played relative to current playing audio-fequency data in step 140, if being judged to that bent wind is similar by the step of Fig. 2 or Fig. 3, then perform step 151: the volume gain value of acquisition is zero, after terminating to play current playing audio-fequency data, maintain current volume output value constant; If judging that bent wind is comparatively loud and sonorous by the step of Fig. 2 or Fig. 3, then perform step 152: the volume gain value of acquisition is default decibel value or any value presetting in decibel interval, after terminating to play current playing audio-fequency data, export in current volume and it is worth on basis to subtract above-mentioned volume gain value, in order to turn down current volume output value corresponding to audio frequency data to be played; If judging that bent wind is comparatively soft by the step of Fig. 2 or Fig. 3, then perform step 153: the volume gain value of acquisition is default decibel value or any value presetting in decibel interval, after terminating to play current playing audio-fequency data, export in current volume and it is worth on basis to increase described volume gain value, in order to heighten current volume output value corresponding to audio frequency data to be played.
On the basis of above-described embodiment, after execution of step 110 to step 140, in above-mentioned steps 140, it is possible to obtain volume gain value by following step, as shown in Figure 4:
First, perform step 154: according to the above-mentioned bent wind type determined, search the two dimension dynamically array structure of the bent wind type of associated record and volume gain value, such as the volume gain value of the bent similar correspondence of wind is zero, the volume gain value of the bent comparatively loud and sonorous correspondence of wind is first decibel of value or any value in first decibel of interval, and the volume gain value of the bent comparatively soft correspondence of wind is the 2nd decibel of value or any value in the 2nd decibel of interval.
Then, perform step 155: obtain the volume gain value that corresponding bent wind type is corresponding;
Secondly, perform step 156: obtain the volume output value that current playing audio-fequency data is corresponding;
Again, perform step 157: when playing audio frequency data to be played, the volume gain value that superposition obtains on the basis of above-mentioned volume output value. And above-mentioned first decibel of value or be negative or negative interval range in first decibel of interval, above-mentioned 2nd decibel of value or be positive interval range in the 2nd decibel of interval.
Based on each embodiment above-mentioned, the method for the above-mentioned audio frequency data to be played of the comparison frame by frame of the present embodiment and current playing audio-fequency data similarity realizes to 138 according to the step 133 in Fig. 5. In the present embodiment, in step 120, from above-mentioned audio frequency data to be played and current playing audio-fequency data, the parameter value characterizing above-mentioned bent wind characteristic parameter information that each audio frequency frame of extracting is corresponding;
Step 133: utilize weighted euclidean distance calculation formula to calculate the Euclidean distance of parameter value corresponding to each audio frequency frame respectively;
Step 134: to the corresponding Euclidean distance value sequence of each audio frequency frame calculated;
Step 135: find out minimum Euclidean distance value;
Step 136: judge whether above-mentioned minimum Euclidean distance value is less than a threshold value, if then performing step 137: export the result of determination that above-mentioned audio frequency data to be played are similar to current playing audio-fequency data; Otherwise, then step 138 is performed: export the result of determination that above-mentioned audio frequency data to be played are not similar to current playing audio-fequency data.
Below with the center of mass values C of frequency band basic in reflection audio frequency frame signal corresponding to each audio frequency frameiThe audio amplitude R that the sign note vibrations corresponding with each audio frequency frame are strong and weakiFor example, the detailed process that above-mentioned similarity judges is described in detail.
Utilize weighted euclidean distance calculation formula, calculate center of mass values C respectively according to the mode shown in accompanying drawing 6iWith audio amplitude RiEuclidean distance WkAnd Gk, to k the Euclidean distance sequence calculated, find out minimum Euclidean distance value, judge W respectivelykmin�ܦ�1And Gkmin�ܦ�2Whether setting up, if set up, showing that front playing audio-fequency data is similar to audio frequency data to be played, otherwise not similar. Here ��1And ��2For the threshold value of setting.
W k = Σ i N d ( C i - C i ( k ) ) 2 ( C i 2 - C i ( k ) 2 ) 2 , k = N d - N b
G k = Σ i N d ( R i - R i ( k ) ) 2 ( R i 2 - R i ( k ) 2 ) 2 , k = N d - N b
Wherein, CiRepresent the barycenter of audio frequency data i-th frame to be played,The barycenter of playing audio-fequency data i-th frame before representing, NdRepresent the audio frame number of audio frequency data to be played, NbThe audio frame number of playing audio-fequency data before representing, RiRepresent the audio amplitude of audio frequency data i-th frame to be played,The audio amplitude of playing audio-fequency data i-th frame before representing.
Based on each embodiment above-mentioned, based on the parameter value of the bent wind characteristic information of the sign extracted in above-mentioned default sample region in above-mentioned steps 130, the method comparing above-mentioned audio frequency data to be played and current playing audio-fequency data interior corresponding parametric values size in above-mentioned default sample region comprises the following steps:
Based in the step 120 from above-mentioned audio frequency data to be played and current playing audio-fequency data, the parameter value characterizing above-mentioned bent wind characteristic parameter information that each audio frequency frame of extracting is corresponding, the above-mentioned parameter value that to select to be positioned between above-mentioned default sample region audio frequency frame corresponding, as sample, is the time interval at the maximum portion of audio data place of the time range, the time interval at audio frequency data sound intermediate frequency amplitude highest portion point place or the low-and high-frequency switching times that pre-set between default sample region here;
The sample mean estimation technique is adopted to obtain the trend estimation value of introversion wind characteristic parameter information between above-mentioned default sample region;
The relatively size of the trend estimation value that above-mentioned audio frequency data to be played are corresponding with current playing audio-fequency data.
Below with the center of mass values C of frequency band basic in reflection audio frequency frame signal corresponding to each audio frequency frameiThe audio amplitude R that the sign note vibrations corresponding with each audio frequency frame are strong and weakiFor example, the detailed process that above-mentioned parameter value compares is described in detail.
The time interval at audio frequency data sound intermediate frequency amplitude highest portion point place is defined as x between default sample region, the center of mass values C then selecting each frame in this interval correspondingiWith audio amplitude Ri, utilize the sample mean estimation technique to calculate and preset the corresponding parameter of each audio frequency frame in sample region and be worth average so that it is be decided to be trend estimation value, in order to the roughly situation of a basic frequency band evaluating between audio amplitude highest portion point location and amplitude.
μ c = Σ i N d ( x ) C i ′ N d ( x ) , μ c ( k ) = Σ i N b ( x ) C i ′ ( k ) N b ( x ) ;
μ R = Σ i N d ( x ) R i ′ N d ( x ) , μ R ( k ) = Σ i N b ( x ) R i ′ ( k ) N b ( x ) ;
Wherein, C 'iRepresent the barycenter presetting the i-th frame in x between sample region in audio frequency data to be played,Playing audio-fequency data is preset the barycenter of the i-th frame in x between sample region, N before representingd(x)Represent the audio frame number preset between sample region in x in audio frequency data to be played, Nb(x)The audio frame number preset between sample region in x in playing audio-fequency data before representing, R 'iRepresent the audio amplitude presetting the i-th frame in x between sample region in audio frequency data to be played,Playing audio-fequency data is preset before representing the audio amplitude of the i-th frame in x between sample region.
Based on each embodiment of aforesaid method, the present embodiment also comprises the following steps:
Store the adjustment result of each audio frequency data volume output value to be played, and with audio frequency data association record;
Size according to above-mentioned adjustment result, audio frequency data interlace is arranged, or by the audio frequency data summarization arrangement of above-mentioned adjustment result within the scope of pre-set interval, form playlist. The object done like this is according to customer need, to listen to the audio frequency data within the scope of a certain decibel value, also listens to same bent wind music content, or the staggered music content listening to different bent wind.
Can also comprise based on the above method: a parameter regulation interface is provided, for upgrading the pre-set interval scope of above-mentioned relevant adjustment result according to customer need so that the selection of playlist human nature, more intelligent more.
Based on aforesaid method, as shown in Figure 7, present embodiments providing the device of a kind of volume output value regulating audio frequency data, it comprises
Audio frequency data extraction module 1, for reading audio frequency data to be played and current playing audio-fequency data;
Characteristic information extracting module 2, for extracting the bent wind characteristic information of above-mentioned audio frequency data to be played and current playing audio-fequency data;
Characteristic information comparing module 3, for the bent wind characteristic information of the above-mentioned audio frequency data to be played of comparison and current playing audio-fequency data, obtains comparison result; And
Output module 5, for according to above-mentioned comparison result, calculate the volume gain value of the volume output value relative to described current playing audio-fequency data, after terminating to play current playing audio-fequency data, regulate the volume output value of described audio frequency data to be played according to described volume gain value.
Above-mentioned audio frequency data extraction module 1, characteristic information extracting module 2, characteristic information comparing module 3, output module 5 are respectively used to realize above-mentioned steps 110 to step 140, therefore the related description that its relevant specific implementation details is civilian before can refer to, is not repeated at this.
Based on the method for above-mentioned Fig. 2 and Fig. 3, above-mentioned characteristic information comparing module 3 comprises following two function modules:
Similarity determination module, for the bent wind characteristic information extracted based on single audio frequency frame, the similarity of the above-mentioned audio frequency data to be played of comparison and current playing audio-fequency data frame by frame, carries out sentencing at the beginning of similarity;
Comparison module, for non-similar time again based on the parameter value of bent wind characteristic information of the sign extracted in above-mentioned default sample region, compare above-mentioned audio frequency data to be played and current playing audio-fequency data in above-mentioned default sample region in the size of corresponding parametric values.
Based on the method for Fig. 5, in above-mentioned characteristic information comparing module 3, similarity determination module can form by with lower unit:
Euclidean distance calculates unit, for based on from above-mentioned audio frequency data to be played and current playing audio-fequency data, the parameter value characterizing above-mentioned bent wind characteristic parameter information that each audio frequency frame of extracting is corresponding, utilize weighted euclidean distance calculation formula to calculate the Euclidean distance of parameter value corresponding to each audio frequency frame respectively;
Sequencing unit, for the corresponding Euclidean distance value sequence of each audio frequency frame calculated; And
Judging unit, for finding out minimum Euclidean distance value, judges whether above-mentioned minimum Euclidean distance value is less than a threshold value, if then exporting the result of determination that above-mentioned audio frequency data to be played are similar to current playing audio-fequency data; Otherwise, then the result of determination that above-mentioned audio frequency data to be played are not similar to current playing audio-fequency data is exported.
Based on aforesaid method, in above-mentioned characteristic information comparing module 3, comparison module can comprise:
Sample extraction unit, for based on from above-mentioned audio frequency data to be played and current playing audio-fequency data, the parameter value characterizing above-mentioned bent wind characteristic parameter information that each audio frequency frame of extracting is corresponding, above-mentioned parameter value corresponding to audio frequency frame of selecting to be positioned between above-mentioned default sample region is as sample;
Estimation unit, for adopting the sample mean estimation technique to obtain the trend estimation value of introversion wind characteristic parameter information between above-mentioned default sample region; And
Relatively unit, for comparing the size of the above-mentioned audio frequency data to be played trend estimation value corresponding with current playing audio-fequency data.
Based on aforesaid method and system, it is possible to according to the bent wind automatic adjustment height of music etc. or turn down volume, avoiding the sense of hearing damaging user, user does not need to carry out manual volume adjusting simultaneously, improves Consumer's Experience.
In addition, based on the method for volume output value and the device of above-mentioned adjustment audio frequency data, it is also possible to being applied to a kind of terminal, this terminal can be the terminal with audio player such as mobile phone, electronic toy, earphone, computer, IPAD. This terminal comprises following element:
1, the communication module of audio frequency data can be obtained from outside network; This communication module can be the communication module meeting the communications protocol such as 4G, 5G, TDMA.
2, for the storing device of stores audio data; This storing device can be the built-in terminal such as flash memory, hard disk, USD, SD card or the various types of storing devices being placed on terminal.
3, audio playing module, for the audio frequency data acquisition electrical signal play in described storing device, and read audio frequency data to be played and current playing audio-fequency data, extract the bent wind characteristic information of described audio frequency data to be played and current playing audio-fequency data and compare, obtain comparison result, according to described comparison result, calculate the volume gain value of the volume output value relative to described current audio plays, regulate the volume output value of described audio frequency data to be played according to described volume gain value after terminating to play current playing audio-fequency data, described electrical signal is adjusted according to this volume output value. this audio playing module is mainly used for realizing the broadcasting to audio frequency data, exports by audio frequency data and forms electrical signal, and the content based on step described in above-mentioned steps 110 to step 140 realizes the adjustment to electrical signal. such as, the amplitude of above-said current signal is adjusted according to volume output value. specific implementation details about audio playing module can join the related description in seing above. audio playing module in the present embodiment can be audio player built-in in terminal.
4, dio Output Modules, comprises for exporting the audio output interface of described electrical signal or described electrical signal is converted into the transverter of acoustical signal. Dio Output Modules can be the audio output interface of the equipment such as a kind of accessible loud speaker, earphone, such as, if terminal is computer, then only needs to provide audio output interface, and by loud speaker, electrical signal is converted into the sound that the mankind can hear. Again such as, dio Output Modules can be the transverter being converted into acoustical signal for electrical signal being built in terminal, and such as dio Output Modules can be that carry in terminal outer puts equipment, and so, terminal can carry outer playing function.
Each technology feature of the above embodiment or technical scheme can combine arbitrarily, for making description succinct, each all possible combination of technology feature in above-described embodiment is not all described, but, as long as the combination of these technology features does not exist contradiction, all it is considered to be the scope that this specification sheets is recorded.
The above embodiment only have expressed several enforcement modes of the present invention, and it describes comparatively concrete and detailed, but therefore can not be interpreted as the restriction to patent scope of the present invention. , it is also possible to make some distortion and improvement, it should be appreciated that for the person of ordinary skill of the art, without departing from the inventive concept of the premise these all belong to protection scope of the present invention. Therefore, the protection domain of patent of the present invention should be as the criterion with claims.

Claims (11)

1. one kind regulates the method for the volume output value of audio frequency data, it is characterised in that, described method comprises:
Read audio frequency data to be played and current playing audio-fequency data;
Extract the bent wind characteristic information of described audio frequency data to be played and current playing audio-fequency data and compare, obtain comparison result;
According to comparison result, calculate the volume gain value of the volume output value relative to described current playing audio-fequency data, after terminating to play current playing audio-fequency data, regulate the volume output value of described audio frequency data to be played according to described volume gain value.
2. the method for the volume output value of adjustment audio frequency data according to claim 1, it is characterized in that, described bent wind characteristic information comprises: characterize the frequency values of low-and high-frequency switching times in audio frequency data, the center of mass values characterizing basic frequency band in the strong and weak audio amplitude of note vibrations, reflection audio frequency frame signal, reflection sound signal frequency spectrum data, beginning or ending with preset model in one or more combination.
3. the method for the volume output value of adjustment audio frequency data according to claim 1, it is characterized in that, described audio frequency data to be played and current playing audio-fequency data are the audio frequency data extracted in the audio frequency data extracted in the next files in stream media taking from streaming media playing list respectively and the stream medium data currently play; Or take from same files in stream media next section of part to be played respectively and currently playing audio frequency data corresponding to part.
4. the method for the volume output value of adjustment audio frequency data according to claim 1, it is characterised in that, the step of audio frequency data to be played described in described comparison and the bent wind characteristic information of current playing audio-fequency data comprises:
Based on the bent wind characteristic information that single audio frequency frame extracts, the similarity of audio frequency data to be played and current playing audio-fequency data described in comparison frame by frame, carry out sentencing at the beginning of similarity, non-similar time again based on the parameter value for characterizing bent wind characteristic information extracted in default sample region, compare described audio frequency data to be played and current playing audio-fequency data in described default sample region in the size of corresponding parametric values;
Or, based on the parameter value for characterizing bent wind characteristic information extracted in default sample region, compare described audio frequency data to be played and the size of current playing audio-fequency data interior corresponding parametric values in described default sample region.
5. the method for the volume output value of adjustment audio frequency data according to claim 4, it is characterized in that, it is the time interval at the maximum portion of audio data place of the time range, the time interval at audio frequency data sound intermediate frequency amplitude highest portion point place or the low-and high-frequency switching times that pre-set between described default sample region.
6. the method for the volume output value of adjustment audio frequency data according to claim 4, it is characterized in that, described based on single audio frequency frame extract bent wind characteristic information, described in comparison, the determination step of audio frequency data to be played and current playing audio-fequency data similarity comprises frame by frame:
Based on from described audio frequency data to be played and current playing audio-fequency data, the parameter value characterizing described bent wind characteristic parameter information that each audio frequency frame of extracting is corresponding, utilize weighted euclidean distance calculation formula to calculate the Euclidean distance of parameter value corresponding to each audio frequency frame respectively;
To the corresponding Euclidean distance value sequence of each audio frequency frame calculated;
Find out minimum Euclidean distance value, judge whether described minimum Euclidean distance value is less than a threshold value, if then exporting the result of determination that described audio frequency data to be played are similar to current playing audio-fequency data; Otherwise, then the result of determination that described audio frequency data to be played are not similar to current playing audio-fequency data is exported.
7. the method for the volume output value of adjustment audio frequency data according to claim 4, it is characterised in that, the described method comparing described audio frequency data to be played and current playing audio-fequency data interior corresponding parametric values size in described default sample region comprises:
Based on from described audio frequency data to be played and current playing audio-fequency data, the parameter value characterizing described bent wind characteristic parameter information that each audio frequency frame of extracting is corresponding, described parameter value corresponding to audio frequency frame of selecting to be positioned between described default sample region is as sample;
The sample mean estimation technique is adopted to obtain the trend estimation value of introversion wind characteristic parameter information between described default sample region;
The relatively size of the trend estimation value that described audio frequency data to be played are corresponding with current playing audio-fequency data.
8. one kind regulates the device of the volume output value of audio frequency data, it is characterised in that: described device comprises:
Audio frequency data extraction module, for reading audio frequency data to be played and current playing audio-fequency data;
Characteristic information extracting module, for extracting the bent wind characteristic information of described audio frequency data to be played and current playing audio-fequency data;
Characteristic information comparing module, for the bent wind characteristic information of audio frequency data to be played described in comparison and current playing audio-fequency data, obtains comparison result; And
Output module, for according to described comparison result, calculate the volume gain value of the volume output value relative to described current playing audio-fequency data, after terminating to play current playing audio-fequency data, regulate the volume output value of described audio frequency data to be played according to described volume gain value.
9. the device of the volume output value of adjustment audio frequency data according to claim 8, it is characterised in that, described characteristic information comparing module comprises:
Euclidean distance calculates unit, for based on from described audio frequency data to be played and current playing audio-fequency data, the parameter value characterizing described bent wind characteristic parameter information that each audio frequency frame of extracting is corresponding, utilize weighted euclidean distance calculation formula to calculate the Euclidean distance of parameter value corresponding to each audio frequency frame respectively;
Sequencing unit, for the corresponding Euclidean distance value sequence of each audio frequency frame calculated; And
Judging unit, for finding out minimum Euclidean distance value, judges whether described minimum Euclidean distance value is less than a threshold value, if then exporting the result of determination that described audio frequency data to be played are similar to current playing audio-fequency data; Otherwise, then the result of determination that described audio frequency data to be played are not similar to current playing audio-fequency data is exported.
10. the device of the volume output value of adjustment audio frequency data according to claim 8, it is characterised in that, described characteristic information comparing module comprises:
Sample extraction unit, for based on from described audio frequency data to be played and current playing audio-fequency data, the parameter value characterizing described bent wind characteristic parameter information that each audio frequency frame of extracting is corresponding, described parameter value corresponding to audio frequency frame of selecting to be positioned between described default sample region is as sample;
Estimation unit, for adopting the sample mean estimation technique to obtain the trend estimation value of introversion wind characteristic parameter information between described default sample region; And
Relatively unit, for comparing the size of the described audio frequency data to be played trend estimation value corresponding with current playing audio-fequency data.
11. 1 kinds of terminals, it is characterised in that, comprising:
The communication module of audio frequency data can be obtained from outside network;
For the storing device of stores audio data;
Audio playing module, for the audio frequency data acquisition electrical signal play in described storing device, and read audio frequency data to be played and current playing audio-fequency data, extract the bent wind characteristic information of described audio frequency data to be played and current playing audio-fequency data and compare, obtain comparison result, according to described comparison result, calculate the volume gain value of the volume output value relative to described current audio plays, regulate the volume output value of described audio frequency data to be played according to described volume gain value after terminating to play current playing audio-fequency data, described electrical signal is adjusted according to this volume output value, and,
Dio Output Modules, comprises the interface for exporting described electrical signal or described electrical signal is converted into the transverter of acoustical signal.
CN201511000762.2A 2015-12-24 2015-12-24 Adjust the method and apparatus and terminal of the volume output valve of audio data Active CN105632523B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201511000762.2A CN105632523B (en) 2015-12-24 2015-12-24 Adjust the method and apparatus and terminal of the volume output valve of audio data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201511000762.2A CN105632523B (en) 2015-12-24 2015-12-24 Adjust the method and apparatus and terminal of the volume output valve of audio data

Publications (2)

Publication Number Publication Date
CN105632523A true CN105632523A (en) 2016-06-01
CN105632523B CN105632523B (en) 2018-07-20

Family

ID=56047360

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201511000762.2A Active CN105632523B (en) 2015-12-24 2015-12-24 Adjust the method and apparatus and terminal of the volume output valve of audio data

Country Status (1)

Country Link
CN (1) CN105632523B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107566652A (en) * 2017-09-20 2018-01-09 惠州Tcl移动通信有限公司 A kind of method, mobile terminal and the storage medium of automatic regulating volume size
WO2019033921A1 (en) * 2017-08-18 2019-02-21 Guangdong Oppo Mobile Telecommunications Corp., Ltd. Method and apparatus for adjusting volume, terminal device and storage medium
CN110677774A (en) * 2019-08-21 2020-01-10 中国平安财产保险股份有限公司 Volume self-adaptive adjusting method and device, computer equipment and storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6144627A (en) * 1996-03-08 2000-11-07 Sony Corporation Data recording and reproducing apparatus and method with phase error detection
CN104485121A (en) * 2014-11-24 2015-04-01 惠州Tcl移动通信有限公司 Method and system for automatically setting sound effect parameters

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6144627A (en) * 1996-03-08 2000-11-07 Sony Corporation Data recording and reproducing apparatus and method with phase error detection
CN104485121A (en) * 2014-11-24 2015-04-01 惠州Tcl移动通信有限公司 Method and system for automatically setting sound effect parameters

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019033921A1 (en) * 2017-08-18 2019-02-21 Guangdong Oppo Mobile Telecommunications Corp., Ltd. Method and apparatus for adjusting volume, terminal device and storage medium
US10440464B2 (en) 2017-08-18 2019-10-08 Guangdong Oppo Mobile Telecommunications Corp., Ltd. Method and apparatus for adjusting volume, terminal device and storage medium
CN107566652A (en) * 2017-09-20 2018-01-09 惠州Tcl移动通信有限公司 A kind of method, mobile terminal and the storage medium of automatic regulating volume size
CN110677774A (en) * 2019-08-21 2020-01-10 中国平安财产保险股份有限公司 Volume self-adaptive adjusting method and device, computer equipment and storage medium
CN110677774B (en) * 2019-08-21 2022-06-10 中国平安财产保险股份有限公司 Volume self-adaptive adjusting method and device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN105632523B (en) 2018-07-20

Similar Documents

Publication Publication Date Title
CN105074822B (en) Device and method for audio classification and processing
WO2019233358A1 (en) Method and system for processing sound characteristics based on deep learning
US8311821B2 (en) Parameterized temporal feature analysis
CN103152668B (en) Adjusting method of output audio and system thereof
CN105405439B (en) Speech playing method and device
KR102084931B1 (en) Volume leveler controller and controlling method
KR101269296B1 (en) Neural network classifier for separating audio sources from a monophonic audio signal
CN101002254B (en) Device and method for robustry classifying audio signals, method for establishing and operating audio signal database
CN101023469B (en) Digital filtering method, digital filtering equipment
CN107293286B (en) Voice sample collection method based on network dubbing game
JP2010020133A (en) Playback apparatus, display method, and display program
CN102486920A (en) Audio event detection method and device
CN101569092A (en) System for processing audio data
US20210158832A1 (en) Method and device for evaluating performance of speech enhancement algorithm, and computer-readable storage medium
WO2020155490A1 (en) Method and apparatus for managing music based on speech analysis, and computer device
JP2007534995A (en) Method and system for classifying audio signals
WO2015114216A2 (en) Audio signal analysis
CN105632523A (en) Method and device for regulating sound volume output value of audio data, and terminal
CN101292241A (en) Method and device for calculating a similarity metric between a first feature vector and a second feature vector
CN106980487A (en) Audio control method and audio control apparatus
CN108615536A (en) Time-frequency combination feature musical instrument assessment of acoustics system and method based on microphone array
Kendrick et al. Perceived audio quality of sounds degraded by non-linear distortions and single-ended assessment using HASQI
Wilson et al. Perception & evaluation of audio quality in music production
JP4770194B2 (en) Information embedding apparatus and method for acoustic signal
KR101382356B1 (en) Apparatus for forgery detection of audio file

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP03 Change of name, title or address
CP03 Change of name, title or address

Address after: 519085 Guangdong Zhuhai science and technology innovation coastal Meizu Technology Building

Patentee after: MEIZU TECHNOLOGY Co.,Ltd.

Address before: 519000 technology innovation coastal Meizu science and technology building, Zhuhai, Guangdong

Patentee before: Meizu Technology (China) Co.,Ltd.