CN105632523A - Method and device for regulating sound volume output value of audio data, and terminal

Info

Publication number: CN105632523A
Application number: CN201511000762.2A
Authority: CN
Inventors: 王文熹; 郑美玲
Original assignee: Meizu Technology China Co Ltd
Current assignee: Meizu Technology Co Ltd
Priority date: 2015-12-24
Filing date: 2015-12-24
Publication date: 2016-06-01
Anticipated expiration: 2035-12-24
Also published as: CN105632523B

Abstract

The invention provides a method and a device for regulating a sound volume output value of audio data. The method includes the steps of: reading audio data to be played and currently-played audio data; extracting music style feature information of the audio data to be played and the currently-played audio data; comparing the music style feature information of the audio data to be played with that of the currently-played audio data; determining a music style type of the audio data to be played relative to the currently-played audio data according to a comparison result; calculating a sound volume gain value relative to a sound volume output value of the currently-played audio data according to the determined music style type; and regulating the sound volume output value of the audio data to be played according to the sound volume gain value after the playing of the currently-played audio data is stopped. The invention solves the problem of how to automatically regulate the sound volume output of different types of audio files, and allows the output sound volume of an audio file to be regulated according to the music actually being played.

Description

Method, device, and terminal for regulating the sound volume output value of audio data

Technical field

The present invention relates to a sound volume management method and system, and in particular to a method, a device, and a terminal for regulating the sound volume output value of audio data based on a comparison with the currently-played audio.

Background art

When an existing mobile terminal or multimedia player plays music, the user generally listens through earphones or a loudspeaker. Music, however, comes in different types. For example, two consecutive pieces may differ in style, the first soft and the second loud and sonorous. If the volume stays unchanged, the user's hearing may be damaged and the listening experience suffers.

The prior art addresses this problem by obtaining the tag data of each audio file. For example, the tag of an mp3 file is called ID3 and stores the title, performer, composer (lyricist), lyrics, audio type, and similar information of the audio file; audio types include rock, jazz, country, classical, and so on. The output signals of audio files of different types (such as rock and country) have different amplitudes, tones, and frequencies. By reading the tag (such as ID3), the audio type of the music is obtained and the preset volume output value for that audio type is looked up, so that the music output volume is regulated automatically. However, if the tag of an audio file does not indicate the audio type, or the audio file does not actually match the preset volume output value well, the preset volume output value cannot be matched to the audio file accurately.

Summary of the invention

In view of this, it is necessary to provide a method and a device for regulating the sound volume output value of audio data, to address the problem of how to automatically regulate the volume output of different types of audio files.

A method for regulating the sound volume output value of audio data, comprising:

reading audio data to be played and currently-played audio data;

extracting and comparing the music style feature information of the audio data to be played and the currently-played audio data to obtain a comparison result; and

calculating, according to the comparison result, a volume gain value relative to the sound volume output value of the currently-played audio, and, after playback of the currently-played audio data ends, regulating the sound volume output value of the audio data to be played according to the volume gain value.

In one embodiment, the music style feature information comprises one or a combination of: a frequency value characterizing the number of switches between high and low frequencies in the audio data, an audio amplitude characterizing the strength of note vibration, a centroid value reflecting the fundamental frequency band of an audio frame signal, spectrum data reflecting the audio signal, and a beginning or ending with a particular pattern.

In one embodiment, the audio data to be played and the currently-played audio data are, respectively, audio data extracted from the next streaming media file in a streaming media playlist and audio data extracted from the streaming media data currently being played; or they are, respectively, the audio data corresponding to the next segment to be played and to the segment currently being played within the same streaming media file.

In one embodiment, the step of comparing the music style feature information of the audio data to be played and the currently-played audio data comprises:

first comparing, frame by frame, the similarity of the audio data to be played and the currently-played audio data based on the music style feature information extracted from individual audio frames; and then comparing the magnitudes of the corresponding parameter values of the audio data to be played and the currently-played audio data within a preset sample interval, based on the parameter values characterizing the music style feature information extracted within the preset sample interval;

or, comparing only the magnitudes of the corresponding parameter values of the audio data to be played and the currently-played audio data within the preset sample interval, based on the parameter values characterizing the music style feature information extracted within the preset sample interval.

In one embodiment, the preset sample interval is a preset time range, the time interval containing the portion of the audio data with the highest audio amplitude, or the time interval containing the portion of the audio data with the largest number of high/low frequency switches.
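As a rough illustration of the second option, the interval with the highest audio amplitude could be located with a sliding window over per-frame amplitudes. The sketch below is in Python; the function name `loudest_interval`, the fixed window length, and the use of the window sum (equivalently, the mean) as the selection criterion are assumptions made for illustration and are not specified by the text.

```python
def loudest_interval(amplitudes, window):
    """Return (start, end) frame indices, end exclusive, of the window
    whose summed amplitude is largest (hypothetical helper)."""
    if window <= 0 or window > len(amplitudes):
        raise ValueError("window must be between 1 and the number of frames")
    # Sum of the first window, then slide one frame at a time.
    current = sum(amplitudes[:window])
    best_sum, best_start = current, 0
    for start in range(1, len(amplitudes) - window + 1):
        current += amplitudes[start + window - 1] - amplitudes[start - 1]
        if current > best_sum:
            best_sum, best_start = current, start
    return best_start, best_start + window
```

When several windows tie, this sketch keeps the earliest one; a different tie-breaking rule would serve equally well.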

In one embodiment, the step of comparing, frame by frame, the similarity of the audio data to be played and the currently-played audio data based on the music style feature information extracted from individual audio frames comprises:

calculating, using a weighted Euclidean distance formula, the Euclidean distance of the parameter values of each audio frame, based on the parameter values characterizing the music style feature parameter information that are extracted for each audio frame from the audio data to be played and the currently-played audio data;

sorting the Euclidean distance values calculated for the audio frames; and

finding the minimum Euclidean distance value and judging whether it is less than a threshold; if so, outputting a judgment that the audio data to be played is similar to the currently-played audio data; otherwise, outputting a judgment that the audio data to be played is not similar to the currently-played audio data.

In one embodiment, the method of comparing the magnitudes of the corresponding parameter values of the audio data to be played and the currently-played audio data within the preset sample interval comprises:

selecting as samples, from the parameter values characterizing the music style feature parameter information that are extracted for each audio frame of the audio data to be played and the currently-played audio data, the parameter values of the audio frames located within the preset sample interval;

obtaining a trend estimate of the music style feature parameter information within the preset sample interval by the sample mean estimation method; and

comparing the magnitudes of the trend estimates of the audio data to be played and the currently-played audio data.
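The three steps above can be sketched in Python as follows. This is a minimal sketch under stated assumptions: the preset sample interval is given as a pair of frame indices (end exclusive), and the names `trend_estimate` and `compare_trends` are hypothetical.

```python
def trend_estimate(frame_values, interval):
    """Sample mean of the per-frame parameter values inside the interval.

    `interval` is (start, end) in frame indices, end exclusive (assumed)."""
    start, end = interval
    sample = frame_values[start:end]
    if not sample:
        raise ValueError("no frames fall inside the preset sample interval")
    return sum(sample) / len(sample)


def compare_trends(next_estimate, current_estimate):
    """Return 1 if the audio to be played has the larger estimate,
    -1 if it has the smaller one, and 0 if the two are equal."""
    return (next_estimate > current_estimate) - (next_estimate < current_estimate)
```

For example, `compare_trends(trend_estimate(next_values, (10, 50)), trend_estimate(current_values, (10, 50)))` yields the sign of the difference between the two pieces within the same interval.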

Based on the above method, the present invention also provides a device for regulating the sound volume output value of audio data, comprising:

an audio data extraction module, configured to read audio data to be played and currently-played audio data;

a feature information extraction module, configured to extract the music style feature information of the audio data to be played and the currently-played audio data;

a feature information comparison module, configured to compare the music style feature information of the audio data to be played and the currently-played audio data to obtain a comparison result; and

an output module, configured to output, according to the comparison result, a volume gain value relative to the current volume output, and to regulate the current sound volume output value according to the volume gain value when the audio data to be played is played.

In one embodiment, the feature information comparison module comprises:

a Euclidean distance calculation unit, configured to calculate, using a weighted Euclidean distance formula, the Euclidean distance of the parameter values of each audio frame, based on the parameter values characterizing the music style feature parameter information that are extracted for each audio frame from the audio data to be played and the currently-played audio data;

a sorting unit, configured to sort the Euclidean distance values calculated for the audio frames; and

a judging unit, configured to find the minimum Euclidean distance value and judge whether it is less than a threshold; if so, to output a judgment that the audio data to be played is similar to the currently-played audio data; otherwise, to output a judgment that the audio data to be played is not similar to the currently-played audio data.

a sample extraction unit, configured to select as samples, from the parameter values characterizing the music style feature parameter information that are extracted for each audio frame of the audio data to be played and the currently-played audio data, the parameter values of the audio frames located within the preset sample interval;

an estimation unit, configured to obtain a trend estimate of the music style feature parameter information within the preset sample interval by the sample mean estimation method; and

a comparison unit, configured to compare the magnitudes of the trend estimates of the audio data to be played and the currently-played audio data.

A terminal, comprising:

a communication module capable of obtaining audio data from an external network;

a storage device for storing audio data;

an audio playing module, configured to play the audio data in the storage device to obtain an electrical signal, to read audio data to be played and currently-played audio data, to extract and compare the music style feature information of the audio data to be played and the currently-played audio data to obtain a comparison result, to calculate, according to the comparison result, a volume gain value relative to the sound volume output value of the currently-played audio, to regulate the sound volume output value of the audio data to be played according to the volume gain value after playback of the currently-played audio data ends, and to adjust the electrical signal according to this sound volume output value; and

an audio output module, comprising an interface for outputting the electrical signal or a transducer for converting the electrical signal into an acoustic signal.

The above method, device, and terminal for regulating the sound volume output value of audio data compare the music style feature information of the audio data to be played with that of the currently-played audio data to obtain a music style type evaluation result relative to the currently-played audio data, and then determine the volume output gain value according to this evaluation result obtained in real time. This allows the music style type to be evaluated more accurately and, on the basis of the sound volume output value of the currently-played audio data, provides a sound volume output value that better fits the listening needs of different users.

Brief description of the drawings

Fig. 1 is a schematic flowchart of the method for regulating the sound volume output value of audio data;

Fig. 2 is a schematic flowchart of one embodiment of the present invention;

Fig. 3 is a schematic flowchart of another embodiment of the present invention;

Fig. 4 is a schematic flowchart of the method for obtaining the volume gain value in Fig. 1;

Fig. 5 is a schematic flowchart of the similarity judgment method in Fig. 2;

Fig. 6 is a schematic diagram of the Euclidean distance calculation;

Fig. 7 is a schematic structural diagram of the device of the present invention for regulating the sound volume output value of audio data.

Detailed description

Based on a technique for comparing the music style feature information of the audio data to be played with that of the currently-played audio data, the present invention provides a method and a device for regulating the sound volume output value of audio data. With the method and device of the present invention, when a user listens to music, suppose the song currently playing has a soft style and the user has set a relatively high volume. As the current song approaches its end, if the next piece to be played is detected in advance to be louder and more sonorous, then after the current piece ends and before the next piece starts, the volume is automatically turned down to a level suitable for the human ear, to prevent damage to the user's hearing. The technical content of the present invention is described in detail below with reference to the embodiments.

As shown in Fig. 1, the method for regulating the sound volume output value of audio data provided by this embodiment comprises:

Step 110: reading audio data to be played and currently-played audio data;

Step 120: extracting the music style feature information of the audio data to be played and the currently-played audio data;

Step 130: comparing the music style feature information of the audio data to be played and the currently-played audio data;

Step 140: the result of the comparison can be used to determine the music style type of the audio data to be played relative to the currently-played audio data. Based on the music style comparison result, a volume gain value relative to the sound volume output value of the currently-played audio data is calculated and, after playback of the currently-played audio data ends, the sound volume output value of the audio data to be played is regulated according to the volume gain value. The volume gain value may be measured in decibels or expressed as a percentage.
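To make the two ways of expressing the gain concrete, the following Python sketch converts a gain into a linear scale factor and applies it to samples. The decibel conversion uses the standard amplitude relation, factor = 10^(gain/20). Treating a percentage gain as a relative change of linear amplitude is an assumption, since the text does not define what the percentage refers to; the names `gain_factor` and `apply_gain` are hypothetical.

```python
def gain_factor(gain, unit="dB"):
    """Convert a volume gain value into a linear amplitude factor."""
    if unit == "dB":
        # Standard amplitude relation: +20 dB multiplies amplitude by 10.
        return 10.0 ** (gain / 20.0)
    if unit == "percent":
        # Assumption: the percentage is a relative change of amplitude.
        return 1.0 + gain / 100.0
    raise ValueError("unit must be 'dB' or 'percent'")


def apply_gain(samples, gain, unit="dB"):
    """Scale a sequence of samples by the given gain."""
    factor = gain_factor(gain, unit)
    return [s * factor for s in samples]
```

A negative gain lowers the volume and a positive gain raises it, in either unit.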

A step of obtaining the current sound volume output value of the currently-played audio data may be added to the above method. If the user has adjusted the volume output value while the current audio file is playing, the method then regulates the volume on the basis of the user-adjusted volume output value according to the obtained volume gain value, without redefining the volume decibel output range for each type. The method can therefore adapt to the actual listening capacity of the user's ears: some people prefer loud sound and others quiet sound, and the volume output is regulated in a more user-friendly way according to the user's actual situation.

In step 140 of the above method, the current sound volume output value may be regulated according to the volume gain value when the audio data to be played is played, specifically by the following steps:

successively increasing or reducing the obtained current sound volume output value by the volume gain value, where each step may increase or reduce the volume by a gain of 1 to 5 decibels, or by a gain of 1 to 5%;

judging whether an external confirmation instruction has been received; if so, stopping the regulation; otherwise, continuing to increase or reduce the current sound volume output value by the volume gain value.

With the above method the volume output is regulated step by step, which softens sudden jumps or drops in volume and gives the user a better audio output experience.
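The step-by-step regulation can be sketched in Python as a loop that applies one gain step and then checks for confirmation. The callback `confirmed` stands in for the external confirmation instruction, and the `max_steps` safety bound is an assumption added so that the loop always terminates; neither name comes from the text.

```python
def adjust_stepwise(current_volume, step, confirmed, max_steps=20):
    """Apply `step` (for example -3 for minus 3 dB) repeatedly until the
    hypothetical callback `confirmed(volume)` returns True."""
    volume = current_volume
    for _ in range(max_steps):
        volume += step          # one incremental increase or reduction
        if confirmed(volume):   # external confirmation received: stop
            break
    return volume
```

For example, `adjust_stepwise(60, -3, lambda v: v <= 51)` steps through 57 and 54 and stops at 51.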

The audio data to be played and the currently-played audio data in step 110 may be, respectively, audio data extracted from the next streaming media file in a streaming media playlist and audio data extracted from the streaming media data currently being played; or they may be, respectively, the audio data corresponding to the next segment to be played and to the segment currently being played within the same streaming media file. Here, streaming media includes compressed pure-music data, compressed video data, and the like. For pure-music data, the audio data to be played and the currently-played audio data may be, respectively, the next piece in a music playlist and the piece currently playing; or the next segment to be played and the segment currently playing within the current song. The method of this embodiment can automatically regulate the playback volume of two consecutive songs or two consecutive video files in the same playlist.

The music style feature information in step 120 comprises one or a combination of: a frequency value characterizing the number of switches between high and low frequencies in the audio data (which may be expressed as the number of high/low frequency switches within a sampling period); an audio amplitude characterizing the strength of note vibration; a centroid value reflecting the fundamental frequency band of an audio frame signal; a sound dynamic range characterizing the change in sound pressure level (that is, the intensity difference between the loudest and the weakest sound); spectrum data reflecting the audio signal; and a beginning or ending with a particular pattern. For example, the centroid value reflecting the fundamental frequency band of an audio frame signal is calculated as follows. Let the sampling frequency of an audio sequence be f_s, the number of sampling channels be S_d, and the number of sampling bits be W_s. The number of data per sub-band is k = f_s * S_d * W_s * 20 / (8000 * 32), and the number of data per audio frame is Num = k * 32. Let x(n) (n ∈ [1...Num]) be the audio data set of one frame, and s_i(j) (j ∈ [1...32]) be the audio data set of the j-th sub-band of the i-th frame. The music style feature parameters of the i-th audio frame are then calculated as follows:

The centroid value C_i reflecting the fundamental frequency band of the audio frame signal, for each audio frame, is:

C_{i} = \frac{\sum_{j=1}^{32} j M_{j}}{\sum_{j=1}^{32} M_{j}},

where M_j is the root mean square of the k data values in the j-th sub-band s_i(j):

M_{j} = \sqrt{\frac{\sum_{x \in s_{i}(j)} x^{2}}{k}}

The audio amplitude R_i characterizing the strength of note vibration, for each audio frame, is expressed as a root mean square, specifically:

R_{i} = \sqrt{\frac{\sum_{j=1}^{32} M_{j}^{2}}{32}}
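The per-frame calculation of C_i and R_i can be sketched in Python as follows. The sketch takes one frame as a list of sub-bands, each a list of sample values, and treats M_j as the root mean square of the samples in sub-band j; that reading of the sum in the M_j formula is an assumption. The function names are hypothetical, and returning a centroid of 0.0 for a silent frame is a choice made here to avoid division by zero.

```python
import math


def subband_rms(subband):
    """M_j: root mean square of the samples in one sub-band (assumed reading)."""
    return math.sqrt(sum(x * x for x in subband) / len(subband))


def frame_features(subbands):
    """Return (C_i, R_i) for one frame given its list of sub-bands."""
    m = [subband_rms(s) for s in subbands]
    total = sum(m)
    # C_i: sub-band index (1-based) weighted by sub-band energy M_j.
    centroid = (
        sum(j * mj for j, mj in enumerate(m, start=1)) / total if total else 0.0
    )
    # R_i: root mean square of the M_j over all sub-bands (32 in the text).
    amplitude = math.sqrt(sum(mj * mj for mj in m) / len(m))
    return centroid, amplitude
```

A frame whose energy sits entirely in sub-band 4 yields a centroid of 4, which matches the intuition that C_i points at the dominant band.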

On the basis of the above embodiment, the step in step 130 of comparing the music style feature information of the audio data to be played and the currently-played audio data comprises: comparing the magnitudes of the corresponding parameter values of the audio data to be played and the currently-played audio data within the preset sample interval, based on the parameter values characterizing the music style feature information extracted within the preset sample interval. Then, in step 140, the music style type of the audio data to be played relative to the currently-played audio data is judged from the result of the magnitude comparison, giving the comparison result. For example, as shown in Fig. 3, if the parameter value of the audio data to be played is greater than the corresponding parameter value of the currently-played audio data, step 142 is performed and the style is judged to be louder; if it is less than the corresponding parameter value of the currently-played audio data, step 143 is performed and the style is judged to be softer; if the two are equal, step 141 is performed and the styles are judged to be similar.

Of course, two rounds of judgment may also be performed, to reduce the complexity of audio data processing and to provide accurate music style similarity detection. For example, first, the similarity of the audio data to be played and the currently-played audio data is compared frame by frame based on the music style feature information extracted from individual audio frames, as a first similarity judgment; then, when the audio data to be played is not similar to the currently-played audio data, the magnitudes of the corresponding parameter values of the two within the preset sample interval are compared, based on the parameter values characterizing the music style feature information extracted within the preset sample interval.

This embodiment determines the music style type of the audio data to be played relative to the currently-played audio data by similarity and/or magnitude comparison, for example a comparison result that the audio data to be played is loud and sonorous, soft, or similar relative to the currently-played audio data. Compared with the prior art, which judges the music style of audio data using only the audio file tag, the approach of this embodiment is more accurate and better matches the audio files the user actually plays. The preset sample interval is a preset time range, the time interval containing the portion of the audio data with the highest audio amplitude, or the time interval containing the portion of the audio data with the largest number of high/low frequency switches. The music style type here refers to whether the two pieces of audio data are similar, or whether the audio data to be played is louder or softer.

Step 130 may be implemented in either of the following two ways.

First way: as shown in Fig. 2, after step 120 of the above embodiment has been performed, step 131 is performed first: the similarity of the audio data to be played and the currently-played audio data is compared frame by frame based on the music style feature information extracted from individual audio frames. If they are similar, step 141 is performed and the styles are judged to be similar. If they are not similar, step 132 is performed: the magnitudes of the corresponding parameter values of the audio data to be played and the currently-played audio data within the preset sample interval are compared, based on the parameter values characterizing the music style feature information extracted within the preset sample interval. If the parameter value of the audio data to be played is greater than the corresponding parameter value of the currently-played audio data, step 142 is performed and the style is judged to be louder; if it is less, step 143 is performed and the style is judged to be softer.

Second way: as shown in Fig. 3, after step 120 of the above embodiment has been performed, only step 132 is performed: the magnitudes of the corresponding parameter values of the audio data to be played and the currently-played audio data within the preset sample interval are compared, based on the parameter values characterizing the music style feature information extracted within the preset sample interval. If the parameter value of the audio data to be played is greater than the corresponding parameter value of the currently-played audio data, step 142 is performed and the style is judged to be louder; if it is less, step 143 is performed and the style is judged to be softer; if the two are equal, step 141 is performed and the styles are judged to be similar.
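The two ways can be summarised in one Python sketch. Passing `frames_similar=None` skips the frame-by-frame check and corresponds to the second way. The labels "similar", "louder", and "softer" and the function name `classify_style` are hypothetical, and returning "similar" when the first way finds the frames dissimilar but the parameter values equal is an assumption, since the text does not cover that case.

```python
def classify_style(next_value, current_value, frames_similar=None):
    """Judge the style of the audio to be played relative to the current audio.

    frames_similar: result of the frame-by-frame similarity check of
    step 131, or None when only step 132 is performed."""
    if frames_similar:
        return "similar"   # step 141, reached through step 131
    if next_value > current_value:
        return "louder"    # step 142
    if next_value < current_value:
        return "softer"    # step 143
    return "similar"       # step 141, equal parameter values
```

The first way saves the magnitude comparison whenever the frame-by-frame check already reports similarity.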

Once the comparison result of step 130 has been obtained through the above steps, the music style type of the audio data to be played relative to the currently-played audio data can be determined in step 140. If the steps of Fig. 2 or Fig. 3 judge the styles to be similar, step 151 is performed: the obtained volume gain value is zero and, after playback of the currently-played audio data ends, the current sound volume output value is kept unchanged. If the steps of Fig. 2 or Fig. 3 judge the style to be louder, step 152 is performed: the obtained volume gain value is a preset decibel value or any value within a preset decibel interval and, after playback of the currently-played audio data ends, this volume gain value is subtracted from the current sound volume output value, to turn down the sound volume output value for the audio data to be played. If the steps of Fig. 2 or Fig. 3 judge the style to be softer, step 153 is performed: the obtained volume gain value is a preset decibel value or any value within a preset decibel interval and, after playback of the currently-played audio data ends, this volume gain value is added to the current sound volume output value, to turn up the sound volume output value for the audio data to be played.

On the basis of the above embodiment, after steps 110 to 140 have been performed, the volume gain value in step 140 may be obtained by the following steps, as shown in Fig. 4:

First, step 154 is performed: according to the determined music style type, a two-dimensional dynamic array structure that records the association between music style types and volume gain values is looked up. For example, the volume gain value for a similar style is zero, the volume gain value for a louder style is a first decibel value or any value within a first decibel interval, and the volume gain value for a softer style is a second decibel value or any value within a second decibel interval.

Then, step 155 is performed: the volume gain value for the corresponding music style type is obtained;

Next, step 156 is performed: the sound volume output value of the currently-played audio data is obtained;

Finally, step 157 is performed: when the audio data to be played is played, the obtained volume gain value is added to the above sound volume output value. The first decibel value or first decibel interval is a negative number or a negative range, and the second decibel value or second decibel interval is a positive number or a positive range.
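Steps 154 to 157 amount to a table lookup followed by an addition, which can be sketched in Python as follows. The table below is a plain dictionary rather than a two-dimensional dynamic array, and the gain values of -3 dB and +3 dB are placeholders chosen for illustration; the text only requires the first value to be negative and the second to be positive.

```python
# Hypothetical association of style type and gain in decibels (step 154).
GAIN_TABLE_DB = {
    "similar": 0.0,   # similar style: keep the volume
    "louder": -3.0,   # first decibel value, negative (placeholder)
    "softer": 3.0,    # second decibel value, positive (placeholder)
}


def next_volume(style_type, current_volume_db):
    """Look up the gain for the style type (step 155) and add it to the
    current sound volume output value (steps 156 and 157)."""
    gain = GAIN_TABLE_DB[style_type]
    return current_volume_db + gain
```

Because the current sound volume output value is read at the time of the lookup, any adjustment the user made during the current piece carries over to the next one.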

Based on the above embodiments, the method of this embodiment for comparing, frame by frame, the similarity of the audio data to be played and the currently-played audio data is implemented by steps 133 to 138 in Fig. 5. In this embodiment, step 120 extracts, from the audio data to be played and the currently-played audio data, the parameter values characterizing the music style feature parameter information for each audio frame;

Step 133: calculating, using a weighted Euclidean distance formula, the Euclidean distance of the parameter values of each audio frame;

Step 134: sorting the Euclidean distance values calculated for the audio frames;

Step 135: finding the minimum Euclidean distance value;

Step 136: judging whether the minimum Euclidean distance value is less than a threshold; if so, performing step 137: outputting a judgment that the audio data to be played is similar to the currently-played audio data; otherwise, performing step 138: outputting a judgment that the audio data to be played is not similar to the currently-played audio data.
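Steps 134 to 138 can be sketched in Python as follows, taking the distances of step 133 as already computed. The function name `is_similar` is hypothetical, and the threshold is a value the implementer would have to choose.

```python
def is_similar(distances, threshold):
    """Return True if the smallest distance is below the threshold."""
    if not distances:
        raise ValueError("at least one distance value is required")
    ordered = sorted(distances)    # step 134: sort the distance values
    smallest = ordered[0]          # step 135: minimum distance value
    return smallest < threshold    # step 136: True -> step 137, False -> step 138
```

Sorting is kept here to mirror the listed steps; taking `min(distances)` directly would give the same result.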

Taking as an example the centroid value C_i reflecting the fundamental frequency band of the audio frame signal and the audio amplitude R_i characterizing the strength of note vibration, both computed for each audio frame, the similarity judgment is described in detail below.

Using the weighted Euclidean distance formula, the Euclidean distances W_k and G_k of the centroid value C_i and the audio amplitude R_i are calculated respectively in the manner shown in Fig. 6. The k calculated Euclidean distances are sorted and the minimum values W_kmin and G_kmin are found. It is then judged whether W_kmin and G_kmin are each less than or equal to the corresponding set threshold. If so, the currently-played audio data is similar to the audio data to be played; otherwise they are not similar. The two thresholds are set values.

W_{k} = \sum_{i}^{N_{d}} \frac{(C_{i} - C_{i}^{(k)})^{2}}{(C_{i}^{2} - C_{i}^{(k)2})^{2}}, \quad k = N_{d} - N_{b}

G_{k} = \sum_{i}^{N_{d}} \frac{(R_{i} - R_{i}^{(k)})^{2}}{(R_{i}^{2} - R_{i}^{(k)2})^{2}}, \quad k = N_{d} - N_{b}

Here, C_i denotes the centroid of the i-th frame of the audio data to be played, C_i^{(k)} denotes the centroid of the i-th frame of the currently playing audio data, N_d denotes the number of audio frames of the audio data to be played, N_b denotes the number of audio frames of the currently playing audio data, R_i denotes the audio amplitude of the i-th frame of the audio data to be played, and R_i^{(k)} denotes the audio amplitude of the i-th frame of the currently playing audio data.

Building on the embodiments above, the method in step 130 of comparing, based on the parameter values characterizing music style feature information extracted within the preset sample interval, the magnitudes of the corresponding parameter values of the audio data to be played and the currently playing audio data within that interval comprises the following steps:

From the parameter values that were extracted in step 120 from the audio data to be played and the currently playing audio data and that characterize the music style feature parameter information of each audio frame, select as samples the parameter values of the audio frames located within the preset sample interval. Here the preset sample interval is a preset time range, the time interval in which the highest-amplitude portion of the audio data lies, or the time interval in which the portion of the audio data with the most high/low-frequency switches lies;

Use the sample mean estimation method to obtain the trend estimate of the music style feature parameter information within the preset sample interval;

Compare the magnitudes of the trend estimates corresponding to the audio data to be played and the currently playing audio data.

The parameter value comparison is described in detail below, taking as examples the centroid value C_i, which reflects the basic frequency band of the audio frame signal, and the audio amplitude R_i, which characterizes the strength of note vibration, corresponding to each audio frame.

The time interval in which the highest-amplitude portion of the audio data lies is defined as the preset sample interval x. The centroid value C_i and audio amplitude R_i of each frame in this interval are then selected, and the sample mean estimation method is used to calculate the mean of the parameter values of the audio frames within the preset sample interval. This mean is taken as the trend estimate and gives a rough evaluation of the basic frequency band and the amplitude in the interval where the audio amplitude is highest.

\mu_{c} = \frac{\sum_{i}^{N_{d(x)}} C'_{i}}{N_{d(x)}}, \quad \mu_{c}^{(k)} = \frac{\sum_{i}^{N_{b(x)}} C'^{(k)}_{i}}{N_{b(x)}};

\mu_{R} = \frac{\sum_{i}^{N_{d(x)}} R'_{i}}{N_{d(x)}}, \quad \mu_{R}^{(k)} = \frac{\sum_{i}^{N_{b(x)}} R'^{(k)}_{i}}{N_{b(x)}};

Here, C'_i denotes the centroid of the i-th frame within the preset sample interval x of the audio data to be played, C'_i^{(k)} denotes the centroid of the i-th frame within the preset sample interval x of the currently playing audio data, N_d(x) denotes the number of audio frames within the preset sample interval x of the audio data to be played, N_b(x) denotes the number of audio frames within the preset sample interval x of the currently playing audio data, R'_i denotes the audio amplitude of the i-th frame within the preset sample interval x of the audio data to be played, and R'_i^{(k)} denotes the audio amplitude of the i-th frame within the preset sample interval x of the currently playing audio data.
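The sample mean estimation above can be illustrated with a short Python sketch. It rests on assumptions that the text does not fix: the preset sample interval x is modeled as a fixed-width window of `width` frames with the largest total amplitude, and the helper names `loudest_window` and `trend_estimates` are hypothetical.

```python
from statistics import fmean


def loudest_window(amplitudes, width):
    """Return (start, end) of the width-frame window with the largest amplitude sum.

    Assumption: this window stands in for the preset sample interval x.
    """
    if not 0 < width <= len(amplitudes):
        raise ValueError("width must be between 1 and the number of frames")
    running = sum(amplitudes[:width])
    best_start, best_sum = 0, running
    for start in range(1, len(amplitudes) - width + 1):
        # Slide the window one frame to the right.
        running += amplitudes[start + width - 1] - amplitudes[start - 1]
        if running > best_sum:
            best_start, best_sum = start, running
    return best_start, best_start + width


def trend_estimates(centroids, amplitudes, width):
    """Sample means of centroid and amplitude over the selected interval."""
    start, end = loudest_window(amplitudes, width)
    return fmean(centroids[start:end]), fmean(amplitudes[start:end])


# Hypothetical per-frame values for one clip.
mu_c, mu_r = trend_estimates([100, 200, 300, 400, 500], [1, 5, 6, 2, 1], width=2)
print(mu_c, mu_r)
```

Running the same function on the audio data to be played and on the currently playing audio data yields the two pairs of trend estimates whose magnitudes are then compared.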

Building on the method embodiments above, this embodiment further comprises the following steps:

Store the adjustment result of the volume output value of each audio data item to be played, and record it in association with that audio data;

According to the magnitude of the adjustment results, arrange the audio data in interleaved order, or collect and arrange the audio data whose adjustment results fall within a preset interval range, to form a playlist. The purpose is to let the user, as desired, listen to audio data within a certain decibel range, that is, to music of the same style, or listen to music of different styles in alternation.

The method may further comprise: providing a parameter adjustment interface for updating the preset interval range of the adjustment results according to user needs, so that playlist selection is more user-friendly and more intelligent.
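The two playlist arrangements can be sketched as follows. This is an illustration only: the stored adjustment results are assumed to be a mapping from a track identifier to a value in decibels, "interleaved" is interpreted as alternating the lowest and highest remaining results, and both function names are hypothetical.

```python
def group_by_range(results, low, high):
    """Tracks whose stored adjustment result lies within [low, high]."""
    return [track for track, value in results.items() if low <= value <= high]


def interleave_by_result(results):
    """Alternate tracks with the lowest and highest remaining adjustment results.

    Assumption: this is one possible reading of an interleaved arrangement.
    """
    ordered = sorted(results, key=results.get)
    playlist = []
    lo, hi = 0, len(ordered) - 1
    while lo <= hi:
        playlist.append(ordered[lo])
        if lo != hi:
            playlist.append(ordered[hi])
        lo += 1
        hi -= 1
    return playlist


# Hypothetical stored adjustment results, keyed by track identifier.
stored = {"a": -3.0, "b": 2.0, "c": 0.5, "d": -1.0}
print(group_by_range(stored, -1.5, 1.0))
print(interleave_by_result(stored))
```

The `low` and `high` bounds correspond to the preset interval range that the parameter adjustment interface would let the user update.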

Based on the above method, as shown in Fig. 7, this embodiment provides a device for adjusting the volume output value of audio data, comprising:

Audio data extraction module 1, for reading the audio data to be played and the currently playing audio data;

Feature information extraction module 2, for extracting the music style feature information of the audio data to be played and the currently playing audio data;

Feature information comparison module 3, for comparing the music style feature information of the audio data to be played and the currently playing audio data to obtain a comparison result; and

Output module 5, for calculating, according to the comparison result, the volume gain value relative to the volume output value of the currently playing audio data, and for adjusting the volume output value of the audio data to be played according to the volume gain value after playback of the currently playing audio data ends.

The audio data extraction module 1, feature information extraction module 2, feature information comparison module 3, and output module 5 implement steps 110 to 140 above respectively, so their implementation details can be found in the related description above and are not repeated here.
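One way to picture how the modules chain together is the Python sketch below. It is a structural illustration under assumptions: the feature extraction, comparison, and gain rules are passed in as caller-supplied functions because this section does not fix them, the gain is assumed to be added to the current volume output value, and the class and field names are hypothetical.

```python
from dataclasses import dataclass
from typing import Any, Callable, Sequence


@dataclass
class VolumeRegulator:
    # Each callable stands in for one module; its internals are assumptions.
    extract_features: Callable[[Sequence[float]], Any]   # module 2
    compare: Callable[[Any, Any], Any]                   # module 3
    gain_from_comparison: Callable[[Any], float]         # part of module 5

    def next_volume(self, pending_audio, current_audio, current_volume):
        """Volume output value to apply once the current audio stops playing."""
        pending_features = self.extract_features(pending_audio)
        current_features = self.extract_features(current_audio)
        result = self.compare(pending_features, current_features)
        # Assumption: the gain is additive relative to the current volume.
        return current_volume + self.gain_from_comparison(result)


# Toy stand-ins: peak level as the feature, its difference as the comparison.
regulator = VolumeRegulator(
    extract_features=lambda audio: max(abs(x) for x in audio),
    compare=lambda pending, current: pending - current,
    gain_from_comparison=lambda diff: -1.0 if diff > 0 else 1.0,
)
print(regulator.next_volume([0.1, 0.9], [0.2, 0.3], 50.0))
```

Reading the two audio inputs, which is the role of module 1, is left to the caller in this sketch.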

Based on the method for above-mentioned Fig. 2 and Fig. 3, above-mentioned characteristic information comparing module 3 comprises following two function modules:

Similarity determination module, for the bent wind characteristic information extracted based on single audio frequency frame, the similarity of the above-mentioned audio frequency data to be played of comparison and current playing audio-fequency data frame by frame, carries out sentencing at the beginning of similarity;

Comparison module, for non-similar time again based on the parameter value of bent wind characteristic information of the sign extracted in above-mentioned default sample region, compare above-mentioned audio frequency data to be played and current playing audio-fequency data in above-mentioned default sample region in the size of corresponding parametric values.

Based on the method for Fig. 5, in above-mentioned characteristic information comparing module 3, similarity determination module can form by with lower unit:

Euclidean distance calculates unit, for based on from above-mentioned audio frequency data to be played and current playing audio-fequency data, the parameter value characterizing above-mentioned bent wind characteristic parameter information that each audio frequency frame of extracting is corresponding, utilize weighted euclidean distance calculation formula to calculate the Euclidean distance of parameter value corresponding to each audio frequency frame respectively;

Judging unit, for finding out minimum Euclidean distance value, judges whether above-mentioned minimum Euclidean distance value is less than a threshold value, if then exporting the result of determination that above-mentioned audio frequency data to be played are similar to current playing audio-fequency data; Otherwise, then the result of determination that above-mentioned audio frequency data to be played are not similar to current playing audio-fequency data is exported.

Based on the above method, the comparison module in the feature information comparison module 3 may comprise:

Sample extraction unit, for selecting as samples the parameter values of the audio frames located within the preset sample interval, from the parameter values that were extracted from the audio data to be played and the currently playing audio data and that characterize the music style feature parameter information of each audio frame;

Estimation unit, for using the sample mean estimation method to obtain the trend estimate of the music style feature parameter information within the preset sample interval; and

Comparison unit, for comparing the magnitudes of the trend estimates corresponding to the audio data to be played and the currently playing audio data.

With the above method and system, the volume can be turned up or down automatically according to factors such as the music style, which avoids damage to the user's hearing. The user also does not need to adjust the volume manually, which improves the user experience.

In addition, based on the method for volume output value and the device of above-mentioned adjustment audio frequency data, it is also possible to being applied to a kind of terminal, this terminal can be the terminal with audio player such as mobile phone, electronic toy, earphone, computer, IPAD. This terminal comprises following element:

1, the communication module of audio frequency data can be obtained from outside network; This communication module can be the communication module meeting the communications protocol such as 4G, 5G, TDMA.

2, for the storing device of stores audio data; This storing device can be the built-in terminal such as flash memory, hard disk, USD, SD card or the various types of storing devices being placed on terminal.

3, audio playing module, for the audio frequency data acquisition electrical signal play in described storing device, and read audio frequency data to be played and current playing audio-fequency data, extract the bent wind characteristic information of described audio frequency data to be played and current playing audio-fequency data and compare, obtain comparison result, according to described comparison result, calculate the volume gain value of the volume output value relative to described current audio plays, regulate the volume output value of described audio frequency data to be played according to described volume gain value after terminating to play current playing audio-fequency data, described electrical signal is adjusted according to this volume output value. this audio playing module is mainly used for realizing the broadcasting to audio frequency data, exports by audio frequency data and forms electrical signal, and the content based on step described in above-mentioned steps 110 to step 140 realizes the adjustment to electrical signal. such as, the amplitude of above-said current signal is adjusted according to volume output value. specific implementation details about audio playing module can join the related description in seing above. audio playing module in the present embodiment can be audio player built-in in terminal.

4, dio Output Modules, comprises for exporting the audio output interface of described electrical signal or described electrical signal is converted into the transverter of acoustical signal. Dio Output Modules can be the audio output interface of the equipment such as a kind of accessible loud speaker, earphone, such as, if terminal is computer, then only needs to provide audio output interface, and by loud speaker, electrical signal is converted into the sound that the mankind can hear. Again such as, dio Output Modules can be the transverter being converted into acoustical signal for electrical signal being built in terminal, and such as dio Output Modules can be that carry in terminal outer puts equipment, and so, terminal can carry outer playing function.
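Adjusting the amplitude of the signal according to a volume value, as the audio playback module does, can be illustrated on digital samples. The sketch below assumes that the gain is expressed in decibels and that the signal is 16-bit PCM; neither is stated for the terminal, and the function name `apply_gain_db` is hypothetical. The conversion from decibels to a linear amplitude factor, 10^(dB/20), is the standard one.

```python
def apply_gain_db(samples, gain_db, full_scale=32767):
    """Scale integer PCM samples by a gain in decibels, clipping to 16-bit range.

    Assumption: the volume gain is given in dB and samples are 16-bit PCM.
    """
    factor = 10 ** (gain_db / 20)  # amplitude ratio for a gain in dB
    scaled = (round(s * factor) for s in samples)
    # Clip so that boosted samples cannot overflow the 16-bit range.
    return [max(-full_scale - 1, min(full_scale, s)) for s in scaled]


print(apply_gain_db([1000, -1000, 30000], 20.0))
```

Clipping is the simplest safeguard against overflow; a real player might instead limit the gain or apply a limiter, which is outside the scope of this sketch.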

The technical features or technical solutions of the embodiments above may be combined arbitrarily. For brevity, not all possible combinations of the technical features in the embodiments above are described; however, as long as a combination of these technical features involves no contradiction, it should be considered within the scope of this specification.

The embodiments above express only several implementations of the present invention. Their description is relatively specific and detailed, but it should not therefore be construed as limiting the scope of the patent. It should be noted that a person of ordinary skill in the art may make several variations and improvements without departing from the inventive concept, and these all fall within the protection scope of the present invention. Therefore, the protection scope of this patent shall be determined by the appended claims.

Claims

1. A method for adjusting the volume output value of audio data, characterized in that the method comprises:

reading audio data to be played and currently playing audio data;

calculating, according to the comparison result, a volume gain value relative to the volume output value of the currently playing audio data, and adjusting the volume output value of the audio data to be played according to the volume gain value after playback of the currently playing audio data ends.

2. the method for the volume output value of adjustment audio frequency data according to claim 1, it is characterized in that, described bent wind characteristic information comprises: characterize the frequency values of low-and high-frequency switching times in audio frequency data, the center of mass values characterizing basic frequency band in the strong and weak audio amplitude of note vibrations, reflection audio frequency frame signal, reflection sound signal frequency spectrum data, beginning or ending with preset model in one or more combination.

3. the method for the volume output value of adjustment audio frequency data according to claim 1, it is characterized in that, described audio frequency data to be played and current playing audio-fequency data are the audio frequency data extracted in the audio frequency data extracted in the next files in stream media taking from streaming media playing list respectively and the stream medium data currently play; Or take from same files in stream media next section of part to be played respectively and currently playing audio frequency data corresponding to part.

4. the method for the volume output value of adjustment audio frequency data according to claim 1, it is characterised in that, the step of audio frequency data to be played described in described comparison and the bent wind characteristic information of current playing audio-fequency data comprises:

Based on the bent wind characteristic information that single audio frequency frame extracts, the similarity of audio frequency data to be played and current playing audio-fequency data described in comparison frame by frame, carry out sentencing at the beginning of similarity, non-similar time again based on the parameter value for characterizing bent wind characteristic information extracted in default sample region, compare described audio frequency data to be played and current playing audio-fequency data in described default sample region in the size of corresponding parametric values;

Or, based on the parameter value for characterizing bent wind characteristic information extracted in default sample region, compare described audio frequency data to be played and the size of current playing audio-fequency data interior corresponding parametric values in described default sample region.

5. the method for the volume output value of adjustment audio frequency data according to claim 4, it is characterized in that, it is the time interval at the maximum portion of audio data place of the time range, the time interval at audio frequency data sound intermediate frequency amplitude highest portion point place or the low-and high-frequency switching times that pre-set between described default sample region.

6. the method for the volume output value of adjustment audio frequency data according to claim 4, it is characterized in that, described based on single audio frequency frame extract bent wind characteristic information, described in comparison, the determination step of audio frequency data to be played and current playing audio-fequency data similarity comprises frame by frame:

7. the method for the volume output value of adjustment audio frequency data according to claim 4, it is characterised in that, the described method comparing described audio frequency data to be played and current playing audio-fequency data interior corresponding parametric values size in described default sample region comprises:

8. A device for adjusting the volume output value of audio data, characterized in that the device comprises:

an output module, for calculating, according to the comparison result, a volume gain value relative to the volume output value of the currently playing audio data, and for adjusting the volume output value of the audio data to be played according to the volume gain value after playback of the currently playing audio data ends.

9. The device for adjusting the volume output value of audio data according to claim 8, characterized in that the feature information comparison module comprises:

10. The device for adjusting the volume output value of audio data according to claim 8, characterized in that the feature information comparison module comprises:

11. A terminal, characterized by comprising:

a storage device for storing audio data;