Great idead Bento.
Slightly OT: m4a and FLAC don’t use ID3v2 tagging. Having said that, has anybody really seen the ID3v2 standard and what it’s capable of? A few exerpts from the standard taken from www.id3.org. Keep in mind, this is already implemented as a standard available to anyone who can read, including Native Instruments and Ableton.
-Musical Events(can anybody say Hotcues or Clean/Explicit songs with only 1 file by bleeping/reversing during the event). This frame allows synchronisation with key events in the audio. The type of event is as follows:
$00 padding (has no meaning)
$01 end of initial silence
$02 intro start
$03 main part start
$04 outro start
$05 outro end
$06 verse start
$07 refrain start
$08 interlude start
$09 theme start
$0A variation start
$0B key change
$0C time change
$0D momentary unwanted noise (Snap, Crackle & Pop)
$0E sustained noise
$0F sustained noise end
$10 intro end
$11 main part end
$12 verse end
$13 refrain end
$14 theme end
$15 profanity
$16 profanity end
-Synchronised tempo codes (beat grids that would be stored in a common format, available to all DJ programs, maybe even released to the public with the beatgrids already established)
For a more accurate description of the tempo of a musical piece, this
frame might be used. After the header follows one byte describing
which time stamp format should be used. Then follows one or more
tempo codes. Each tempo code consists of one tempo part and one time
part. The tempo is in BPM described with one or two bytes. If the
first byte has the value $FF, one more byte follows, which is added
to the first giving a range from 2 - 510 BPM, since $00 and $01 is
reserved. $00 is used to describe a beat-free time period, which is
not the same as a music-free time period. $01 is used to indicate one
single beat-stroke followed by a beat-free period.
The tempo descriptor is followed by a time stamp. Every time the
tempo in the music changes, a tempo descriptor may indicate this for
the player. All tempo descriptors MUST be sorted in chronological
order. The first beat-stroke in a time-period is at the same time as
the beat description occurs. There may only be one “SYTC” frame in
each tag.
-Synchronised lyrics/text (VJ’ing/Karaoke made easy, without buying new songs)
This is another way of incorporating the words, said or sung lyrics,
in the audio file as text, this time, however, in sync with the
audio. It might also be used to describing events e.g. occurring on a
stage or on the screen in sync with the audio. The header includes a
content descriptor, represented with as terminated text string. If no
descriptor is entered, ‘Content descriptor’ is $00 (00) only.
These are just 3 I beleive have potiential.