Related link: http://www.gonze.com
MusicBrainz uses audio fingerprints as primary keys to link metadata from different rips of the same song. To do this it uses a toolkit called TRM from a company called Relatable. TRM ids have severe drawbacks.
One, TRM generates way too many false negatives. In my testing I found that it was barely better at finding duplicate files than a byte hash like sha1. Two, it is closed source and probably encumbered by patents, so it can't become an open standard for audio fingerprints, and it can't be tweaked to support the needs of third party applications.
It's extremely clueful for MusicBrainz to use audio fingerprints for primary keys, a genuine innovation even. But MB is a metadata project, not a fingerprinter, and they couldn't use an better fingerprinter because one doesn't exist.
Details on my weblog.
Categories
Audio- tags:
- cd ripping
- mp3
Comments (7)
Read More Entries by Lucas Gonze.
7 Comments
Search
Topics of Interest
- xray
- xp
- xml
- X11
- wwdc
- workshop
- workflow
- wishlist
- windows server 2008
- windows mobile
- windows
- wiki
- wii
- wifi problem
- wifi
- webinar
- webcasting
- web services
- web
- waikiki
- voip
- vmware fusion
- vmware
- visual arts
- virtualization
- virtual synth
- video recording
- video production
- video casting
- video
- vector drawing
- utilities
- user interface
- usb headset
- usability
- upgrade
- update
- unix
- underexpose
- ui
- tv
- tsa
- tripod
- travel
- transcribe
- tone
- tips and tricks
- time machine
- time exposure
- tim grey
- tiger
- the digital story
- the creative beat
- The Creative Beat
- testing
- terminal
- temperature
- telephoto
- telecommuting
- tax
- tasmania
- synthesizers
- surround sound
- sun light exposure
- studio recording
- Studio Portraits
- studio
- store
- stop-motion
- stock
- statistics
- star trails
- stacking
- sql
- spiral
- special effects
- speakers
- spam
- sound effects
- software distribution
- Software
- software
- social networking
- soap
- smartart
- smart homes
- slideshow
- skyscrapers
- skydrive
- silverlight
- Sierra
- shutter speed
- sharpening
- sensor
- sensitivity
- security
- search engines
- sdmi
- sdk
- scott kelby
- science
- scanner
- scale
- saturate
- san francisco
- safari
- ruby
- rss
- rick smolan
- rick sammon
- rich web services
- riaa
- ria
- review
- retouching
- remote desktop
- reflections
- realnetworks
- rdc
- raw
- quicktime
- python
- purchase
- proofs
- programming
- productivity
- production
- process color
- process
- print on demand
- previews
- presentations
- preemie
- powerpoint 2008
- power
- point reyes
- podcasting
- podcast
- pod
- php
- photoshop world
- photoshop cs4
- Photoshop
- photoshop
- photoshoot
- photogrpahy business
- photogrpahy
- Photography
- Photographing People
- photographers
- photogram
- photo workshops
- photo titles
- photo teaching
- photo schools
- photo production
- photo organizing
- photo editing
- photo developing
- photo composition
- photo composite
- photgraphers
- phone
- perl
- performance
- patterns
- parallels
- papau new guinea
- pantone
- palm os
- owl
- os x
- organize
- openoffice
- opendns
- open source
- onlamp
- one-on-one
- olympics
- old media
- office live workspace
- office 2008
- office 2004
- office
- odbc
- obj-c
- nt
- notes
- noise
- nikon
- night photography
- night
- nicu
- newsletter
- news
- new media
- networking
- nda
- nature photography
- namibia
- mysql
- music software
- music production
- music composition
- music
- multiple exposure
- mpaa
- mp3 player
- mp3
- movies
- mouse
- motion
- moon
- monochrome
- mongolia
- mongol
- mobile phone
- mobile
- missing sync
- midi
- microsoft windows
- microsoft office 2008
- microsoft office
- microsoft
- microphones
- micro blogging
- messenger
- memory cards
- masking
- manuals
- make
- macworld
- macromedia
- macro lens
- macro
- macbook air
- mac os x
- mac heist
- mac
- luminance
- long exposure
- linux
- lightroom
- light
- lifechat
- licensing
- leopard
- lego mindstorms
- Las Vegas
- landscape
- korg
- keynote
- keyboard
- katrin eismann
- katie rose
- jxta
- jumping
- journal
- jini
- jeff schewe
- javascript
- java
- january upgrade
- j2ee
- ixmf
- itunes
- iso
- ipod touch
- ipod
- iphoto
- iphone development
- iphone 2
- iphone
- interview
- internet broadcasting
- intellectual property
- instant messaging
- industry
- indesign
- Indesign
- imposition
- image editing
- illustrator
- idisk
- ibm
- how-to
- honolulu
- hobby
- history
- histogram
- high ISO
- high focal range
- high dynamic range
- high dyanmic range
- hfr
- hexachrome
- HDR
- HD still/video
- HD
- hawaii
- harold davis
- hardware
- hands
- hacks
- gtd
- greg gorman
- graphics
- grain
- grab shot
- golden gate
- gimp
- games
- game audio
- game
- fusion
- fun
- ftp
- freeware
- formats
- focus stacking
- flowers
- flip4mac
- flickr
- flash
- firmware
- firefox
- filmmaking
- filemaker
- file transfer
- file sharing
- file management
- file compression
- fair use
- Face to Face
- exposure
- excel 2008
- excel
- evernote
- event
- error
- equalizers
- environmental
- entourage
- entertainment
- encryption
- embedded
- electricity
- effects
- education
- editor
- dynamic range
- dvd
- dslr
- drums
- drma
- drm
- dot mac
- domains
- documentation
- dns
- dmca
- dma
- distribution models
- distributed
- digitial photogrpahy
- digital SLRs
- Digital Photogrpahy
- Digital photogrpahy
- digital photogrpahy
- digital photography companion
- digital photography
- digital media
- digital darkroom
- development
- design
- derrick story
- dekepod
- deke
- dcma
- database
- data design
- d300
- CS4
- cross-processing
- creativity
- Creative Suite
- creative exposure
- corporate computing
- converter
- conversion
- content
- connectivity
- conference
- computer music
- composition
- composite
- command line
- color balance
- coldfusion
- coding
- code
- cocoa
- cmyk
- cloudy
- clone
- chiaroscuro
- cgi
- cd ripping
- Canon 5D Mark II
- canon
- cameras
- business
- budget
- browsers
- bridge
- books
- book covers
- bluetooth
- blogging
- black and white
- birding
- big cats
- beta invitation
- bento
- beijing
- battery
- backup
- audio recording
- audio production
- audio mixing
- audio editing
- audio compression
- art
- aqua
- appletv
- applescript
- applecare
- apple store
- apple
- app store
- api
- aperture
- apache
- aol
- annoyances
- animation
- aim
- ai
- adobe air
- adobe
- Adobe
- adjustment layers
- actionscript
- 3d
- 100 views
- .net
Related Books
Recent Blog Entries
- Apple's Software User Guide Diet
- Reality Leaves a Lot to the Imagination
- Audio Performance at 120 MPH!!!
- Waikiki Apple Store Grand Opening
- Katie Rose in Chiaroscuro
- Skin Your Synth (etc.)
- Canon 5D Mark II - The Coolest Video
- How Important is a Caption?
- LightZone 3.6 Basic Edition: A Photo Experimentation Tool
- Firefox 3.0.2 Fixes Three Mac Specific Issues

how to make friends and influence people
I agree fully that open/free is better for many reasons and remember that my initial mistaken reaction was to a brief intro. I just think you may be factually incorrect in your assesment of trm and your test seemed rather lax which undermines the parts of your argument I agree with.
That 94.4% figure you quote, for example, I could be wrong but I think higher is better, since the other 5.6% give false positives, i.e. identify more than one unique track.
The figure that perhaps you meant to quote is the 24.9% of tracks that have more than one trm identifier which means a newly generated trm may not match one held in the DB even if they are the exact same song.
But obviously some margin of error is necessary given different ripping techniques and encoding schemes combined wih plain user error and dirty CD's making the same song 'different', so whether this is particularly high or low I can't tell.
how to make friends and influence people
Well, whether a bleeped version is the same as the original is an application-level decision. Sometimes you want less exact matching. And that's pretty much my point -- applications are not able to make that decision for themselves because there's no open solution.
how to make friends and influence people
I was referring more to concentrating on 'false negatives' of files when you didn't *know* they where the same song (thinking in particular of bleeped pofanities on single versions of rap songs) and no mention of the balance between false-positives and false-negatives.
how to make friends and influence people
You should repeat my tests, then. This would be a good thing anyway, because we need to know more details about when TRM identifiers do work.
*However*, MusicBrainz' own database stats confirm my results. See http://www.musicbrainz.org/stats.html, and check out the number of TRM IDs which identify exactly one track -- 94.4%!
how to make friends and influence people
Sorry, my mistake, I didn't realise this was a summary as the link was to the domain rather than a permalink to an individual article.
The piece as a whole sounds much more reasonable (though I have doubts about your testing methodology, but I assume it is merely a rhetorical device).
read full entry at http://gonze.com
I'm not sure if you've read the whole (much longer) piece at http://gonze.com, because I do go into the kind of detail you're talking about. The format of these O'Reilly weblogs doesn't really emphasize that link -- I'll edit to do that.
The purpose is to start a conversation about open standards for audio fingerprints used as primary keys. It's extremely clueful for MusicBrainz to use audio fingerprints for primary keys, a genuine innovation even. But MB is a metadata project, not a fingerprinter, and they couldn't use an open fingerprint because one doesn't exist.
how to make friends and influence people
Are you trying to start a conversation or a fight?
I think you would achieve whatever goals you have if your criticism was a bit more constructive (or even informative).
Are there free alternatives?
You claim it is not a great tech, is it really better than nothing?
What do you want me, i.e. the general oreilly reading public, to do about it?