I have a fairly large music collection. It is well organized and properly named and tagged because my Plex server requires it and all in either FLAC or MP3. I recently inherited an additional music collection from a deceased friend. It is NOT well organized and includes OGG, M4A, WMA, WAV formats. There are duplicates files and albums, misfiled albums, incorrectly tags, damaged files, misspellings, bad folder names, etc. Before I can even consider adding to my collection, I need to clean out any duplicates - both within this collection and then by comparing to my own, re-catalog anything I'm interested in, check/correct the tags, and so on. This is going to require many separate and lengthy tasks. There isn't a single tool that I've found that will accomplish more than one task. I did find a replacement for the older FSLINT program that is current and a flatpack: Czkawka
Here are/were the tasks ahead of me:
CONVERT FILE TYPES:
I used FIND and FFMPEG to batch convert all the files to either MP3 or FLAC and then deleted the sources. That was pretty easy except I could never figure out how to correctly get the file extensions right. I.e. converting "somefile.wma" resulted in "somefile.wma.flac". I finally gave up and left it that way. Here's the command I used:
I haven't tried to fix this yet.
To delete them after conversion, I used "find" to:
TAG FILES CORRECTLY:
I use "MusicBrainz Picard" and it works very well until it encounters a mistagged group of files. Then I have to manually search through the MiuscBrainz database and try and find a match. This is very time consuming. Thankfully, 60-70% are close enough the Picard identifies them. Unfortunately, the search function on the MusicBrainz websight is not very helpful. For example, the smallest difference in the album name will fail the search: searching "Face Value" finds the correct album, but searching "Face Value [Deluxe Version] fails completely. One small plus here is Picard will rename the files when cataloged so the ".wav.flac" extenions are corrected during the tagging process.
FIND DUPLICATE FILENAMES / ALBUMS:
It helps having only two file types (FLAC and MP3), but there still are duplicates files that are of different file types. I need to figure out how to compare file names without looking at extensions, then determine which albums I should keep or delete. I'm haven't solved this one quite yet but I just discovered "Czkawka" (formally fslint) has this ability by matching on only name and no other criteria.
The hard part is having to decide on solid enough criteria that I can automate a task. For example, I prefer flac over mp3 if I have the same song, so deleting an mp3 instead of flac would be necessary.
Ultimately, I would prefer:
I have some work ahead of me!
Here are/were the tasks ahead of me:
CONVERT FILE TYPES:
I used FIND and FFMPEG to batch convert all the files to either MP3 or FLAC and then deleted the sources. That was pretty easy except I could never figure out how to correctly get the file extensions right. I.e. converting "somefile.wma" resulted in "somefile.wma.flac". I finally gave up and left it that way. Here's the command I used:
Code:
find . -type f -iname '*.m4a' -exec ffmpeg -i {} {}.flac \;
To delete them after conversion, I used "find" to:
- list all the extensions
- count and make a note the number of each type
- run the ffmpeg conversion
- verify the conversion succeeded by doing the math
- i.e. n(flac)+n(wav) before conversion should equal n(flac) after the wav conversion
- using the "-delete' option of find to delete the converted files
TAG FILES CORRECTLY:
I use "MusicBrainz Picard" and it works very well until it encounters a mistagged group of files. Then I have to manually search through the MiuscBrainz database and try and find a match. This is very time consuming. Thankfully, 60-70% are close enough the Picard identifies them. Unfortunately, the search function on the MusicBrainz websight is not very helpful. For example, the smallest difference in the album name will fail the search: searching "Face Value" finds the correct album, but searching "Face Value [Deluxe Version] fails completely. One small plus here is Picard will rename the files when cataloged so the ".wav.flac" extenions are corrected during the tagging process.
FIND DUPLICATE FILENAMES / ALBUMS:
It helps having only two file types (FLAC and MP3), but there still are duplicates files that are of different file types. I need to figure out how to compare file names without looking at extensions, then determine which albums I should keep or delete. I'm haven't solved this one quite yet but I just discovered "Czkawka" (formally fslint) has this ability by matching on only name and no other criteria.
The hard part is having to decide on solid enough criteria that I can automate a task. For example, I prefer flac over mp3 if I have the same song, so deleting an mp3 instead of flac would be necessary.
Ultimately, I would prefer:
- FLAC over MP3
- My current collection over the new one UNLESS the new one holds FLAC and my current is MP3.
- Pass over any files tagged so poorly they don't register on Musicbrainz for later review.
- Naming folders as the Artist is named: Joe Brown not Brown, Joe and The Cult not Cult, The or just Cult.
- Fixing folder names
- Listing album names AND file types of both music sets
- Comparing albums an eliminating duplicates
- Merging desired additions
I have some work ahead of me!
Comment