| Adamon 2006-01-27, 9:04 pm |
| Hello, I've decided to use Perl for this problem of mine. I'd like to know which specific concepts I should look up so I can solve it quickly. To be clear, I'm not asking anyone to provide me with code, only to tell me what I should search for in the first place.
Task: I need to organize a vast amount of data so I can use it efficiently.
Problem: The data is scattered across multiple discs and my hard drive. It's in different formats, and there are many duplicate files, some with different names.
Solution:
-Gather all information on all the files (metadata) and store it in plain text.
-Compare all the file metadata and weed out obvious duplicates. *
-Sort all original files into "Needed" and "Unneeded".
-Further sort all "Needed" original files by type.
-Burn all "Needed" files to new DVDs and discard old DVDs.
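To make the first step concrete, here is a rough sketch of how I picture it, assuming Perl's core File::Find and File::Basename modules. The helper name collect_metadata and the tab-separated "size, name, path" layout are my own inventions for illustration, not anything standard.

```perl
use strict;
use warnings;
use File::Find;
use File::Basename qw(basename);

# Hypothetical helper: walk $root recursively and return one
# "size<TAB>name<TAB>full path" line per regular file, sorted as strings.
sub collect_metadata {
    my ($root) = @_;
    my @rows;
    find(
        {
            no_chdir => 1,    # keep full paths in $File::Find::name
            wanted   => sub {
                my $path = $File::Find::name;
                return unless -f $path;          # skip directories etc.
                my $size = (stat($path))[7];     # size in bytes
                push @rows, join("\t", $size, basename($path), $path);
            },
        },
        $root,
    );
    my @sorted = sort @rows;
    return @sorted;
}

# Possible usage (the mount point is only an example):
# print "$_\n" for collect_metadata('/media/dvd1');
```

Redirecting that output to a text file per disc would give me the plain-text catalogue I described.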
*In the second step of my "Solution", I want to collect the file sizes down to the byte, so that if two files are the same size AND have the same name, one gets deleted (doesn't matter which). And if two files have the same name but different sizes, one gets renamed with a random symbol (again, doesn't matter which). Or is there an even more precise way to generate some completely unique code from each file (without taking the file name into account) to make sorting out duplicates even easier?
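From what I can tell so far, a checksum (digest) of the file contents might be the kind of "unique code" I'm after. Below is a minimal sketch of that idea, assuming Perl's core Digest::MD5 module. MD5 is just my assumption here, and the helper names file_digest and find_duplicates are made up for illustration. As far as I understand, two files with the same digest are almost certainly identical, though not strictly guaranteed to be.

```perl
use strict;
use warnings;
use Digest::MD5;

# Hypothetical helper: hex MD5 digest of a file's contents.
# The file name plays no part in the result.
sub file_digest {
    my ($path) = @_;
    open(my $fh, '<', $path) or die "Cannot open $path: $!";
    binmode($fh);    # read raw bytes, no newline translation
    my $hex = Digest::MD5->new->addfile($fh)->hexdigest;
    close($fh);
    return $hex;
}

# Hypothetical helper: group paths by content digest and return only
# the groups (array refs) that contain two or more files.
sub find_duplicates {
    my (@paths) = @_;
    my %by_digest;
    push @{ $by_digest{ file_digest($_) } }, $_ for @paths;
    return grep { @$_ > 1 } values %by_digest;
}

# Possible usage (file names are only examples):
# for my $group (find_duplicates('a.avi', 'copy_of_a.avi', 'b.avi')) {
#     print join(' == ', @$group), "\n";
# }
```

Since reading every byte of every file is slow, I would probably compare sizes first and only compute digests for files whose sizes match.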
Once I'm done with this, I'll learn on my own how to add tags to files on my computer or DVD so that I can search for files using their metadata tags that are stored on the HD. Then I'll never have to guess which DVD my files are on! |