43 Folders

Back to Work

Merlin’s weekly podcast with Dan Benjamin. We talk about creativity, independence, and making things you love.

Join us via RSS, iTunes, or at 5by5.tv.

”What’s 43 Folders?”
43Folders.com is Merlin Mann’s website about finding the time and attention to do your best creative work.

Digital Filing System

Hello, I have a question for the hive mind, and this seems like as good of a place as any.

I have a massive number of PDF files (I'm a student/academic type) and no really good organizational strategy for them. I've used DevonThink Pro for a while, and while it's good, I have some issues with it:

- I have about a gig and a half of PDF files, and no really good enduring organizational structure for the files.
- I've found that keeping it running all the time, as I'm wont to do, often is a strain on my somewhat aging laptop. (I have the penultimate g4 15 powerbook model)
- I'm going to have to start from scratch, basically if I want to continue to use it (a long painful story...

So I'm looking for a new idea of how to keep these things organized. I've thought about solutions like Yep!, and Eagle Filer, and Yojimbo, and even staying with DevonThink, but, going through all these files is going to be a pain in the ass, and I want something will be enduring, and scaleable. I have a gig and a half of files after an undergrad degree, and I'm going to start a doctoral program in the next couple of years, so this collection of files is only going to get bigger, so I want to devise a system that can grow with me. This probably means doing something open source/kludgey, but I'd like to consider this more fully before I'm all rash about it.

So, I'm interested in thinking about how to organize all these files, and here's my idea:

Automatically rename all the files, with sequentially numbered file names so "0000001.pdf," "0000002.pdf" etc. and then start a database with the file name, the author of the pdf, the title, journal title the originating project (so like the class name, or paper that I the file entered the system for; this is often a useful data point for me when I'm hunting for something I don't know the name of.)

Then I could dump all the files into a single database (for easy backup) and spotlight could do some indexing, and it might be pretty sweet. Also, theoretically this system could incorporate a book collection as well.

The pros:
- short manageable and consistent file names
- system could scale.
- no eccentric house of cards
- makes me more platform/computer independent

cons:
- arbitrary file names
- lots of coding work

I guess this means my questions are:

- Are there systems out there that do something like this that I just don't know about. I've been hacking around with PHP and mySQL (like most self respecting b2 users from way back when) for years now, so I'm kind of comfortable with that, but I'm not a really good developer type, so if someone else has done it, I'd be more than happy to use some other software.
- I haven't done any real coding of consequence (even the above hacking about) in at least four years, so are there folks out there that have a better clue about how this might work, I'd love to hear it.
- What kind of additional information do you think would be absolutely crucial to have in the database without going over board. I don't want there to be a lot of overhead on the data entry end of things, and I don't need to recreate Jstor or another index, this is about managing a personal collection, not indexing.
- How are you all staying on top of reference material?

Cheers,
tycho

TOPICS: Projects
Todd V's picture

re: Managing Digital Reference Materials

For the academic side of things I would recommend Bookends X. This is the most mac-savvy database program for managing your database of books and articles. And if most of your PDF's are articles, etc. you can add these as attachments with the attach file button on each entry. That's the way I would go with your academic stuff.

http://www.sonnysoftware.com/

For general digital reference I think you're just going to have to accept that you're going to need to do more data entry on the front end, but not as much as you think. The Finder-like programs like DevonThink, Yojimbo, Journler, etc. provide for ways of linking data that is already in the Finder. Why not just keep things organized in the Finder better? Some batch renaming application or applescript would probably suffice for this.

I designed the Ready-Set-Do! GTD implementation for the mac, so I use that for keeping all of my digital stuff organized. I also happen to be an academic myself currently finishing up my dissertation. Good luck with your further doctoral studies.

 
EXPLORE 43Folders THE GOOD STUFF

Popular
Today

Popular
Classics

An Oblique Strategy:
Honor thy error as a hidden intention


STAY IN THE LOOP:

Subscribe with Google Reader

Subscribe on Netvibes

Add to Technorati Favorites

Subscribe on Pageflakes

Add RSS feed

The Podcast Feed

Cranking

Merlin used to crank. He’s not cranking any more.

This is an essay about family, priorities, and Shakey’s Pizza, and it’s probably the best thing he’s written. »

Scared Shitless

Merlin’s scared. You’re scared. Everybody is scared.

This is the video of Merlin’s keynote at Webstock 2011. The one where he cried. You should watch it. »