How do I research obscure file types? [closed]

0 votes
asked Dec 6, 2010 by clweeks

A client has a large document-management system -- millions of TIFFs and PDFs and a few other random files: images and other binaries. I'm converting formats, imprinting notes, reorganizing, and redacting sensitive information when found. And that's all great for the vast bulk of the files.

But I occasionally find a new format and have to figure out what it is and how to handle it within the project's parameters. Usually this isn't too hard, and when it has been, the files were such a small handful that it didn't matter much if I just couldn't handle them. But right now, I have a larger handful of files that don't appear to have a sophisticated header but all start with "COM1.0" (43 4F 4D 31 2E 30).

So, I'd like help on two levels. What's a good way for me to research this (and others I might find in the future -- teach a man to fish, and all) when just Googling around fails me? And if you know what the file type is, I'd be keen to hear about it.

3 Answers

0 votes
answered Dec 6, 2010 by khachik
  1. Google.
  2. If Google fails, it may be something specific to your customer.
0 votes
answered Dec 6, 2010 by steve314

One specialist site is http://www.wotsit.org/ - there may be a few others. These give details when you can already identify the file format, though.

There are some more tips at http://www.garykessler.net/library/file_sigs.html

I did try doing a little searching and didn't turn up anything much, but I didn't try very hard.
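As a rough illustration of how a signature table like the ones linked above gets used, here is a minimal Python sketch that compares a file's leading bytes against a short list of magic numbers. The table is deliberately tiny, the function names are made up for this example, and the "COM1.0" entry is only the prefix reported in the question, not an identified format.

```python
# Minimal magic-number lookup. The table holds a few well-known signatures;
# a real project would load a much longer list.
SIGNATURES = [
    (b"%PDF-", "PDF document"),
    (b"II*\x00", "TIFF image (little-endian)"),
    (b"MM\x00*", "TIFF image (big-endian)"),
    (b"\x89PNG\r\n\x1a\n", "PNG image"),
    (b"\xff\xd8\xff", "JPEG image"),
    (b"PK\x03\x04", "ZIP container (also DOCX, XLSX, ...)"),
    # Assumption: just the prefix from the question, format still unknown.
    (b"COM1.0", "unidentified 'COM1.0' file"),
]


def identify(header: bytes) -> str:
    """Return a label for the first signature that matches the header bytes."""
    for magic, label in SIGNATURES:
        if header.startswith(magic):
            return label
    # No match: show the first bytes in hex so they can be searched for.
    return "unknown: " + header[:8].hex(" ")


def identify_file(path: str, length: int = 16) -> str:
    """Read only the first few bytes of a file and identify them."""
    with open(path, "rb") as f:
        return identify(f.read(length))
```

Running `identify_file` over the whole unknown pile and counting the labels should at least show how many distinct formats are in play, and the hex fallback gives a string to paste into a signature search.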

0 votes
answered Dec 6, 2010 by edwin-buck

Good luck, but remember that not every file format is documented outside of the company that created it, and few companies publish their file formats before they go under.

Depending on how old these files are, the odds of hitting a brick wall are high unless you have a few extra hints to work with (like the name of the program the files are associated with).
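One way to dig for such hints is to pull the printable text out of the files themselves, much like the Unix `strings` tool does; vendor or program names sometimes turn up that way. This is only a sketch: the function name and the minimum length of 4 are arbitrary choices, and the sample bytes in the usage note are invented.

```python
import re


def printable_strings(data: bytes, min_len: int = 4) -> list:
    """Return runs of printable ASCII at least min_len bytes long."""
    # 0x20-0x7e covers space through tilde, i.e. printable ASCII.
    pattern = re.compile(rb"[\x20-\x7e]{%d,}" % min_len)
    return [m.group().decode("ascii") for m in pattern.finditer(data)]


def strings_in_file(path: str, min_len: int = 4) -> list:
    """Convenience wrapper that reads the whole file into memory first."""
    with open(path, "rb") as f:
        return printable_strings(f.read(), min_len)
```

For example, `printable_strings(b"COM1.0\x00\x01Acme Viewer 2.1\x00ab")` returns `['COM1.0', 'Acme Viewer 2.1']` ("Acme Viewer" is a made-up name). Anything that looks like a product name, path, or version string is worth searching for.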
