URLs to "standard" test files ?
Walter Dnes (delete the 'z' to get my real address)

2004-05-12, 9:28 pm

I've got a couple of compression-optimization ideas I want to test
out. Are there any standard test files ? I've been following the group
for a few days, and I've seen occasional allusions to such files, but
not explicit URLs. I'm looking for the following...

- raw image files of 1, 2, and 3 bytes (i.e. 8/16/24 bits) per pixel
- similar to the above, except files with indexes mapping the colours,
in which case I'll also need to know what the index/map structure is
- English text files
- and just for laughs, the infamous "million-digit-file"

If you don't hear back from me, you can assume that my ideas came to
naught.

--
Walter Dnes; my email address is *ALMOST* like wzaltdnes@waltdnes.org
Delete the "z" to get my real address. If that gets blocked, follow
the instructions at the end of the 550 message.

Davis King

2004-05-12, 9:28 pm

Try here.
http://corpus.canterbury.ac.nz/

Walter Dnes (delete the 'z' to get my real address) wrote:
> I've got a couple of compression-optimization ideas I want to test
> out. Are there any standard test files ? I've been following the group
> for a few days, and I've seen occasional allusions to such files, but
> not explicit URLs. I'm looking for the following...
>
> - raw image files of 1, 2, and 3 bytes (i.e. 8/16/24 bits) per pixel
> - similar to the above, except files with indexes mapping the colours,
> in which case I'll also need to know what the index/map structure is
> - English text files
> - and just for laughs, the infamous "million-digit-file"
>
> If you don't hear back from me, you can assume that my ideas came to
> naught.
>


Matt Mahoney

2004-05-12, 9:28 pm


"Davis King" <kingd@cis.ohio-state.edu> wrote in message
news:c6r14i$fn1$1@news.cis.ohio-state.edu...
> Try here.
> http://corpus.canterbury.ac.nz/


Not many people use the Canterbury corpus for benchmarks. The Calgary
corpus, though older, is still used. I have links to several benchmarks on
my page at http://cs.fit.edu/~mmahoney/compression/ (scroll down about 1/3).
Not all of these are actively maintained. ACT hasn't been updated since
2002, and Canterbury since 2000. These benchmarks are for general purpose
lossless compression, not video or audio (where lossy compression is usually
used).

The million digits file and more benchmarks can be found at
datacompression.info

-- Matt Mahoney
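
Editorial sketch, not part of any post above: one possible way to run a general-purpose lossless comparison over a downloaded corpus such as Calgary or Canterbury. It assumes the corpus has been unpacked into a local directory, and it uses Python's standard zlib, bz2, and lzma modules only as stand-ins for whatever compressor is actually being tested. The function names (compressed_sizes, benchmark, report) are hypothetical and chosen for illustration.

```python
import bz2
import lzma
import zlib
from pathlib import Path

# Standard-library codecs used as stand-ins for the compressor under test.
CODECS = {
    "zlib": lambda data: zlib.compress(data, 9),
    "bz2": lambda data: bz2.compress(data, 9),
    "lzma": lambda data: lzma.compress(data),
}


def compressed_sizes(data: bytes) -> dict:
    """Return the compressed size in bytes for each codec."""
    return {name: len(fn(data)) for name, fn in CODECS.items()}


def benchmark(corpus_dir: str) -> dict:
    """Map each file name in corpus_dir to (original size, sizes per codec)."""
    results = {}
    for path in sorted(Path(corpus_dir).iterdir()):
        if path.is_file():
            data = path.read_bytes()
            results[path.name] = (len(data), compressed_sizes(data))
    return results


def report(results: dict) -> None:
    """Print one line per file: name, original size, size under each codec."""
    for name, (original, sizes) in results.items():
        cols = "  ".join(f"{codec}={size}" for codec, size in sizes.items())
        print(f"{name:<16} {original:>10}  {cols}")
```

With the corpus unpacked into a directory (the name "calgary" here is only an assumption), calling report(benchmark("calgary")) would print a per-file size table. Adding a new idea to the comparison would mean adding one more entry to CODECS.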


Eduardo

2004-05-12, 9:28 pm

"Walter Dnes (delete the 'z' to get my real address)" <wzaltdnes@waltdnes.org> wrote in message news:<c6qno2$eu5sl$3@ID-146822.news.uni-berlin.de>...
> I've got a couple of compression-optimization ideas I want to test
> out. Are there any standard test files ? I've been following the group
> for a few days, and I've seen occasional allusions to such files, but
> not explicit URLs. I'm looking for the following...
>
> - raw image files of 1, 2, and 3 bytes (i.e. 8/16/24 bits) per pixel
> - similar to the above, except files with indexes mapping the colours,
> in which case I'll also need to know what the index/map structure is
> - English text files
> - and just for laughs, the infamous "million-digit-file"
>
> If you don't hear back from me, you can assume that my ideas came to
> naught.


For image and video, try http://thanglong.ece.jhu.edu/~cjtu/link.html
Project Gutenberg also has English text files.

hth