For Programmers: Free Programming Magazines  


Home > Archive > PERL Beginners > February 2006 > Splitting msg board archive into individual files









You are viewing an archived Text-only version of the thread. To view this thread in it's original format and/or if you want to reply to this thread please [click here]

 

Author Splitting msg board archive into individual files
originals@gmail.com

2006-02-01, 6:56 pm

I need to split an archive of a discussion forum saved as one huge txt
file (400k words) into individual txt files--one per message.

Posts are stamped with a date and time, messages can be of any length.
Posters are sometimes address by their time (as it was an anon forum)
but the full time/date stamp is always unique to the start of a
message.

New to perl but have installed activeperl and can run a .pl script from
the command line.

If anyone could provide a script for this job, I'd really appreciate
it.

Someone has very kindly provided a script for me on c.l.p.m. but it
doesn't seem to be working. If anyone could fix it for me that would be
great--I've tried but can't work it out. eg why the tr operator to
replace a pace with a "_"?

Google Groups url of clpm thread: http://tinyurl.com/c8j8c

cheers.

05.11.01 10:01 AM

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxx

05.11.01 10:41 AM

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxx

05.12.01 10:50 PM

10:01, xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxx

usenet@DavidFilmer.com

2006-02-01, 6:56 pm

originals@gmail.com wrote:

> Someone has very kindly provided a script for me on c.l.p.m.


I may have answered your question there in a recent reply

> Google Groups url of clpm thread: http://tinyurl.com/c8j8c


--
http://DavidFilmer.com

Sponsored Links







Also available: Server administration forum archive | Web Design forum archive | Software forum archive | Hardware reviews archive

Copyright 2008 codecomments.com