| originals@gmail.com 2006-02-01, 6:56 pm |
| I need to split an archive of a discussion forum saved as one huge txt
file (400k words) into individual txt files--one per message.
Posts are stamped with a date and time, messages can be of any length.
Posters are sometimes address by their time (as it was an anon forum)
but the full time/date stamp is always unique to the start of a
message.
New to perl but have installed activeperl and can run a .pl script from
the command line.
If anyone could provide a script for this job, I'd really appreciate
it.
Someone has very kindly provided a script for me on c.l.p.m. but it
doesn't seem to be working. If anyone could fix it for me that would be
great--I've tried but can't work it out. eg why the tr operator to
replace a pace with a "_"?
Google Groups url of clpm thread: http://tinyurl.com/c8j8c
cheers.
05.11.01 10:01 AM
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxx
05.11.01 10:41 AM
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxx
05.12.01 10:50 PM
10:01, xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxx
|