Web Scraping Login Required
Mike Labedz

2007-08-30, 8:11 am

I'm trying to scrape my playlist off of Yahoo Music and store it.

I've read http://wiki.tcl.tk/2915 and understand the retrieval/parsing
side of a web page. What I'm missing is whether this is possible for
a site that requires a login. If so, does anyone have any examples or
recommended packages to look at?

Thank you in advance

schlenk@uni-oldenburg.de

2007-08-30, 8:11 am


Mike Labedz wrote:
> I'm trying to scrape my play list off of Yahoo Music and store it.
>
> I've read http://wiki.tcl.tk/2915 and understand the retrieval/parsing
> side of a web page. What I'm missing is whether this is possible for
> a site that requires a login? If so does anyone have any examples or
> recommended packages to look at.
>
> Thank you in advance

Usually this uses HTTP authentication. http://wiki.tcl.tk/6087 is
relevant here, but you might need some cookie support too, and if the
site relies heavily on JavaScript to handle login (Ajax style) you
might be left with a mess like scripting IE or Firefox to do the
download.
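
A minimal sketch of the HTTP authentication plus cookie approach, assuming
Tcl 8.6 and the bundled http package. This is not the code from the wiki
page; the proc names (basicAuthHeader, cookiesFromMeta, fetchWithLogin) and
the URLs in the usage note are made up for illustration, and whether a given
site accepts Basic authentication at all is an assumption you would have to
check.

```tcl
package require Tcl 8.6
package require http

# Build the value of an Authorization header for HTTP Basic authentication.
# (Hypothetical helper; "binary encode base64" needs Tcl 8.6.)
proc basicAuthHeader {user password} {
    set raw [encoding convertto utf-8 "$user:$password"]
    return "Basic [binary encode base64 $raw]"
}

# Turn the Set-Cookie headers of a response into one Cookie header value.
# meta is the flat name/value list returned by http::meta; a header that
# occurs several times appears as several pairs.
proc cookiesFromMeta {meta} {
    set cookies {}
    foreach {name value} $meta {
        if {[string equal -nocase $name "Set-Cookie"]} {
            # Keep only "name=value"; drop attributes such as Path or Expires.
            lappend cookies [string trim [lindex [split $value ";"] 0]]
        }
    }
    return [join $cookies "; "]
}

# Request loginUrl with credentials, then request pageUrl while sending back
# whatever cookies the first response set. Returns the body of the second page.
# (Hypothetical helper; no redirect handling or error checking.)
proc fetchWithLogin {loginUrl pageUrl user password} {
    set headers [list Authorization [basicAuthHeader $user $password]]

    set tok [http::geturl $loginUrl -headers $headers]
    set cookies [cookiesFromMeta [http::meta $tok]]
    http::cleanup $tok

    if {$cookies ne ""} {
        lappend headers Cookie $cookies
    }
    set tok [http::geturl $pageUrl -headers $headers]
    set body [http::data $tok]
    http::cleanup $tok
    return $body
}
```

Usage would look something like
fetchWithLogin http://example.com/login http://example.com/playlist me secret
with placeholder URLs. For https URLs you would also need the tls package
and http::register https 443 ::tls::socket. If the site logs you in through
an HTML form rather than HTTP authentication, the cookie step stays the same
but the credentials would go in a POST body (the -query option of
http::geturl) instead of the Authorization header.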

Michael

Cameron Laird

2007-08-30, 8:36 pm

In article <1188476317.240151.144430@d55g2000hsg.googlegroups.com>,
<schlenk@uni-oldenburg.de> wrote:
>
>Mike Labedz wrote:
>Usually this uses http authentication. http://wiki.tcl.tk/6087 is
>relevant here, but you might need some cookie support too and if the
>site uses vast amounts of javascript to handle login (ajax style) you
>might be left with a mess like scripting IE or firefox to do the
>download.
