For Programmers: Free Programming Magazines  


Home > Archive > AWK > April 2006 > How to extract hrefs and their locations from html









You are viewing an archived Text-only version of the thread. To view this thread in it's original format and/or if you want to reply to this thread please [click here]

 

Author How to extract hrefs and their locations from html
runnig

2006-04-05, 7:56 am

Could you help me please?
I want to extract all the html references from an ordinary html file
using awk and put them into another file

Juergen Kahrs

2006-04-05, 7:56 am

runnig wrote:

> I want to extract all the html references from an ordinary html file
> using awk and put them into another file


If you use XHTML, then all the XML tools can be used.
You were asking for an awk solution. Use Google to
search for some scripts that can handel tagged data.
If this isnt good enough, have a look at this one:

http://sourceforge.net/projects/xmlgawk/
http://home.vrweb.de/~juergen.kahrs/gawk/XML/
runnig

2006-04-05, 7:56 am

thanks, xmlgawk looks pretty good

Sponsored Links







Also available: Server administration forum archive | Web Design forum archive | Software forum archive | Hardware reviews archive

Copyright 2008 codecomments.com