





Code: Select all
'catlinks' class
"visualClear"
"mw-head" class
"p-personal" class
"pt-login"
"left-navigation"
"p-namespaces" class
"ca-special" class
"p-variants" class
"#"
"menu"
"right-navigation"
"p-views" class
"p-cactions" class
"#"
"menu"
"p-search"
"searchInput"
"/w/index.php" id
'hidden' name
"simpleSearch"
"searchInput" name
"searchButton" type
"mw-panel" class
"p-logo"
"portal" id
"body"
"n-mainpage-description"
"n-aboutsite"
"n-topics"
"n-alphindex"
"n-randompage"
"portal" id
"body"
"n-help"
"n-portal"
"n-recentchanges"
"n-contact"
"n-sitesupport"
"portal" id
"body"
"t-specialpages"
"footer"
"footer-places"
"footer-places-privacy"
"footer-places-about"
"footer-places-disclaimer"
"footer-places-mobileview"
"footer-icons" class
"footer-copyrightico"
"http://wikimediafoundation.org/"
"footer-poweredbyico"
"http://www.mediawiki.org/"
"clear:both"
"text/javascript"
"/w/index.php?title
"http://bits.wikimedia.org/de.wikipedia.org/load.php?debug
"text/javascript"
"text/javascript" src
"text/javascript"
Code: Select all
|grep "a href"|
Code: Select all
curl "http://de.wikipedia.org/w/index.php?title=Spezial:Alle_Seiten" | grep 'href' | awk -F= '{ print $2 }' | awk -f\> '{ print $1 }'
awk: can't open file { print $1 }
source line number 1 source file { print $1 }
context is
>>> <<<
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 54049 100 54049 0 0 61831 0 --:--:-- --:--:-- --:--:-- 67900
Code: Select all
elinks -dump -no-numbering "http://de.wikipedia.org/w/index.php?title=Spezial:Alle_Seiten" | grep '^[ \t.]*http' | sed 's/^[ .\t]*//'Code: Select all
lynx -dump "$url" | grep -e " .[0-9+]\.\ "| cut -d "." -f 2-
Nabend.72_6f_6c_61_6e_64 wrote: Was muss man sed als Parameter anhängen, dass er alles von "<a href" bis ">" (also der a-tag zu) ausgibt?
Code: Select all
curl -s "$URL" | sed -ne 's/.*href="\([^["]*\)".*/\1/pg'Code: Select all
curl -s "$URL" | sed -ne 's/.*<a href="\(http:\/\/[^["]*\)".*/\1/pg' 
Code: Select all
curl -s "http://de.wikipedia.org/w/index.php?title=Spezial:Alle_Seiten" | sed -e 's/href/\nhref/g' | sed -ne 's/.*href="\([^["]*\)".*/\1/pg'