Posted to taglibs-user@tomcat.apache.org by "Chen, Gin" <Gi...@tvratings.com> on 2002/06/13 22:49:24 UTC

Proxy Scrape

Hi,
	I'm having a problem with scrapes and proxies.

	I'm using Tomcat 4.0.1 with the newest Scrape and ORO taglibs.

	My code is below (sensitive info changed, of course):


<%@ taglib uri="http://jakarta.apache.org/taglibs/scrape-1.0" prefix="scrp" %>
<html>
<head>
	<title>Untitled</title>
<scrp:page url="http://fifaworldcup.yahoo.com/en/t/s/g2.html" time="20"
    proxyServer="www-proxy.nielsenmedia.com"
    proxyPort="80" proxyName="chengt" proxyPass="tenletters">
   
   <scrp:scrape id="qt" begin="<style type=text/css>" end="</style>"
anchors="true"/>
   
   <scrp:scrape id="qt1" begin="<table width=\"577\" border=\"0\"
cellspacing=\"0\" cellpadding=\"0\">"
    end="<td colspan=\"3\"><img
src=\"http://us.i1.yimg.com/us.yimg.com/i/fifa/gen/o.gif\" width=\"1\"
height=\"5\"></td>
</tr>
</table>" anchors="true"/>
</scrp:page>

<scrp:result scrape="qt"/>
</head>

<body>
<img src="http://us.i1.yimg.com/us.yimg.com/i/fifa/gen/ban/tr1.gif">
<scrp:result scrape="qt1"/>

</body>
</html>

But I am not getting anything for the second scrape. What am I doing wrong?
Also, if I reload the page after making a change, I get a message saying
that qt or qt1 has already been used. How do I get rid of that?
Lastly, how do I search for HTML that spans multiple lines, like the second
scrape above? I can't find where the docs cover that case.

Thanks,
-Tim
