You are viewing a plain text version of this content. The canonical link for it is here.
Posted to oro-user@jakarta.apache.org by Yun Fang <su...@hotmail.com> on 2001/09/29 08:44:48 UTC

How to write regular express to remove the html tag content?

Hi,All

This is my first to post mail.I try remove the content in tag"<head>" and 
"</head>"  of the following html document.

=======================================================================

<html><head><meta http-equiv="Content-Type" content="text/html; 
charset=gb2312"><title>¿ª½±ÐÅÏ¢</title><STYLE type=text/css>A:link {COLOR: 
#4f8dec; TEXT-DECORATION: none}A:visited {COLOR: #4f8dec; TEXT-DECORATION: 
none}A:active {COLOR: #4f8dec; TEXT-DECORATION: none}A:hover {COLOR: 
#ff0000; TEXT-DECORATION: none}.font {FONT-SIZE: 10pt; LINE-HEIGHT: 10pt}A 
{TEXT-DECORATION: none; TEXT-TRANSFORM: none}A:hover {TEXT-DECORATION: 
nones}BODY {FONT-FAMILY: ËÎÌå; FONT-SIZE: 10pt}TD {FONT-FAMILY: ËÎÌå; 
FONT-SIZE: 10pt}.c2 {FONT-FAMILY: ËÎÌå; FONT-SIZE: 
10.5pt}</STYLE></head><body bgcolor="#F5F5F5" text="#000000" 
background="../imgs/bkg.gif" align=right><p 
align='center'><b>2001Äê09ÔÂ27ÈÕ£¨ÐÇÆÚËÄ£© 
µçÄÔÌåÓý²ÊƱ¿ª½±Çé¿ö</b></p>£ £ ½ñÌìÈ«¹ú¹²ÓÐ 13 
¸öÊ¡ÊеĵçÄÔÌåÓý²ÊƱ¿ª½±£¬ÏúÊÛ¶î×Ü¼Æ 2416.1282 ÍòÔª£¬²úÉú´ó½± 4 
×¢£¬ËÄ»¨Ñ¡4²úÉúÒ»µÈ½± 1 ×¢£¬¿ª½±Ê¡ÊÐΪ£º Ìì½òÊС¢ ½­ËÕÊ¡¡¢ Õã½­Ê¡¡¢ °²»ÕÊ¡¡¢ 
¸£½¨Ê¡¡¢ ½­Î÷Ê¡¡¢ ɽ¶«Ê¡¡¢ ºþ±±Ê¡¡¢ ¹ã¶«Ê¡¡¢ º£ÄÏÊ¡¡¢ ±±¾©ÊУ¨ËÄ»¨Ñ¡4£©¡¢ 
Ìì½òÊУ¨ËÄ»¨Ñ¡4£©¡¢ ºþÄÏÊ¡£¨ËÄ»¨Ñ¡4£©¡¢ ËÄ´¨Ê¡£¨ËÄ»¨Ñ¡4£© 
£¬¾ßÌ忪½±Çé¿öÈçÏ£º<hr><p align='right'><br>µ¥Î»£¨Ôª£©<br><table border='2' 
cellspacing='1' cellpadding='1'  align='center'><tr bgcolor='#F7CD77' 
align='center'><td rowspan='2'>Ê¡ÊÐ</td><td rowspan='2'>Íæ·¨</td><td 
rowspan='2'>ÆÚºÅ</td><td rowspan='2'>ÏúÊÛ¶î</td><td 
rowspan='2'>Öн±ºÅÂë</td><td colspan='2'>ÌصȽ±</td><td 
colspan='2'>Ò»µÈ½±</td><td colspan='2'>¶þµÈ½±</td><td 
colspan='2'>ÈýµÈ½±</td><td colspan='2'>ËĵȽ±</td><td 
colspan='2'>ÎåµÈ½±</td><td colspan='2'>ÁùµÈ½±</td><td  
rowspan='2'>ÏÂÆÚ´ó½±Ô¤²â</td><tr bgcolor='#F7CD77' 
align='center'><td>×¢Êý</td><td>ÿע½±½ð</td><td>×¢Êý</td><td>ÿע½±½ð</td><td>×¢Êý</td><td>ÿע½±½ð</td><td>×¢Êý</td><td>ÿע½±½ð</td><td>×¢Êý</td><td>ÿע½±½ð</td><td>×¢Êý</td><td>ÿע½±½ð</td><td>×¢Êý</td><td>ÿע½±½ð</td></tr><tr 
 ><td align='center'>Ìì½òÊÐ</td><td align='center'>29Ñ¡7</td><td 
align='center'>01028     </td><td align='right'>265058</td><td 
align='center'>01&nbsp;03&nbsp;04&nbsp;07&nbsp;18&nbsp;21&nbsp;25+02</td><td 
align='right'>0</td><td align='right'>0</td><td align='right'>0</td><td 
align='right'>0</td><td align='right'>10</td><td align='right'>1566</td><td 
align='right'>21</td><td align='right'>200</td><td align='right'>399</td><td 
align='right'>50</td><td align='right'>375</td><td align='right'>10</td><td 
align='right'>4198</td><td align='right'>5</td><td 
align='right'>380000</td></tr><tr bgcolor='#F7CD77'><td 
align='center'>½­ËÕÊ¡</td><td align='center'>29Ñ¡7</td><td 
align='center'>01309     </td><td align='right'>3560000</td><td 
align='center'>05&nbsp;08&nbsp;11&nbsp;13&nbsp;14&nbsp;15&nbsp;24+25</td><td 
align='right'>3</td><td align='right'>666666</td><td 
align='right'>42</td><td align='right'>9200</td><td 
align='right'>273</td><td align='right'>1000</td><td 
align='right'>1526</td><td align='right'>300</td><td 
align='right'>7122</td><td align='right'>50</td><td 
align='right'>17476</td><td align='right'>20</td><td 
align='right'>147951</td><td align='right'>5</td><td 
align='right'>2000000</td></tr><tr ><td align='center'>Õã½­Ê¡</td><td 
align='center'>29Ñ¡7</td><td align='center'>01245     </td><td 
align='right'>1431150</td><td 
align='center'>03&nbsp;09&nbsp;17&nbsp;20&nbsp;23&nbsp;25&nbsp;26+12</td><td 
align='right'>0</td><td align='right'>0</td><td align='right'>3</td><td 
align='right'>12105</td><td align='right'>86</td><td 
align='right'>844</td><td align='right'>240</td><td 
align='right'>200</td><td align='right'>2312</td><td 
align='right'>50</td><td align='right'>4220</td><td align='right'>10</td><td 
align='right'>23596</td><td align='right'>5</td><td 
align='right'>600000</td></tr><tr bgcolor='#F7CD77'><td 
align='center'>°²»ÕÊ¡</td><td align='center'>30Ñ¡7</td><td 
align='center'>01304     </td><td align='right'>1269550</td><td 
align='center'>01&nbsp;04&nbsp;10&nbsp;21&nbsp;24&nbsp;26&nbsp;29+12</td><td 
align='right'>0</td><td align='right'>0</td><td align='right'>0</td><td 
align='right'>0</td><td align='right'>20</td><td align='right'>3029</td><td 
align='right'>86</td><td align='right'>300</td><td align='right'>865</td><td 
align='right'>20</td><td align='right'>1857</td><td align='right'>10</td><td 
align='right'>28756</td><td align='right'>5</td><td 
align='right'>1750000</td></tr><tr ><td align='center'>¸£½¨Ê¡</td><td 
align='center'>36Ñ¡7</td><td align='center'>01260     </td><td 
align='right'>6844492</td><td 
align='center'>01&nbsp;08&nbsp;09&nbsp;12&nbsp;17&nbsp;26&nbsp;27+21</td><td 
align='right'>1</td><td align='right'>4773733</td><td 
align='right'>2</td><td align='right'>151885</td><td 
align='right'>125</td><td align='right'>2126</td><td 
align='right'>309</td><td align='right'>500</td><td 
align='right'>5561</td><td align='right'>50</td><td 
align='right'>6761</td><td align='right'>20</td><td 
align='right'>125095</td><td align='right'>6</td><td 
align='right'>3600000</td></tr><tr bgcolor='#F7CD77'><td 
align='center'>½­Î÷Ê¡</td><td align='center'>33Ñ¡7</td><td 
align='center'>01206     </td><td align='right'>958692</td><td 
align='center'>02&nbsp;04&nbsp;06&nbsp;20&nbsp;23&nbsp;32&nbsp;24</td><td 
align='right'>0</td><td align='right'>0</td><td align='right'>2</td><td 
align='right'>25122</td><td align='right'>14</td><td 
align='right'>500</td><td align='right'>129</td><td align='right'>50</td><td 
align='right'>544</td><td align='right'>30</td><td 
align='right'>8287</td><td align='right'>5</td><td 
align='right'>27000</td><td align='right'>2</td><td 
align='right'>2800000</td></tr><tr ><td align='center'>ºþ±±Ê¡</td><td 
align='center'>29Ñ¡7</td><td align='center'>01219     </td><td 
align='right'>1924424</td><td 
align='center'>01&nbsp;08&nbsp;15&nbsp;22&nbsp;26&nbsp;28&nbsp;29+20</td><td 
align='right'>0</td><td align='right'>0</td><td align='right'>1</td><td 
align='right'>43680</td><td align='right'>83</td><td 
align='right'>1052</td><td align='right'>171</td><td 
align='right'>200</td><td align='right'>2215</td><td 
align='right'>50</td><td align='right'>4366</td><td align='right'>10</td><td 
align='right'>59662</td><td align='right'>5</td><td 
align='right'>3850000</td></tr><tr bgcolor='#F7CD77'><td 
align='center'>¹ã¶«Ê¡</td><td align='center'>36Ñ¡7</td><td 
align='center'>01089     </td><td align='right'>4977144</td><td 
align='center'>03&nbsp;19&nbsp;21&nbsp;27&nbsp;31&nbsp;32&nbsp;35+24</td><td 
align='right'>0</td><td align='right'>0</td><td align='right'>1</td><td 
align='right'>105052</td><td align='right'>82</td><td 
align='right'>1601</td><td align='right'>167</td><td 
align='right'>600</td><td align='right'>3547</td><td 
align='right'>40</td><td align='right'>3941</td><td align='right'>20</td><td 
align='right'>80474</td><td align='right'>10</td><td 
align='right'>&nbsp;</td></tr><tr ><td align='center'>º£ÄÏÊ¡</td><td 
align='center'>29Ñ¡7</td><td align='center'>01229     </td><td 
align='right'>45220</td><td 
align='center'>08&nbsp;09&nbsp;13&nbsp;15&nbsp;21&nbsp;27&nbsp;28</td><td 
align='right'>£ </td><td align='right'>£ </td><td align='right'>0</td><td 
align='right'>0</td><td align='right'>1</td><td align='right'>2802</td><td 
align='right'>49</td><td align='right'>60</td><td align='right'>792</td><td 
align='right'>6</td><td align='right'>£ </td><td align='right'>£ </td><td 
align='right'>£ </td><td align='right'>£ </td><td 
align='right'>500000</td></tr></table><p 
align='right'><br>µ¥Î»£¨Ôª£©<br><table border='2' cellspacing='1' 
cellpadding='1'  align='center'><tr bgcolor='#F7CD77' align='center'><td 
rowspan='2'>Ê¡ÊÐ</td><td rowspan='2'>Íæ·¨</td><td rowspan='2'>ÆÚºÅ</td><td 
rowspan='2'>ÏúÊÛ¶î</td><td rowspan='2'>Öн±ºÅÂë</td><td 
colspan='2'>Ò»µÈ½±</td><td colspan='2'>¶þµÈ½±</td><td 
colspan='2'>ÈýµÈ½±</td><td colspan='2'>ËĵȽ±</td><td 
colspan='2'>ÎåµÈ½±</td><td colspan='2'>ÁùµÈ½±</td><td 
colspan='2'>ÆߵȽ±</td><td  rowspan='2'>ÏÂÆÚ´ó½±Ô¤²â</td><tr 
bgcolor='#F7CD77' 
align='center'><td>×¢Êý</td><td>ÿע½±½ð</td><td>×¢Êý</td><td>ÿע½±½ð</td><td>×¢Êý</td><td>ÿע½±½ð</td><td>×¢Êý</td><td>ÿע½±½ð</td><td>×¢Êý</td><td>ÿע½±½ð</td><td>×¢Êý</td><td>ÿע½±½ð</td><td>×¢Êý</td><td>ÿע½±½ð</td></tr><tr 
 ><td align='center'>ɽ¶«Ê¡</td><td align='center'>35Ñ¡7</td><td 
align='center'>01218     </td><td align='right'>1470872</td><td 
align='center'>04&nbsp;06&nbsp;12&nbsp;17&nbsp;21&nbsp;29&nbsp;34+08</td><td 
align='right'>0</td><td align='right'>0</td><td align='right'>2</td><td 
align='right'>26517</td><td align='right'>24</td><td 
align='right'>3314</td><td align='right'>49</td><td 
align='right'>300</td><td align='right'>643</td><td align='right'>50</td><td 
align='right'>1276</td><td align='right'>20</td><td 
align='right'>20658</td><td align='right'>5</td><td 
align='right'>5000000</td></tr></table><p 
align='right'><br>µ¥Î»£¨Ôª£©<br><table border='2' cellspacing='1' 
cellpadding='1'  align='center'><tr bgcolor='#F7CD77' align='center'><td 
rowspan='2'>Ê¡ÊÐ</td><td rowspan='2'>Íæ·¨</td><td rowspan='2'>ÆÚºÅ</td><td 
rowspan='2'>ÏúÊÛ¶î</td><td rowspan='2'>Öн±ºÅÂë</td><td 
colspan='2'>ÐÒÔ˽±</td><td colspan='2'>Ò»µÈ½±</td><td 
colspan='2'>¶þµÈ½±</td><td colspan='2'>ÈýµÈ½±</td><td  
rowspan='2'>ÏÂÆÚ´ó½±Ô¤²â</td><tr bgcolor='#F7CD77' 
align='center'><td>×¢Êý</td><td>ÿע½±½ð</td><td>×¢Êý</td><td>ÿע½±½ð</td><td>×¢Êý</td><td>ÿע½±½ð</td><td>×¢Êý</td><td>ÿע½±½ð</td></tr><tr 
 ><td align='center'>±±¾©ÊÐ</td><td align='center'>ËÄ»¨Ñ¡4</td><td 
align='center'>01051     </td><td align='right'>503100</td><td 
align='center'>2 &nbsp;A &nbsp;9 &nbsp;Q </td><td align='right'>0</td><td 
align='right'>0</td><td align='right'>3</td><td align='right'>33389</td><td 
align='right'>321</td><td align='right'>100</td><td 
align='right'>6394</td><td align='right'>10</td><td 
align='right'>&nbsp;</td></tr><tr bgcolor='#F7CD77'><td 
align='center'>ºþÄÏÊ¡</td><td align='center'>ËÄ»¨Ñ¡4</td><td 
align='center'>01408     </td><td align='right'>85468</td><td 
align='center'>4 &nbsp;Q &nbsp;6 &nbsp;J </td><td align='right'>0</td><td 
align='right'>0</td><td align='right'>1</td><td align='right'>28679</td><td 
align='right'>60</td><td align='right'>100</td><td 
align='right'>1098</td><td align='right'>5</td><td 
align='right'>&nbsp;</td></tr><tr ><td align='center'>ËÄ´¨Ê¡</td><td 
align='center'>ËÄ»¨Ñ¡4</td><td align='center'>01056     </td><td 
align='right'>607816</td><td align='center'>8 &nbsp;J &nbsp;4 &nbsp;9 
</td><td align='right'>£ </td><td align='right'>£ </td><td 
align='right'>9</td><td align='right'>16281</td><td 
align='right'>424</td><td align='right'>100</td><td 
align='right'>8407</td><td align='right'>5</td><td 
align='right'>290000</td></tr></table><p 
align='right'><br>µ¥Î»£¨Ôª£©<br><table border='2' cellspacing='1' 
cellpadding='1'  align='center'><tr bgcolor='#F7CD77' align='center'><td 
rowspan='2'>Ê¡ÊÐ</td><td rowspan='2'>Íæ·¨</td><td rowspan='2'>ÆÚºÅ</td><td 
rowspan='2'>ÏúÊÛ¶î</td><td rowspan='2'>Öн±ºÅÂë</td><td 
colspan='2'>Ò»µÈ½±</td><td colspan='2'>¶þµÈ½±</td><td 
colspan='2'>ÈýµÈ½±</td><td  rowspan='2'>ÏÂÆÚ´ó½±Ô¤²â</td><tr 
bgcolor='#F7CD77' 
align='center'><td>×¢Êý</td><td>ÿע½±½ð</td><td>×¢Êý</td><td>ÿע½±½ð</td><td>×¢Êý</td><td>ÿע½±½ð</td></tr><tr 
 ><td align='center'>Ìì½òÊÐ</td><td align='center'>ËÄ»¨Ñ¡4</td><td 
align='center'>01041     </td><td align='right'>218296</td><td 
align='center'>4 &nbsp;6 &nbsp;3 &nbsp;4 </td><td align='right'>1</td><td 
align='right'>47652</td><td align='right'>215</td><td 
align='right'>100</td><td align='right'>3563</td><td 
align='right'>10</td><td 
align='right'>&nbsp;</td></tr></table></body></html>

===========================================================

I select the awk compiler of demo.html , and write the pattern 
"<head>\w+</head>"
but nothing result return!! What's error happen?

Please help.

Best regard!



_________________________________________________________________
Get your FREE download of MSN Explorer at http://explorer.msn.com/intl.asp