I have a script (v11.1) that trolls through a Websphere-developed site
and extracts data from a HTML table. The script has been running fine
for about a year but in the last 24 hours has started failing with
this error:
Access Violation at xxx in module ntdll.dll. Read of address
yyy.
I have also tried it on a different machine (both Windows XP SP3) with the
latest version v11.1.22 and found the same behaviour. I'm guessing
something has changed in the HTML input that is causing the script to barf.
Using the debugger, the error seems to occur when using IE_ExtractTag. I have wrapped this function in a subroutine as follows, where
Get_Cell_Text_Var_1 is the cell number I wish to capture:
Code: Select all
SRT>Get_Cell_Text
IE_ExtractTag>%IE[0]%,,TD,Get_Cell_Text_Var_1,0,cell_text,r
END>Get_Cell_Text
below. The first and second TD text values are captured correctly.
The attempt to capture the third cell actually returns the value from
the second cell. The call to get the fourth cell results in the
access violation.
Interestingly, I've also looked at the HTML table with webrecorder and12/03/10 09:00:00
131091
Flyer - 1Pp Or 2Pp
CLK3 offset colour between 1,000 and 10,000
the tag extraction tool, and they seem to identify the TD values
unambiguously.
Any ideas?
Thanks.