Author Posts

December 16, 2013 at 8:41 am

I'm trying to scrape a page. It seems the page doesn't work well unless one uses cookies, so I decided to try to use the IE COM object.

Things seem to be fine, but then I try to use a regex to pull out information, and it always seems to be blank. I've also tried [string] instead of tostring().

Anything obvious that I would be missing when trying to use the regex below?

$ie=new-object -com internetexplorer.application
$ie.visible=$true
$ie.navigate("http://forums.udacity.com/tags/ud617/#ud617")
[regex]::matches('$ie.document.body.innerhtml.tostring()','http://forums.udacity.com/users/\d+/(\w+)')

December 16, 2013 at 9:04 am

Solved: Don't use quotes in the first element of the matches method.

[regex]::matches($ie.document.body.innerhtml.tostring(),'http://forums.udacity.com/users/\d+/(\w+)')