Friday, January 13, 2012

Getting the HTML source code of an .swf URI


I was wondering whether one could get the HTML source code from an .swf URI?



For instance a web page such as:



http://media.flixfacts.com/360view/acer_uk/002/acer_uk-002-en.swf



When I use curl to scrape this page it brings back the swf source not the html source.



Any ideas?

2 comments:

  1. I'm not really sure what's confusing/unexpected - you're not scraping an HTML file, you're scraping a SWF file (which is a binary format, so you're likely getting a bunch of gibberish).

    Bottom Line: There is no HTML in a SWF file.

    ReplyDelete
  2. SWF is the extension of Flash files. They don't have HTML. They're an utterly different proprietary format. The only thing that will display Flash files is Flash Player, or some programs can extract the resources (images, sounds, and so on) inside them.

    ReplyDelete