seo, marketing, advertising, adsense

Google Now Crawling Forms and More

Google recently announced that they can now crawl data submitted via forms.

The other part that I found interesting is that they apparently can “scan” javascript and Flash to find links. Previously, spiders were not able to crawl javascript, but this “scanning” now allows for data retrieval. That’s big news! All those links that I thought were useless because they were displayed via javascript may count for something now!

Google is constantly trying new ideas to improve our coverage of the web. We already do some pretty smart things like scanning JavaScript and Flash to discover links to new web pages, and today, we would like to talk about another new technology we’ve started experimenting with recently.

In the past few months we have been exploring some HTML forms to try to discover new web pages and URLs that we otherwise couldn’t find and index for users who search on Google. Specifically, when we encounter a

element on a high-quality site, we might choose to do a small number of queries using the form. For text boxes, our computers automatically choose words from the site that has the form; for select menus, check boxes, and radio buttons on the form, we choose from among the values of the HTML. Having chosen the values for each input, we generate and then try to crawl URLs that correspond to a possible query a user may have made. If we ascertain that the web page resulting from our query is valid, interesting, and includes content not in our index, we may include it in our index much as we would include any other web page.

  • Share/Bookmark

Discussion Area - Leave a Comment