Question:
I would really appreciate it if anyone could answer some of my
questions.
There's a search engine for jobs at www.indeed.com (surely you've
seen it already) that captured my interest. I was wondering, does that
search engine really crawl all the data from various job posting
sites, or do those sites syndicate the data in some way (RSS or
something similar)? My question might be slightly obscure since I am
not a programmer, but I would really like to know what's happening
behind that kind of a search engine...
Answer:
-The best place to ask is here: http://www.indeed.com/jsp/contactus.jsp
-I think it would be a crawl of the sites, with some hard-coded parsing
tailored to each site in order to extract the right criteria from each
page.
It would probably take a lot of maintenance to keep up with small changes in
layout on the target sites.
-If it works similarly to http://www.alljobsuk.com/, then it will be an
FTP feed.
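
To make the "hard-coded parsing, tailored to each site" idea a bit more concrete, here is a rough Python sketch of what it might look like. This is only a guess at the general approach, not how indeed.com actually works. The host names, the HTML markup, and the names SITE_RULES and extract_job are all made up for illustration.

```python
import re
from urllib.parse import urlparse

# Hypothetical per-site extraction rules: one regex per field, per site.
# Each target site needs its own entry because each lays out pages differently.
SITE_RULES = {
    "jobs.example.com": {
        "title": re.compile(r'<h1 class="job-title">(.*?)</h1>', re.S),
        "location": re.compile(r'<span class="loc">(.*?)</span>', re.S),
    },
    "careers.example.org": {
        "title": re.compile(r'<td id="position">(.*?)</td>', re.S),
        "location": re.compile(r'<td id="city">(.*?)</td>', re.S),
    },
}


def extract_job(url, html):
    """Pick the rule set for the page's host and pull out the job fields."""
    host = urlparse(url).hostname
    rules = SITE_RULES.get(host)
    if rules is None:
        raise KeyError(f"no parser configured for {host}")
    job = {"url": url}
    for field, pattern in rules.items():
        match = pattern.search(html)
        # A field comes back as None when the site's layout no longer
        # matches the rule, which is the maintenance problem noted above.
        job[field] = match.group(1).strip() if match else None
    return job
```

If a site renames a CSS class or moves a field into a different tag, the matching rule silently stops finding it, so somebody has to notice and update that site's entry by hand. That is where the maintenance cost would come from.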