(For Rick Mendes at Intuit)
I am indexing data with the crawler for one of my projects. One of the sets of data comes from a desktop application that generates a sitemap for their help content. The sitemap produces URL they term relative, but I call them pseudo relative because they don't include the leading slash. Here is a snippet of the sitemap file: <url><loc>text/en/quicktax/worksheets/qt_ws_medical.html</loc></url> When I crawl the site hosting this sitemap, the crawler does not index any of the documents. The documents are on the site. Is there any way to configure it so it will not ignore these URLs.