HTML Logo by World Wide Web Consortium (www.w3.org). Click to learn more about our commitment to accessibility and standards.

Moving forward with Composr

ocPortal has been relaunched as Composr CMS, which is now in beta. ocPortal 9 will be superseded by Composr 10.

Head over to compo.sr for our new site, and to our migration roadmap. Existing ocPortal member accounts have been mirrored.


Difference between the number of URLs submitted in XML sitemap and the number indexed by Google

Login / Search

 [ Join | More ]
 Add topic 
Posted
Rating:
#75901 (In Topic #15687)
Avatar

Community saint

This thread is just meant to see if there is any kind of pattern between the number of URLs submitted by ocp_sitemap.xml and the number included in Google's index. I have been going through my sitemap and eliminating some URLs (some were bugs, most were permissions not being set properly). In the process of whittling down the number of URLs submitted, I've noticed that the number indexed by Google is always about 50 fewer than what is submitted in the XML sitemap. Has anyone else noticed a similar correlation. Do other people have roughly 50 fewer URLs included in Google's index than what they submitted (based on the information Google's Webmaster Tools for sitemaps?

Google doesn't provide any tools for identifying which items are indexed and which are not unfortunately but I wonder if this discrepancy is more or less consistent for others using the ocP-generated sitemap.

Thanks for any input.

Bob

Back to the top
 
Posted
Rating:
#75917
Avatar

I just tested/confirmed that you can test to see if a URL is indexed via Googling "site:<url>".


Become a fan of ocPortal on Facebook or add me as a friend. Add me on on Twitter.
Was I helpful?
  • If not, please let us know how we can do better (please try and propose any bigger ideas in such a way that they are fundable and scalable).
  • If so, please let others know about ocPortal whenever you see the opportunity.
  • If my reply is too Vulcan or expressed too much in business-strategy terms, and not particularly personal, I apologise. As a company & project maintainer, time is very limited to me, so usually when I write a reply I try and make it generic advice to all readers. I'm also naturally a joined-up thinker, so I always express my thoughts in combined business and technical terms. I recognise not everyone likes that, don't let my Vulcan-thinking stop you enjoying ocPortal on fun personal projects.
  • If my response can inspire a community tutorial, that's a great way of giving back to the project as a user.
Back to the top
 
Posted
Rating:
#75920
Avatar

Community saint

Yeah, I did this last night in regards to my other post. But it's quite pain to go through the complete XML sitemap to check.

I was just wondering it other people observe the same pattern in which case it makes i likely the the ocP-generated sitemap includes URLs useless to Google.

Bob
Back to the top
 
Posted
Rating:
#75921
Avatar

Google make it clear they don't index all URLs, bt yes it would be interesting to know which ones it didn't.


Become a fan of ocPortal on Facebook or add me as a friend. Add me on on Twitter.
Was I helpful?
  • If not, please let us know how we can do better (please try and propose any bigger ideas in such a way that they are fundable and scalable).
  • If so, please let others know about ocPortal whenever you see the opportunity.
  • If my reply is too Vulcan or expressed too much in business-strategy terms, and not particularly personal, I apologise. As a company & project maintainer, time is very limited to me, so usually when I write a reply I try and make it generic advice to all readers. I'm also naturally a joined-up thinker, so I always express my thoughts in combined business and technical terms. I recognise not everyone likes that, don't let my Vulcan-thinking stop you enjoying ocPortal on fun personal projects.
  • If my response can inspire a community tutorial, that's a great way of giving back to the project as a user.
Back to the top
 
Posted
Rating:
#75924
Avatar

Community saint

Yes, I know that they don't index all URLs from a site but in my previous experiences with some other software, they did index all URLs submitted in the XML sitemap. This issue may well be limited to my site but it would be interesting to see if others observe the same behavior. then it will just take a bit of work to determine with URLs are not included and why.

Bob
Back to the top
 
Posted
Rating:
#76147
Avatar

Community saint

I was looking through the sitemap tonight and checking on Google and see the number of non-indexed sitemap-submitted URLs is down to 40.

 I wonder why they don't make it easy to see which of your sitemap's URLs is indexed/not-indexed in their Webmaster tools? That would be so handy to know if there is a problem with a URL in the sitemap.

Off to the Google forums I head.  :)

Bob
Back to the top
 
1 guests and 0 members have just viewed this: None
Control functions:

Quick reply   Contract

Your name:
Your message: