Indexing PrintTopic


https://forum.reallusion.com/Topic366181.aspx
Print Topic | Close Window

By R Ham - 7 Years Ago
Admin,
You may not be aware that Google is indexing (as an example)
https://forum.reallusion.com/PrintTopic349797.aspx

I think you would prefer that Google indexed
https://forum.reallusion.com/Topic349797.aspx

Perhaps you wish to tell Google to stop that. :)
Or not.

By Kelleytoons - 7 Years Ago
Interesting you mention this -- it has pissed me off more than once that a Google search invariably brings up the print topic.
By rollasoc - 7 Years Ago
Annoys me too...
By R Ham - 7 Years Ago
rollasoc (4/22/2018)
Annoys me too...

It can be easily fixed.
By toystorylab - 7 Years Ago
I also had this several times...
By R Ham - 7 Years Ago
toystorylab (4/24/2018)
I also had this several times...

The webmaster controls which files may be indexed by the bot and which may not. If this weren't true, all your vulnerable configuration files would be indexed on Google. It would be a security disaster. :Whistling:
By R Ham - 7 Years Ago
I see there's been no improvement in this condition. I have to say, this reflects poorly on the webmaster.
By 4u2ges - 7 Years Ago
Rottenham (5/6/2018)
I see there's been no improvement in this condition. I have to say, this reflects poorly on the webmaster.


I would be surprised if there is any webmaster around. Forum has number of problem pointed out hundreds of times. I am not going to repeat as it is really pointless.
By animagic - 7 Years Ago
A search reveals many PrintTopic entries on Google, not just from the RL forum. So it seems to be a "feature" of Google's indexing bot.
By 4u2ges - 7 Years Ago
Printing and/or caching pages is the feature of many forums. At the very top of every thread, here at this forum, if you click "options", it would drop-down 3 links and "Print this Topic" is one of them. 
That is what getting crawled and indexed by bots. Everything started with "Print" should be excluded from indexing. 
On the other hand, users information cannot be reached by any bot, since it is stored in database and only retrieved, when client starts a session by supplying ID and password.
By R Ham - 7 Years Ago
animagic (5/7/2018)
A search reveals many PrintTopic entries on Google, not just from the RL forum. So it seems to be a "feature" of Google's indexing bot.


Blocking robot access to certain directories and file types is an industry standard practice. The primary means for this is the robots.txt file. Compliance with the standard is the norm, although it is voluntary.  Robots.txt is an ASCII text file located in the website root.  It can be lengthy, but it is typically short and simple. Here is a brief blurb on this.  You don't need to be an NSA hacker to view or edit this file. There is nothing mysterious about it.

This is the robots.txt file for this forum. Notice that only the Alexabot robot is disallowed from indexing PrintTopic. All other robots are free to do as they wish. Notice also that no asterisks are used in the Alexabot PrintTopic exclusion. It might be more effective if it was written as described in the above Moz blurb.
:doze:
By Peter (RL) - 7 Years Ago
Rottenham (4/22/2018)
Admin,
You may not be aware that Google is indexing (as an example)
https://forum.reallusion.com/PrintTopic349797.aspx

I think you would prefer that Google indexed
https://forum.reallusion.com/Topic349797.aspx

Perhaps you wish to tell Google to stop that. :)
Or not.



Thanks for the feedback. This has been passed on to our web team.