Yesterday I checked one of my sites in Google Webmaster Tools. It showed warnings for many pages of the site besides the home page. It looked like Google had found each page twice: once with the normal URL (ending with a slash) and once without the trailing slash. Google treated these as separate pages and concluded that I have duplicate content on the site. That is pretty bad, because it lowers page rank.

Interestingly, all links on the site end with a slash. There are no links without a trailing slash at all. However, if such a URL is requested, RealURL will recognize it and return the proper page (a duplicate, as far as Google is concerned). This is caused by the "appendMissingSlash" RealURL configuration option. So if someone links to a page but omits the slash at the end, they lower your page rank. I have to repeat: this is not the fault of you, me, or RealURL. It is caused by incorrect links on external sites.

I thought about a cure and implemented a new option in RealURL. "appendMissingSlash" now also accepts the "redirect" keyword. With it, RealURL redirects to the proper address (with the trailing slash). It is also possible to specify the redirect status code ("redirect[301]"), but RealURL uses 301 by default.
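
Here is a minimal sketch of how this might look in a RealURL configuration. The '_DEFAULT' host key and the placement of the option under 'init' are assumptions based on the usual RealURL configuration layout and are not stated in this post. Only the "redirect" and "redirect[301]" values come from the text above.

```php
<?php
// Sketch only: the '_DEFAULT' host key and the 'init' section are assumed;
// adjust them to match your own RealURL configuration.

// Redirect requests that lack the trailing slash (301 is the default code).
$TYPO3_CONF_VARS['EXTCONF']['realurl']['_DEFAULT']['init']['appendMissingSlash'] = 'redirect';

// Alternative: state the redirect status code explicitly.
// $TYPO3_CONF_VARS['EXTCONF']['realurl']['_DEFAULT']['init']['appendMissingSlash'] = 'redirect[301]';
```

With a setting like this, a request for a page URL without the trailing slash should be answered with a redirect to the same URL with the slash, so search engines see only one address per page.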

This version is not yet in TER. I am testing it on one site along with a bugfix for absRefPrefix (#9412). I just thought I would announce it a bit early. In the meantime, you can check your own sites in Google Webmaster Tools.

P.S. No, I cannot send you a T3X with this version. You can always check out the latest development version from TYPO3 SVN if you wish, but I am not responsible for the risk you take by doing so.

13 Comments

  1. on Wednesday, 24-09-08 09:27 Thomas
    Hello Dimitry,
    would the redirect be possible for this case?

    www.domain.com/mysite - redir -> www.domain.com/mysite.html

    and

    www.domain.com/mysite/ - redir -> www.domain.com/mysite.html

    This config also causes dup. content.

    Thanks.
    Regards, Thomas

  2. on Wednesday, 24-09-08 09:28 Thomas
    Hello Dimitry,
    would the redirect be possible for this case?

    www.domain.com/mysite - redir -> www.domain.com/mysite.html

    and

    www.domain.com/mysite/ - redir -> www.domain.com/mysite.html

    This config also causes dup. content.

    Thanks.
    Regards, Thomas
  3. on Tuesday, 30-09-08 09:50 Thomas
    Sorry for the duplicate content. ;) It was unintentional, but there was no feedback after submitting the form.
  4. on Sunday, 12-10-08 21:58 Dmitry Dulepov
    I did not consider .html in this case... If this does not work, please, write a bug report!
  5. on Sunday, 12-10-08 22:10 Dmitry Dulepov
    The problem with "no feedback" is due to ve_guestbook: it does not create page URLs properly :( I fixed it locally.
  6. on Wednesday, 15-10-08 18:28 Steffen Müller
    I remember we had some triple content issues with pages using 'defaultToHTMLsuffixOnPrev' => '.html' for filename:

    /pagename.html
    /pagename/
    /pagename

    all of the above deliver the same content.

    I will test this tomorrow and open a bug if not yet done.
  7. on Thursday, 16-10-08 21:05 Steffen Müller
    Here comes the bugreport:
    http://bugs.typo3.org/view.php?id=9582
  8. on Monday, 20-10-08 09:45 Michael Stucki
    Hi Dmitry,

    I'm just wondering: Why should one not want to have the (obviously) duplicate URLs being redirected?
    In other words: Please remove the new option again and make it the default behaviour if "appendMissingSlash" is set.

    - michael
  9. on Monday, 20-10-08 10:31 Dmitry Dulepov
    If option is there, it stays. Otherwise compatibility will be broken. I will think about defaults, this idea is good.
  10. on Monday, 20-10-08 13:11 Michael Stucki
    Consider it a "buggy feature". What do users want? Do they want two versions of the same page, or do they want that the page also works without the trailing slash?

    I think the latter is the case. Therefore nothing gets broken. You still solve the same problem, but with a different (a better) approach.

    - michael
  11. on Tuesday, 21-10-08 11:20 Andreas Becker
    I don't see any problems there.
    If you want the site to be reachable with and without a slash, just forward one of them with a 301 redirect. With this, Google won't penalize you, because you send the right header information.

    kind regards,
    andy
  12. on Tuesday, 06-01-09 15:51 Andy Meier
    Setting 'appendMissingSlash' => 'redirect' breaks my normal redirect configuration!

    $TYPO3_CONF_VARS['EXTCONF']['realurl']['www.domain.tld']['redirects'] = array('mypage.htm'=>'/de/todo/');

    www.domain.tld/mypage.htm will become www.domain.tld/mypage.htm/

    andy
