bbPress.org

Opened 13 years ago

Closed 13 years ago

#1396 closed defect (bug) (wontfix)

Duplicate content and canonical url

Reported by: dudd Owned by:
Milestone: 2.0 Priority: high
Severity: major Version:
Component: General - Administration Keywords:
Cc: nightgunner5@…

Description

The "topic" and "reply" pages produce duplicate content: two URLs serve the same content.

http://localhost/wp/forums/topic/my-topic
http://localhost/wp/forums/reply/reply-to-my-topic/

The canonical URL should be http://localhost/wp/forums/topic/my-topic
and it should be included in the head of the "reply" pages.

But if we look at the canonical tag on the reply page, we have:
<link rel='canonical' href='http://localhost/wp/forums/reply/reply-to-my-topic/' />

This is wrong; it should be:
<link rel='canonical' href='http://localhost/wp/forums/topic/my-topic/' />
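
The requested behaviour can be sketched as follows. This is only an illustration in Python, not bbPress code: the function name `canonical_link` and its parameters are hypothetical, and it assumes the caller already knows the URL of the reply's parent topic.

```python
from html import escape
from typing import Optional


def canonical_link(page_type: str, page_url: str, topic_url: Optional[str] = None) -> str:
    """Build a canonical <link> tag for a forum page.

    Hypothetical rule taken from this report: a reply page points at its
    parent topic, every other page points at itself.
    """
    if page_type == "reply" and topic_url:
        target = topic_url
    else:
        target = page_url
    # Escape the URL so it is safe inside a quoted HTML attribute.
    return "<link rel='canonical' href='{}' />".format(escape(target, quote=True))


print(canonical_link(
    "reply",
    "http://localhost/wp/forums/reply/reply-to-my-topic/",
    "http://localhost/wp/forums/topic/my-topic/",
))
```

With the two URLs from this report, the call prints the second tag shown above, the one pointing at the topic.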

Google's documentation says:
"A canonical page is the preferred version of a set of pages with highly similar content."

http://www.google.com/support/webmasters/bin/answer.py?hl=en&answer=139394

Change History (2)

#1 @paulgregory
13 years ago

(With apologies for sticking my oar in without actually checking what the code does at the present moment...)

I dispute that a Reply page is "highly similar" to a Topic page, although obviously it does include some duplicate content.

I believe the Reply page has the correct canonical URL; it's just not one that we want to be indexed. The suggested canonical change stops it being indexed separately; I believe it should not be indexed AT ALL.

I therefore strongly suggest that bbPress instead simply include the relevant noindex meta tag on all Reply pages. A benefit of this approach is that there is no need to look up the URL of the Topic page, so it's probably easier and quicker.
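
A rough sketch of that alternative, again in Python rather than actual bbPress code. The helper name `robots_meta` is hypothetical; the only assumption is that the page type is known when the head is rendered.

```python
def robots_meta(page_type: str) -> str:
    """Return a noindex robots meta tag for reply pages, else an empty string."""
    # No topic lookup is needed: the decision depends only on the page type.
    if page_type == "reply":
        return "<meta name='robots' content='noindex' />"
    return ""


print(robots_meta("reply"))
```

Topic pages get no extra tag, so they stay indexable as before.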

#2 @Nightgunner5
13 years ago

  • Cc nightgunner5@… added
  • Resolution set to wontfix
  • Status changed from new to closed

The page for the reply contains only the reply itself. The topic page contains every reply in the topic as well as the reply form. I don't think this counts as duplicate content, as it could be compared to the front page of a blog containing multiple posts. Is each post on a blog duplicate content?
