Jump to content

Wikipedia talk:Lists of common misspellings/Repetitions

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia

Source

[edit]

I would suggest that the program that scanned for these probably included some non-main space pages. Rich Farmbrough 22:23 26 April 2006 (UTC).

Duplicates

[edit]

Having done rather alot to hunt down duplicate words and phrases in Wikipedia, I have to conclude that the sections Most frequent duplicates and Most frequent duplicate word pairs are not at all helpful. This seems to be because the statistics were generateed in the first place by ignoring punctuation leading to a very high proportion of false positive and, they are in any case out-of-date.

I propose that these sections be deleted. Gaius Cornelius 19:37, 19 November 2006 (UTC)[reply]

Even without ignoring punctuation, half the things on that list are clearly intentional duplications. The Beatles song Please Please Me is obviously responsible for the #1 hit on the list. Duran Duran is certainly not an error, or Walla Walla. And I'm assuming that Talk pages were included in this search, which would explain "blah", "giggidy", "yada", and many others. I'd say repeat the search, but don't ignore punctuation, and do ignore Talk pages, where informal language (including intentional duplication) is to be expected. I'm guessing that the punctuation parameter will siginificantly reduce occurances of yo-yo and AT-AT, at the very least. Lurlock 14:46, 11 April 2007 (UTC)[reply]
I've been experimenting with changing these plain text listings into searchable links to match the alphabetic pages. While they do result in a large number of false positives, the first few that I've done also turn up a lot of actual errors. They also seem to ignore Talk pages. The longer the word in question, the better the results (e.g., "about about" turns up more real errors than "a a"). I'll start editing the page and people can try them out. Feel free to ignore or revert if you don't agree. Also, someone who has more experience with the search parameters can try to force the links to ignore/include punctuation as appropriate. JimVC3 (talk) 18:51, 30 October 2008 (UTC)[reply]
Control F and going through results by 500's is your friend. Just put a space before the two words and after them, keep pressing next, and it makes it a lot easier. :3 Glacialfox (talk) 20:38, 26 November 2011 (UTC)[reply]

Most frequent duplicate word pairs: A game

[edit]

Random comment: It's fun trying to realise the reason for the duplication (other than the obvious vandalism and plain ol' mistakes), try it. For example: gorilla is most likely because the subspecies of (I believe? May be a different one) Mountain Gorilla's scientific name is gorilla gorilla gorilla. Hurray hurray for taxonomy. KimiNewt (talk) 21:55, 28 June 2009 (UTC)[reply]