begood, posted April 6, 2010

UrlCrazy generates domain name typo permutations, then tests them to learn whether they are in use, estimates their popularity, and more.

TYPES OF TYPOS SUPPORTED

Character Omission.
These typos are created by leaving out a letter of the domain name, one letter at a time. For example, www.goole.com and www.gogle.com

Adjacent Character Swap.
These typos are created by swapping the order of adjacent letters in the domain name. For example, www.googel.com and www.ogogle.com

Adjacent Character Replacement.
These typos are created by replacing each letter of the domain name with the letters to its immediate left and right on the keyboard. For example, www.googke.com and www.goohle.com

Adjacent Character Insertion.
These typos are created by inserting, next to each letter, the letters to its immediate left and right on the keyboard. For example, www.googhle.com and www.goopgle.com

Missing Dot.
These typos are created by omitting a dot from the domain name. For example, wwwgoogle.com and www.googlecom

Strip Dashes.
These typos are created by omitting a dash from the domain name. For example, www.domain-name.com becomes www.domainname.com

Singular or Pluralise.
These typos are created by making a singular domain plural and vice versa. For example, www.google.com becomes www.googles.com and www.trademe.co.nz becomes www.trademes.co.nz

DOMAIN TESTS

Is the domain valid?
UrlCrazy has a database of valid top-level and second-level domains, compiled from Wikipedia and domain registrars. A domain is considered valid if it matches a known top-level and second-level domain. For example, www.trademe.co.bz is valid because Belize allows any second-level domain registration, but www.trademe.xo.nz isn't, because xo.nz isn't an allowed second-level domain in New Zealand.

Popularity Estimate
We can estimate the relative popularity of a typo by measuring how often that typo has been made on webpages.
Querying cuil.com for the number of search results for a typo gives us an indication of how popular that typo is.

The drawback of this approach is that you need to manually identify and omit legitimate domains such as googles.com

For example, consider the following typos for google.com.

Quantity  Typo
25424     gogle.com
24031     googel.com
22490     gooogle.com
19172     googles.com
19148     goole.com
18855     googl.com
17842     ggoogle.com
16490     googe.com
16367     googgle.com
15029     google.cm
14773     gogole.com
13227     googlle.com
11646     googlee.com
11345     googlr.com
7417      foogle.com
6132      hoogle.com
5313      googlw.com
5208      giogle.com
5151      googke.com
4838      goigle.com
4662      ogogle.com
4630      gopgle.com
4415      goofle.com
4118      wwwgoogle.com
3894      goohle.com
3399      gooigle.com
2675      gfoogle.com
1942      googlecom.com
1534      gopogle.com
1356      googfle.com
1089      googhle.com
892       googlew.com
747       googlke.com
618       goiogle.com
614       goopgle.com
413       ghoogle.com
341       goolge.com
232       googler.com
228       gpogle.com

http://code.google.com/p/urlcrazy/
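
For anyone curious how the typo types described above could be generated, here is a minimal Python sketch. It is not UrlCrazy's own code: every function name is hypothetical, the keyboard is assumed to be plain QWERTY (letter rows only), insertion is assumed to place the neighbouring key after each letter (which matches the examples given), and the plural rule simply toggles a trailing "s". The functions work on the bare name (e.g. "google"), except drop_each, which takes the full domain.

```python
# Illustrative sketch only: these helper names are hypothetical, not UrlCrazy's API.

# Letter rows of a QWERTY keyboard, left to right (assumed layout).
QWERTY_ROWS = ["qwertyuiop", "asdfghjkl", "zxcvbnm"]


def neighbours(ch):
    """Return the keys immediately left and right of ch on its row."""
    for row in QWERTY_ROWS:
        i = row.find(ch)
        if i != -1:
            return [row[j] for j in (i - 1, i + 1) if 0 <= j < len(row)]
    return []  # digits, dashes, etc. have no neighbours in this sketch


def unique(items):
    """Drop duplicates while keeping first-seen order."""
    return list(dict.fromkeys(items))


def character_omission(name):
    # Leave out one letter at a time.
    return unique(name[:i] + name[i + 1:] for i in range(len(name)))


def adjacent_swap(name):
    # Swap each pair of adjacent letters; skip pairs of identical letters.
    return unique(
        name[:i] + name[i + 1] + name[i] + name[i + 2:]
        for i in range(len(name) - 1)
        if name[i] != name[i + 1]
    )


def adjacent_replacement(name):
    # Replace each letter with its left and right keyboard neighbours.
    return unique(
        name[:i] + n + name[i + 1:]
        for i, ch in enumerate(name)
        for n in neighbours(ch)
    )


def adjacent_insertion(name):
    # Assumption: the neighbouring key is inserted after the letter.
    return unique(
        name[:i + 1] + n + name[i + 1:]
        for i, ch in enumerate(name)
        for n in neighbours(ch)
    )


def drop_each(domain, char):
    """Remove one occurrence of char at a time (missing dot, strip dashes)."""
    return unique(
        domain[:i] + domain[i + 1:]
        for i, c in enumerate(domain)
        if c == char
    )


def singular_or_plural(name):
    # Naive rule assumed here: toggle a trailing "s".
    return name[:-1] if name.endswith("s") else name + "s"


if __name__ == "__main__":
    for typo in adjacent_swap("google"):
        print("www." + typo + ".com")
```

Calling drop_each("www.google.com", ".") covers the Missing Dot case and drop_each("www.domain-name.com", "-") covers Strip Dashes. Checking whether each candidate is a valid or registered domain, and estimating its popularity, would be separate steps that this sketch does not attempt.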