Unilingua

Tex Texin tex at xencraft.com
Tue Sep 20 08:50:05 CEST 2005


Web pages have many many clues to their language that do not come from the
contents of the individual page.
tex

Harald Tveit Alvestrand wrote:
> 
> --On lørdag, september 17, 2005 10:43:10 +0100 Debbie Garside
> <debbie at ictmarketing.co.uk> wrote:
> 
> > Tex wrote:
> >
> >> If and when someone gives me a way to review a document and determine the
> >> proper language tag, and we all agree on the right tag, and it doesn't
> >> require three linguists to do the determination, I'll believe we have a
> >> system worth all these refinements.
> >
> > The software is currently under development; a tool that can determine the
> > language used within a document.
> 
> Google's been using such a tool for years; it does not expose its tags, but
> allows you to search for them (for a limited list of languages).
> 
> Works well most of the time.

-- 
-------------------------------------------------------------
Tex Texin   cell: +1 781 789 1898   mailto:Tex at XenCraft.com
Xen Master                          http://www.i18nGuy.com
                         
XenCraft		            http://www.XenCraft.com
Making e-Business Work Around the World
-------------------------------------------------------------


More information about the Ietf-languages mailing list