ASIWG feedback on IDNA200X for Arabic Script

Eric Brunner-Williams ebw at abenaki.wabanaki.net
Sat Jul 12 19:30:53 CEST 2008


Agree. If HIGH HAMZA (0674) is combining in Kazakh ("forms digraphs", a 
textual claim, author unknown) and is a character in Jawi (informed 
user, national standard, etc), that is, is both A and not A, which is 
the controlling property?

Eric

Alireza Saleh wrote:
> I'm also agree with Sarmad, According to the ASIWG meetings we tried to 
> discover the correct context of each characters within the Arabic 
> Script. As we should keep the exception list very small, it would be 
> better to re consider the characters which miss classified and find out 
> a rule in order to get more accurate classification. Patrik, would it be 
> possible for you to summarize the classification procedure you run on 
> the Unicode table to classify the characters ?
>
> Best Regards,
> Alireza
>
>
> Sarmad Hussain wrote:
>   
>> It could be a combination of both.  
>>
>> However, we will need to look at the Unicode character properties to be able
>> to respond.  For example, if Quranic characters are distinctly labelled as
>> such then they can be separated out through the algorithm, but if they are
>> just have the property of combining marks then through the exception list,
>> of course.
>>
>> Let ASIWG look at this and get back to you.  
>>
>> Best regards,
>> Sarmad
>>
>>   
>>     
>>> -----Original Message-----
>>> From: Patrik Fältström [mailto:patrik at frobbit.se]
>>> Sent: Saturday, July 12, 2008 12:21 PM
>>> To: Sarmad Hussain
>>> Cc: idna-update at alvestrand.no; 'ASIWG Coordination'
>>> Subject: Re: ASIWG feedback on IDNA200X for Arabic Script
>>>
>>> On 1 jul 2008, at 18.23, Sarmad Hussain wrote:
>>>
>>>     
>>>       
>>>> Please find a report on Arabic Script IDN Working Group (ASIWG)
>>>> feedback on
>>>> IDNA200X for Arabic Script block at http://crulp.org/asiwg.pdf.
>>>>       
>>>>         
>>> Thanks for the input.
>>>
>>> As the editor of the tables document, I must though ask you what
>>> changes you propose to the document?
>>>
>>> - Adding of all of those codepoints to the exception list?
>>> - Changes to some of the algorithms based on Unicode Properties?
>>> - Both (in some combination)?
>>>
>>>     Patrik
>>>     
>>>       
>> _______________________________________________
>> Idna-update mailing list
>> Idna-update at alvestrand.no
>> http://www.alvestrand.no/mailman/listinfo/idna-update
>>   
>>     
>
> _______________________________________________
> Idna-update mailing list
> Idna-update at alvestrand.no
> http://www.alvestrand.no/mailman/listinfo/idna-update
>
>
>   



More information about the Idna-update mailing list