Why not use the same decode way for the LocalPart as the way for the Domain Name in the IDN

Discussion:

dongxiaoli

2004-04-15 06:23:28 UTC

Hello!

Why not we use the same decode way for the LocalPart as the way for the Domain Name in the IDN.
For example in this way my name 董晓荔 will became xn--1jvp95dgxa,and now I have got some Free mail used this name. One of these is xn--***@263.net.
Then I can use my Email address in Chinese Laguage: 董晓荔@263.net
Thanks..

With best regards, Xiaoli Dong. E-mail: ***@cnnic.cn

Paul Hoffman / IMC

2004-04-15 14:27:59 UTC

Permalink

Post by dongxiaoli
Why not we use the same decode way for the LocalPart as the way
for the Domain Name in the IDN.

Because you can't *encode* many legitimate email addresses using
IDNA. IDNA prohibits all ASCII characters other than LDH, so mailbox
names like "josé.muñoz" could not be encoded. That is why we rejected
the idea as non-workable from the very beginning.

--Paul Hoffman, Director
--Internet Mail Consortium

Adam M. Costello

2004-04-18 03:43:39 UTC

Permalink

Post by dongxiaoli
Why not we use the same decode way for the LocalPart as the way for
the Domain Name in the IDN. For example in this way my name ??????
will became xn--1jvp95dgxa, and now I have got some Free mail used
Because you can't *encode* many legitimate email addresses using IDNA.
IDNA prohibits all ASCII characters other than LDH, so mailbox names
like "josé.muñoz" could not be encoded.

That's true if UseSTD3ASCIIRules is set, but if we wanted to reuse
IDNA for local parts, we could specify that UseSTD3ASCIIRules is
never set for local parts, in which case ToASCII("josé.muñoz") =
"xn--jos.muoz-d1a7f".

I haven't thought about the various issues for a while, but at the
moment the only thing I can think of that prevents us from reusing IDNA
ToASCII for local parts is the problem with structured local parts. For
example, both humans and computers commonly add and remove prefixes
and suffixes to local parts, like owner- and +tag. The result will
be different depending on whether you are working with the ASCII form
(which is likely if "you" are legacy software) or the Unicode form
(which is likely if "you" are a human).

Xiaoli, have you seen the IMAA draft (draft-hofmann-imaa-03.txt)? It
addresses this problem by applying Punycode (and half of Nameprep)
independently to segments of the local part. It's a different ToASCII
operation, but it reuses the same Punycode and Nameprep algorithms that
IDNA uses.

AMC

dongxiaoli

2004-04-21 01:33:07 UTC

Permalink

Hello,Hoffman ,Adam!

OK,IC, thanks a lot.

With best regards, Xiaoli Dong. E-mail: ***@cnnic.cn

dongxiaoli

2004-06-04 09:15:50 UTC

Permalink

Hello everyone.

Have anybody can tell me WHY we need do "*Divide the sequence
into segments*" in the ToASCII process as described in the
IMAA draft (draft-hofmann-imaa-03.txt)??

And another question: "The "*protected code points" are 0..40,
5B..60, 7B..7F (in other words, those corresponding to ASCII characters
other than letters and digits). *" is from draft-hofmann-imaa-03.txt.
But the code points of digits range between 0...40,so why said the
*digits is not protected code points*?

thanks..

Xiaoli Dong

CNNIC

John C Klensin

2004-06-04 13:01:23 UTC

Permalink

--On Friday, 04 June, 2004 17:15 +0800 dongxiaoli

Post by dongxiaoli
Hello everyone.
Have anybody can tell me WHY we need do "*Divide the
sequence into segments*" in the ToASCII process as described
in the
IMAA draft (draft-hofmann-imaa-03.txt)??

In many circumstances, there is a good deal of information that
is already encoded into email local-parts. The most common of
these, these days, are subaddresses and internal routing codes,
but external routing and address component identification, such
as those used for X.400 addresses and fax-like attribute
identification when those are mapped to the Internet, have been
common (and probably still are) in some communities.

In many of those cases, the presence of the encoding and its
components must be recognized by the delivery MTA or, at the
boundary of an outright violation of the standards (however
necessary) by organizational-boundary relays such as
SMTP-handling firewalls. If the clues that the codings are
present are hidden by the i18n encoding, we gain
internationalization at the cost of the features supported by
that information-encoding.

We have no standard for any of those information encodings --
they have, so far, been protected by an extremely strong rule
that no MTA before the final delivery one is permitted to
interpret the contents of a local-part in any way. All
previous systems and MTAs, including the originating user and
MUA, are required to treat the entire local-part address an an
atomic and opaque string.

The segments and delimiters of draft-hoffman-imaa-03 are
intended to alleviate the problem with hiding these encodings
while providing internationalization for the various address
components.

This issue was the original philosophical departure point
between draft-hoffman-imaa and draft-klensin-emailaddr-i18n.
The latter takes a somewhat stronger and more protective view of
the local-part addresses, noting particularly that the approach
in draft-hoffman-imaa will not protect either information in
local-parts that is length-encoded (e.g., "the address contains
a 'project distribution' component beyond the mailbox name for
the recipient; the last three characters are the project
identifier, the other characters are the mailbox name") or
information that is separated by non-ASCII delimiters. For
example, were you to want to use subaddresses with a mailbox
whose name was in Chinese, the rules of draft-hoffman-imaa would
require that you separate the mailbox and subaddress elements
with an ASCII delimiter, not one that might appear more
appropriate for Chinese, if the classification and action were
to be done by an MTA that did not, itself, run ToUnicode.
There is some additional discussion on this issue in
draft-klensin-emailaddr-i18n-02.txt.

Post by dongxiaoli
And another question: "The "*protected code points"
are 0..40, 5B..60, 7B..7F (in other words, those corresponding
to ASCII characters other than letters and digits). *" is from
draft-hofmann-imaa-03.txt. But the code points of digits
range between 0...40,so why said the *digits is not protected
code points*?

Paul or Adam will need to answer this one.

best,
john

J-F C. (Jefsey) Morfin

2004-06-07 13:16:48 UTC

Permalink

Dear John,
since you were one of the most reasonable actors of the WG-IDN and that the
LHS will face the same real world oppositions as the RHS, I would be
interested in your comments about the current MINC positions over the
Arabic/Poland situation. This is a direct consequence of the concept of
"internationalization" I fought as inapropriate (IMO) to the real world and
of the demands from the ICANN governance.

NB. If you do not have it, I have a copy of the letter as a Member of the
MINC, but I do not know if it is online. I can copy it to those interested
but I suppose netiquette opposes that I put it myself on line?

A part from the impossible situation where it puts ICANN which has to chose
between a non existing Arabic moral authority and now Europe, through a
choice involving three parties described as at war (real life, real world)
and no one understand right now the political impact, my concern is that
RHS "only" involves 260 TLDs while LHS involves millions of domain holders.

1. you have not come with a satisfactory way to MLize RHS against my own
patch to intlnzation. I see now that the MINC letter opposes your global
understanding of the problem - please read carefully the part about ".xxx".

2. this confirms my opinon that an internationlization of the LHS cannot
work and that multinationlization must use another upper layer - so not to
conflict with the current situation but to complete it.

Don't you think that time has come now to start a clean sheat study of the
real life e-intergovernance and to deduct from it what the constraints on
the datacoms architecture?

jfc

Post by John C Klensin
--On Friday, 04 June, 2004 17:15 +0800 dongxiaoli

Post by dongxiaoli
Hello everyone.
Have anybody can tell me WHY we need do "*Divide the
sequence into segments*" in the ToASCII process as described
in the
IMAA draft (draft-hofmann-imaa-03.txt)??

In many circumstances, there is a good deal of information that
is already encoded into email local-parts. The most common of
these, these days, are subaddresses and internal routing codes,
but external routing and address component identification, such
as those used for X.400 addresses and fax-like attribute
identification when those are mapped to the Internet, have been
common (and probably still are) in some communities.
In many of those cases, the presence of the encoding and its
components must be recognized by the delivery MTA or, at the
boundary of an outright violation of the standards (however
necessary) by organizational-boundary relays such as
SMTP-handling firewalls. If the clues that the codings are
present are hidden by the i18n encoding, we gain
internationalization at the cost of the features supported by
that information-encoding.
We have no standard for any of those information encodings --
they have, so far, been protected by an extremely strong rule
that no MTA before the final delivery one is permitted to
interpret the contents of a local-part in any way. All
previous systems and MTAs, including the originating user and
MUA, are required to treat the entire local-part address an an
atomic and opaque string.
The segments and delimiters of draft-hoffman-imaa-03 are
intended to alleviate the problem with hiding these encodings
while providing internationalization for the various address
components.
This issue was the original philosophical departure point
between draft-hoffman-imaa and draft-klensin-emailaddr-i18n.
The latter takes a somewhat stronger and more protective view of
the local-part addresses, noting particularly that the approach
in draft-hoffman-imaa will not protect either information in
local-parts that is length-encoded (e.g., "the address contains
a 'project distribution' component beyond the mailbox name for
the recipient; the last three characters are the project
identifier, the other characters are the mailbox name") or
information that is separated by non-ASCII delimiters. For
example, were you to want to use subaddresses with a mailbox
whose name was in Chinese, the rules of draft-hoffman-imaa would
require that you separate the mailbox and subaddress elements
with an ASCII delimiter, not one that might appear more
appropriate for Chinese, if the classification and action were
to be done by an MTA that did not, itself, run ToUnicode.
There is some additional discussion on this issue in
draft-klensin-emailaddr-i18n-02.txt.

Paul or Adam will need to answer this one.
best,
john

John C Klensin

2004-06-07 14:51:08 UTC

Permalink

--On Monday, 07 June, 2004 15:16 +0200 "J-F C. (Jefsey) Morfin"

Post by J-F C. (Jefsey) Morfin
Dear John,
since you were one of the most reasonable actors of the WG-IDN
and that the LHS will face the same real world oppositions as
the RHS, I would be interested in your comments about the
current MINC positions over the Arabic/Poland situation. This
is a direct consequence of the concept of
"internationalization" I fought as inapropriate (IMO) to the
real world and of the demands from the ICANN governance.
NB. If you do not have it, I have a copy of the letter as a
Member of the MINC, but I do not know if it is online. I can
copy it to those interested but I suppose netiquette opposes
that I put it myself on line?

Since it was addressed primarily to ICANN and the ICANN Board, I
have seen a copy of it. I suspect most readers of the IMAA list
have not. Given the quite inflamed and inflammatory content/
tone of the letter, I think it would be inappropriate for me to
attempt to summarize it, so, while I want to respond to the
portion of this that can easily be explained out of context,
further discussion should almost certainly be taken elsewhere
(and the old IETF IDN mailing list isn't an appropriate place
either).

To review a situation which I hope everyone here understands,
when the IDN standards were produced, the IDN WG, and thence the
IETF, made a quite explicit decision to specify mappings and
codings but to become as little involved as possible with the
specification of usage, which characters and character
combinations should actually be used, etc. The IESG reinforced
the nature of this decision by issuing a "statement" along with
the standards that might be described as "this is what we have
done, but you don't have a complete picture until you do some
other things, and you should be careful about them".

Following that extremely broad guideline, ICANN adopted a
guideline/ recommendation that strongly suggested that IDN
registrations occur only "by language". This was consistent
with extended discussions on the IDN list (while the standards
were under development) about the advantages of "one label / one
language" (or "one script") strategies in eliminating a number
of opportunities for problems, but the IDN WG did not make a
recommendation on that subject (for or against). ICANN also
created, as a convenience, a registry for the tables of
characters permitted by given DNS registries. E.g., if the DNS
ccTLD registry for Lower Slobbovia wanted to permit
language-based registrations in Klingon, and wanted to make
their "Klingon script" definition public and available to
others, they could register their Klingon table (including
character variants if they wanted to go that route) with IANA.
Of course, to do that, they would first have to convince the
Unicode Consortium and ISO to define and create standard code
points for Klingon.

Post by J-F C. (Jefsey) Morfin
From the standpoint of this list, or any other IETF-related

list, there is one important message from the above: with one
very small exception, none of the about involves the IETF in any
way. It is up to the registries what characters they support
and what to call them. The registry of tables is an ICANN-IANA
registry, not an IETF-IANA registry. Even if the IETF were
involved in it (which it is not, in any way) the registry is a
"pure" registry, with no decision-making processes: any TLD
administrator can submit any list of character they want, under
any name, to be registered. The small exception applies to
Klingon, which makes a good example, but is probably not a major
concern for languages and scripts used by contemporary humans
who use the Internet: IDNA is firmly tied to Unicode 3.2 and,
regardless of what UTC might do about Klingons and Klingon in
the future, it would require some IETF action (and presumably
expansion of Stringprep) to deal with a later, expanded, version
of Unicode.

Now the situation and letter to which you refer involved one in
which a ccTLD registry filed a table of characters with IANA
that they associated with a language name. The language is not
an official or majority language in that country. Someone who
is, I think, a native speaker of that language and who has roots
in a country where it is the majority and official language,
took serious exception to the registration, especially since no
registration for the language or script had occurred by anyone
he was willing to recognize as authoritative. Again, not a set
of IETF issues in any way. Not even an ICANN issue unless ICANN
is willing to go into the business of "world cultural authority"
figuring out what bodies should be able to decide, for everyone
else in the world, what characters are or are not in a given
script and for what purposes they may be used. And ICANN isn't
nearly that stupid, nor is any other rational international
organization I know of, governmental, treaty, NGO, or completely
private.

This particular letter raises a number of other issues about
cultures, international politics, and other topics, but they are
even further from anything related to IETF (or even ICANN) work.

And, in my opinion, none of it suggests any reason for an
IETF-related discussion on this subject, nor does it justify
your conclusions below. One thing it certainly does do is to
illustrate the fact that people get extremely sensitive about
their languages and how they are used. But we all know that
already, and it was one of the things that drove IETF to stay
very far away from those issues. And, even more certainly, it
has nothing to do with the rather specific technical question
which was asked and which Adam and I answered (I think and hope
consistently, although in different terms).

john

---------------

Post by J-F C. (Jefsey) Morfin
A part from the impossible situation where it puts ICANN which
has to chose between a non existing Arabic moral authority and
now Europe, through a choice involving three parties described
as at war (real life, real world) and no one understand right
now the political impact, my concern is that RHS "only"
involves 260 TLDs while LHS involves millions of domain
holders.
1. you have not come with a satisfactory way to MLize RHS
against my own patch to intlnzation. I see now that the MINC
letter opposes your global understanding of the problem -
please read carefully the part about ".xxx".
2. this confirms my opinon that an internationlization of the
LHS cannot work and that multinationlization must use another
upper layer - so not to conflict with the current situation
but to complete it.
Don't you think that time has come now to start a clean sheat
study of the real life e-intergovernance and to deduct from it
what the constraints on the datacoms architecture?
jfc

Post by John C Klensin
--On Friday, 04 June, 2004 17:15 +0800 dongxiaoli

Post by dongxiaoli
Hello everyone.
Have anybody can tell me WHY we need do "*Divide the
sequence into segments*" in the ToASCII process as
described in the
IMAA draft (draft-hofmann-imaa-03.txt)??

In many circumstances, there is a good deal of information
that is already encoded into email local-parts. The most
common of these, these days, are subaddresses and internal
routing codes, but external routing and address component
identification, such as those used for X.400 addresses and
fax-like attribute identification when those are mapped to
the Internet, have been common (and probably still are) in
some communities.
In many of those cases, the presence of the encoding and its
components must be recognized by the delivery MTA or, at the
boundary of an outright violation of the standards (however
necessary) by organizational-boundary relays such as
SMTP-handling firewalls. If the clues that the codings are
present are hidden by the i18n encoding, we gain
internationalization at the cost of the features supported by
that information-encoding.
We have no standard for any of those information encodings --
they have, so far, been protected by an extremely strong rule
that no MTA before the final delivery one is permitted to
interpret the contents of a local-part in any way. All
previous systems and MTAs, including the originating user and
MUA, are required to treat the entire local-part address an an
atomic and opaque string.
The segments and delimiters of draft-hoffman-imaa-03 are
intended to alleviate the problem with hiding these encodings
while providing internationalization for the various address
components.
This issue was the original philosophical departure point
between draft-hoffman-imaa and draft-klensin-emailaddr-i18n.
The latter takes a somewhat stronger and more protective view
of the local-part addresses, noting particularly that the
approach in draft-hoffman-imaa will not protect either
information in local-parts that is length-encoded (e.g., "the
address contains a 'project distribution' component beyond
the mailbox name for the recipient; the last three characters
are the project identifier, the other characters are the
mailbox name") or information that is separated by non-ASCII
delimiters. For example, were you to want to use
subaddresses with a mailbox whose name was in Chinese, the
rules of draft-hoffman-imaa would require that you separate
the mailbox and subaddress elements with an ASCII delimiter,
not one that might appear more appropriate for Chinese, if
the classification and action were to be done by an MTA that
did not, itself, run ToUnicode. There is some additional
discussion on this issue in
draft-klensin-emailaddr-i18n-02.txt.

Post by dongxiaoli
And another question: "The "*protected code points"
are 0..40, 5B..60, 7B..7F (in other words, those
corresponding to ASCII characters other than letters and
digits). *" is from draft-hofmann-imaa-03.txt. But the
code points of digits range between 0...40,so why said the
*digits is not protected code points*?

Paul or Adam will need to answer this one.
best,
john

J-F C. (Jefsey) Morfin

2004-06-07 22:31:16 UTC

Permalink

Dear John,
I am afraid your answer is out of the scope of my question.
My question does not relate in any way to the specificity of the
case, nor to the management of the case by ICANN, nor why
the WG-IDN took that options and why. It is related to the
very existance of the case and of its technical reasons.

This is IETF. So the point is technical. The way the ideas are
expressed are of no interest. What is of interest is the technical
roots of each of the protests. To which extent these technical
objections only relate to RHS or may also concern the LHS. And
what could we do to avoid the related problem in specifying an
LHS solution.

I personnaly (but I may be wrong) read most of them as objecting
to the IETF/ICANN adopted technical strategy. You describe that

Post by John C Klensin
IETF, made a quite explicit decision to specify mappings and
codings but to become as little involved as possible with the
specification of usage, which characters and character
combinations should actually be used, etc. The IESG reinforced
the nature of this decision by issuing a "statement" along with
the standards that might be described as "this is what we have
done, but you don't have a complete picture until you do some
other things, and you should be careful about them".
Following that extremely broad guideline, ICANN adopted a
guideline/ recommendation that strongly suggested that IDN
registrations occur only "by language". This was consistent
with extended discussions on the IDN list (while the standards
were under development) about the advantages of "one label / one
language" (or "one script") strategies in eliminating a number
of opportunities for problems, but the IDN WG did not make a
recommendation on that subject (for or against). ICANN also
created, as a convenience, a registry for the tables of
characters permitted by given DNS registries.

The question is: can the above strategy, together with the
IETF disclaimer below, be copied for LHS for average registrants,
when RHS (ie. ccTLD) leads to situation inflaming international
highly representative organizations?

Post by John C Klensin

Post by J-F C. (Jefsey) Morfin
From the standpoint of this list, or any other IETF-related

The rest of your reponse concerns ICANN and is not an
IETF issue. Except that I indicate that IETF/IESG specified
a solution which calls for "bodies able to decide for everyone else
in the world, what characters are or are not in a given script and

Post by John C Klensin
And ICANN isn't nearly that stupid, nor is any other
rational international organization I know of, governmental,
treaty, NGO, or completely private.
And, in my opinion, none of it suggests any reason for an
IETF-related discussion on this subject,

If "this subject" is any LHS specification which may lead to
a similar comment by a group representing the wide
majority of the Internet users. Please abstain and IETF
publish it is not able to address ML.ML.ML

My personnal technical position is simple. As a user
organization I read a mail asking to review the WG-IDN
proposition. I came and saw the proposed technical options
will lead to the kind of conflict and dead-end documented
today by the MINC. I was the objected the most vocal - but
not by everyone by far. So I tried to help - after all I could
have been wrong.

Now, when this happens, I do not say you should have
objected what you state as obvious today. I just ask "is it
reasonable now that we have the proof that these options
lead to this kind of conflict on the Right side, to keep
them for the Left side"?

And I ask if saying "this is not my cup of tea, because
I have said I am not involved in the decisions you will take
to make work the impossible system I designed" will be
enough to give crediblity and momentum to any LHS
replication of the RHS solution?

Post by John C Klensin
nor does it justify your conclusions below. One thing it
certainly does do is to illustrate the fact that people get
extremely sensitive about their languages and how they
are used. But we all know that already, and it was one of
the things that drove IETF to stay very far away from those
issues.

Then IETF should have declined to propose a solution.
You cannot stay far away from a standard you propose
or you lose credibility.

Knowing the above, what technical alternative do we have
to help users not to be confronted to that kind of situation?
Here is the point. IETF's role is to publish RFC and
standards to be used, not to be disputed.

IMHO this is feasible, but with a totally different perspective,
and probably in a totaly different working way, associating
concerned cultural organizations to get the necessary
usage and foreign technical culture. In making them share
into the requirement definitions, in the specifications work.
Also in forgeting external politically originated constraints
(ICANN).

I certainly accept that this might change IETF. But what
if the result is an Arabic non compatible root?

I fully share the idea that cultures need self-determination.
All the cultures, not only one.

The role of IETF is to make it technically possible without
hurting the global stability. Not to make this the top IETF
priority is IMO a _technical_ and lethal fault. We had the
opportunity to address the problem at length, we have been
warned by several ITU assemblies and Gov consensuses.
We have been warned by the WSIS resolutions. We did
not listen to them.

Now the problem is with us.
jfc

PS. I apologize for having kept Dongxialoi's question attached
by mistake. Quite confusing. Just used a reply to the list
and did not cut it.

Paul Hoffman / IMC

2004-06-07 22:47:35 UTC

Permalink

Post by J-F C. (Jefsey) Morfin
This is IETF.

No, it is not. It is a mailing list. And, to be specific, it is a
mailing list that is not associated with any IETF WG.

I have said this before many times, but you see to have forgotten.
I'll change the title on the mailing list soon to help folks remember.

--Paul Hoffman, Director
--Internet Mail Consortium

John Cowan

2004-06-08 00:56:51 UTC

Permalink

Post by J-F C. (Jefsey) Morfin
The question is: can the above strategy, together with the
IETF disclaimer below, be copied for LHS for average registrants,
when RHS (ie. ccTLD) leads to situation inflaming international
highly representative organizations?

Yes. Indeed, it will work better for email providers, because there
are many of them in most environments. Each can set a private policy
about what characters, or sequences of characters, they will or will
not accept in local-parts.

In general, companies are far less malleable under political pressure than
government agencies. The only way to forcibly shift one from its policy
will be by private lawsuit, and in a vanishing number of cases will it
be worth doing that rather than just switching to another email provider.

--
Where the wombat has walked, John Cowan <***@reutershealth.com>
it will inevitably walk again. http://www.ccil.org/~cowan

J-F C. (Jefsey) Morfin

2004-06-08 02:29:29 UTC

Permalink

Dear John,
I understand where you come from with this point. But the point here is
quite the opposite. It is to say that I am entitled to see my culture
respected and that culture to self determine the way it wants to express
itself on the internet.

For example, a question debated in here in many accoasion is ; to be case
sensitive or not? This is not a question for the IETF. This is not a
question for the Mail service provider, this is a question for the user to
decide what he wants.

But non only the IETF specification must permit it, but it must deliver a
tool which will permit the user to decide a parameter set according to the
Academic, Politic, Family, whatever Lingual Authority he wants to chose.

This may have a lot of implication. MINC rises the point of ".xxx" being or
not translated by ICANN decision. They say it is to be by lingual
organization decision. Why? Because most of the Arabic countries are
Moslems, and they oppose pornography in their constitution. Not translating
it permits to address the demand of the users to protect them against this
risk - because "xxx" cannot not be typed easily on an Arabic keyword.

You will say that they could bar it.
- one: this is not to others to decide for their law. Would you accept
easily if the US car driving had to adapt to the French standard because
internationalized (you must come back on the right lane on an highway) ?
- two: there is a big difference between censoring and not helping.
- etc.

We are not only talking of e-mail providers but of every mail. This is a
standard. Let say this. You have your email system home. You define a
configuration and you discover that a new standard has killed your
configuration because some other people said they known better what was
good for you. And you discover that the new configuration doe snot conform
to the law. Then that it violates your beliefs.

What would you say?

In general, companies are far less malleable under political pressure than
government agencies. The only way to forcibly shift one from its policy
will be by private lawsuit, and in a vanishing number of cases will it be
worth doing that rather than just switching to another email provider.

hmmm. I hope you do not mean that Govs are malleable under the pressure of
a foreign Gov? Again we talk of universal standard which are to respect the
sovereignty, the law, the decision of countries, of states, of people. A
standard is not here to force because the designer was not able to do
otherwise, it is to serve the users to dialog together the way they want.
The first human right on a network is to be able to refuse. To refuse
hackers, to refuse spam, to refuse adult picture, to refuse cultural
corruption, to refuse privacy violation (you will note that all these
points which look quite political, legal or societal are parts of the LHS).

IMHO the LHS (and more generally the whole mail solution) should stay
untouched. Because it is here. No one can object to all what it may provide
today. This permits the current minimum to stay. Then this should be
considered as the low level structure of the new system encapsulating it.
The propositions today should therefore be user layer oriented. Then the
technical underlaying structure will be able to adapt, and probably
simplify in the future. The "mail layer" is too complex. Let split it in
adding a new layer, an interface and leaving part of the current system
obsoleting. Leave users organizations specify the user layer or debate it
with manufacturers. Let just help them simplifying the current solution
when possible.

Otherwise, how do you want to provide what people most need today : address
menus? target groups? originating classes? issue related addresses,
multilingal relations (I send in one language, I receive in another one, or
several ones depending on copies), etc.
jfc

Adam M. Costello

2004-06-05 23:12:00 UTC

Permalink

"The "*protected code points" are 0..40, 5B..60, 7B..7F (in other
words, those corresponding to ASCII characters other than letters and
digits). *" is from draft-hofmann-imaa-03.txt.

Oops, thanks for catching that. The prose describes the actual
intention. The number ranges should be 0..2F, 3A..40, 5B..60, 7B..7F.

Have anybody can tell me WHY we need do "*Divide the sequence
into segments*" in the ToASCII process as described in the
IMAA draft (draft-hofmann-imaa-03.txt)??

Before internationalized local parts can be safely created in a mail
domain, a couple questions need to be considered:

1) Are local parts case-sensitive within this domain?

2) Are local parts structured within this domain, using delimiters
other than protected characters?

If the answer to either question is "yes", then internationalized local
parts will not work correctly in the domain unless the MTAs for the
domain are upgraded to understand the IMAA encoding.

However, if the answer to both questions is "no", then there is no need
to upgrade the MTAs. Internationalized local parts can be created in
the domain and they will work correctly.

The concept of protected characters and segmentation is designed to
make it effortless to support internationalized local parts in the vast
majority of real mail domains.

The typical examples are domains that support user+***@domain, or
which treat owner-***@domain specially. Because plus-sign and
hyphen-minus are protected characters, it doesn't matter whether
the joining/splitting of the components is performed on the ASCII
form or the Unicode form, because the result will be the same either
way. If IMAA did not have segmentation, any software that dealt with
prefixed/suffixed local parts would have to be careful about the proper
order of conversions, and would therefore have to be IMA-aware.

AMC

dongxiaoli

2004-06-07 02:36:05 UTC

Permalink

Thanks Adam and john.You do me a great favor. :-)

Xiaoli Dong