Toad World® Forums

Need two options for UTF-8 in SAVE DIALOGUEs


#1

UTF-8 can be saved with or without a BOM (see
http://www.w3.org/International/questions/qa-utf8-bom for a discussion of the BOM).

However, not all applications can handle a file with a BOM at the beginning. For
instance, our batch C compiler can’t. So we have to save text that contains
UTF-8 characters as UTF-8 without a BOM. Fortunately, UltraEdit does this: it
gives us a choice of UTF-8 or UTF-8 No BOM.

Please give us the same options in the Toad SAVE Dialogues.
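
For anyone scripting around this in the meantime, here is a minimal sketch of the two save modes in Python. The helper name save_utf8 and its with_bom flag are made up for illustration and are not anything in Toad or UltraEdit. It relies on Python's standard "utf-8-sig" codec, which writes the three BOM bytes EF BB BF before the text, while plain "utf-8" writes none.

```python
def save_utf8(path, text, with_bom=False):
    """Write text as UTF-8, optionally prefixed with the byte order mark.

    Hypothetical helper for illustration only.
    """
    # "utf-8-sig" emits EF BB BF before the first character on write;
    # plain "utf-8" writes the encoded text with no signature.
    encoding = "utf-8-sig" if with_bom else "utf-8"
    # newline="" leaves line endings in the text untouched.
    with open(path, "w", encoding=encoding, newline="") as f:
        f.write(text)
```

Calling save_utf8("query.sql", text) would give the "UTF-8 No BOM" behaviour, and passing with_bom=True would give the other one.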


#2

Erwin, I added this to the Pond just to keep track of it, but it should probably
be done for the spring release regardless.

As an aside, I thought at one point that UTF-8 was the only encoding where the
BOM was optional, so I was surprised by UltraEdit’s addition of a no-BOM option
for UTF-16, but apparently it’s no longer required (though highly recommended)
for that encoding either.

Interestingly, here is a sidebar quote from the website you referenced:

In UTF-16 and UTF-32 encodings, unless there is some alternative indicator, the
BOM is essential to ensure correct interpretation of the file’s contents.

Some protocols, such as Microsoft .txt files, require the BOM in all cases.
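
To illustrate how a BOM works as that "indicator", here is a rough sketch in Python that guesses the encoding from the first bytes of a file. The function name detect_bom is hypothetical. The BOM constants come from Python's standard codecs module. This is only a signature check under the assumption that the file really starts with a BOM, not a full encoding detector.

```python
import codecs

# Check the 4-byte UTF-32 signatures first: the UTF-32 LE BOM (FF FE 00 00)
# begins with the same two bytes as the UTF-16 LE BOM (FF FE).
_BOMS = [
    (codecs.BOM_UTF32_LE, "utf-32-le"),
    (codecs.BOM_UTF32_BE, "utf-32-be"),
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
]

def detect_bom(data: bytes):
    """Return (encoding_name, bom_length), or (None, 0) if no BOM is found."""
    for bom, name in _BOMS:
        if data.startswith(bom):
            return name, len(bom)
    return None, 0
```

Without a BOM the function returns (None, 0), which is exactly the case where the reader needs some other indicator to know the encoding.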


#3

The problem is that some C compilers croak on the BOM. We found that out the hard
way.

PS

Here is a very quick rundown on the upgrade of our data warehouse to UTF-8. You
may need to be aware of some of the issues for Toad changes.

Some of the issues are


#4

Hi Erwin,

> The problem is that some C compiler croak on the BOM. We
> found that out the hard way.
I would think that those very same C compilers would also croak on the
text if it were in UTF-8 as well. If a compiler can’t cope with the BOM, then
how on earth will it cope with the potential for multi-byte characters?

I suspect some seriously weird error messages may well be the result!
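
One possible explanation, offered only as a guess about how such compilers behave: a byte-oriented compiler may pass bytes above 0x7F through untouched when they sit inside string literals or comments, whereas the BOM sits before the very first token, where nothing but source text is expected. If that is the case, removing just the three leading bytes would be enough. A minimal sketch in Python (strip_utf8_bom is a hypothetical name):

```python
import codecs

def strip_utf8_bom(data: bytes) -> bytes:
    """Return data without a leading UTF-8 BOM; other bytes are unchanged."""
    if data.startswith(codecs.BOM_UTF8):
        # Drop only the 3-byte signature (EF BB BF) at the very start.
        return data[len(codecs.BOM_UTF8):]
    return data
```

Any multi-byte characters further into the file are left exactly as they were, so this only helps if the compiler tolerates them where they appear.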

Cheers,
Norm. [TeamT]



#5

You may want to check the first column of the first line of the XML.
I suspect you may have a suspicious character inserted into the Byte Order Mark
before the declaration, e.g.

contents of fubar.xml:
fubar!

Martin
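
A quick way to do that check, sketched in Python under the assumption that the file is in an ASCII-compatible encoding such as UTF-8 (so that the declaration appears literally as the bytes "<?xml"). The function name is made up for illustration:

```python
def bytes_before_xml_declaration(data: bytes):
    """Return the bytes that precede "<?xml", or None if there is no declaration."""
    idx = data.find(b"<?xml")
    if idx == -1:
        return None
    # Empty result: the declaration is the very first thing in the file.
    # b"\xef\xbb\xbf": a plain UTF-8 BOM. Anything else deserves a closer look.
    return data[:idx]
```

Reading the file in binary mode with open(path, "rb").read() and passing the result in would show exactly what sits in front of the declaration.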

