Re: [XaraXtreme-dev] Discussion of string portablility problems

From: Luke Hart <lukeh@xxxxxxxx>
Date: Mon, 03 Apr 2006 17:21:09 +0100
Subject: Re: [XaraXtreme-dev] Discussion of string portablility problems

Phil Martin wrote:

Personally I would be very happy to drop support for non-Unicode builds - it's an albatross.
I think your camStr plan sounds like a good one - lets us use %s cleanly and creates a clean string-handling API. The downside, I suppose, is greater reliance on wxWidgets but that's a theoretical problem more than a real one.
Phil

Gerry Iles wrote:
There is no single solution that will solve all these issues as they stem from the code starting as TCHAR based that was only ever built in non-unicode mode.
One way of removing some of the complexity and handling (some of) these issues would be for us to drop support for non-unicode builds. This would allow the direct use of %ls to mean a string parameter, though the use of %s to write a non-unicode string into a Unicode one would not work on Windows and any code that relied on this would need to change. This would remove the need for the _T macro though we would still have issues in some of our macros if they are called with concatenated strings. We would then also make the interface to XarLib only support wide character strings, making it much easier for the code to be shared between XaraLX and XarLib. I was thinking that this would be a sensible route to go but it would still need a lot of code to be changed (a script could replace most of the _T() and tchar type functions but there would be quite a lot of other code to be changed) and does nothing to address the difference between our own functions and printf (e.g. you would have to use %ls in printf type calls and %s in MakeMsg ones).
wxWidgets defines its own versions of all the string functions (e.g. wxStrcpy) and the wxchar.h file has very complex conditionals in it to support all sorts of platforms. It actually implements a layer over all the printf type functions on Linux to make them behave like the MSVC ones as that is “more useful for us”. We could define our own set of string functions (e.g. camStrcpy) for use by the kernel (or Oil code that could be easily ported away from wx) that the wxOil layer simply defines as mapping to the wx equivalents. This would enable us to go back to the original meaning of %s always meaning a TCHAR string. XarLib could then share the code from XaraLX and would either just use the wxchar parts of wxWidgets (probably best for now) or would define its own wrapper for Linux. I actually now think this would be a better option than dropping non-Unicode support as it standardises our %s usage and removes the PERCENT_S macros and should only require some search and replaces and a list of #define camStrcpy wxStrcpy type defines. This would also considerably simplify compatdef.h.
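As a rough illustration of the mapping layer described above, here is a minimal sketch that uses the standard C library in place of the wx equivalents, so that it stands alone. In the wxOil layer the defines would presumably point at wxStrcpy and friends instead. Apart from camStrcpy, every name here (camChar, CAM_T, CAM_UNICODE, camStrcat, camStrlen, camStrcmp, camJoin) is an assumption made up for the example and not taken from the XaraLX source.

```cpp
// camstr_sketch.h - hypothetical mapping layer, not the actual XaraLX code.
#include <cstddef>
#include <cstring>
#include <cwchar>

#if defined(CAM_UNICODE)               // assumed build switch
typedef wchar_t camChar;
#define CAM_T(x)   L##x                // stands in for _T()
#define camStrcpy  std::wcscpy
#define camStrcat  std::wcscat
#define camStrlen  std::wcslen
#define camStrcmp  std::wcscmp
#else
typedef char camChar;
#define CAM_T(x)   x
#define camStrcpy  std::strcpy
#define camStrcat  std::strcat
#define camStrlen  std::strlen
#define camStrcmp  std::strcmp
#endif

// Kernel-style code only ever sees the cam* names, so the same source
// builds in either mode. The caller must supply a large enough buffer.
inline std::size_t camJoin(camChar* dest, const camChar* a, const camChar* b)
{
    camStrcpy(dest, a);
    camStrcat(dest, b);
    return camStrlen(dest);
}
```

With this in place a call such as camJoin(buf, CAM_T("Hello"), CAM_T("world")) reads the same in narrow and wide builds, and only the header changes when the underlying library does.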
Does anyone have any other suggestions or other comments on any of this?

Gerry

------------------------------------------------------------------------
*From:* owner-dev@xxxxxxxxxxxxxxxx [mailto:owner-dev@xxxxxxxxxxxxxxxx] *On Behalf Of* Gerry Iles
*Sent:* 03 April 2006 13:03
*To:* dev@xxxxxxxxxxxxxx
*Subject:* [XaraXtreme-dev] Discussion of string portablility problems
I am currently working on creating an OpenSource version of the XarLib library (see http://www.xara.com/support/docs/webformat/spec/) that will build on Linux, Mac and Windows. This library really needs to build in both Unicode and non-unicode versions and shouldn’t require wxWidgets. The library consists of the Xar format loading and saving code from XaraLX with a small wrapper around it providing a simple interface. While trying to update some of the XaraLX files from the latest version I have come across several problems that really need to be sorted out properly…
Use of _T(), # and ##
I went into this in my message of 29/03 but I’ve looked into it a bit more and the MS compiler errors if you try to concatenate a narrow and a wide string. GCC behaves differently, e.g.:
char testa[256] = "Hello" "world";
char testb[256] = "Hello" L"world";
char testc[256] = L"Hello" "world";
char testd[256] = L"Hello" L"world";
wchar_t wtesta[256] = "Hello" "world";
wchar_t wtestb[256] = "Hello" L"world";
wchar_t wtestc[256] = L"Hello" "world";
wchar_t wtestd[256] = L"Hello" L"world";
TCHAR ttesta[256] = "Hello" "world";
TCHAR ttestb[256] = "Hello" L"world";
TCHAR ttestc[256] = L"Hello" "world";
TCHAR ttestd[256] = L"Hello" L"world";

MSVC gives the following errors (in a Unicode build):

(2) : error C2308: concatenating mismatched wide strings

(3) : error C2308: concatenating mismatched wide strings
(3) : error C2440: 'initializing' : cannot convert from 'const unsigned short [8]' to 'char [256]'
There is no context in which this conversion is possible
(4) : error C2440: 'initializing' : cannot convert from 'const unsigned short [11]' to 'char [256]'
There is no context in which this conversion is possible
(5) : error C2440: 'initializing' : cannot convert from 'const char [11]' to 'wchar_t [256]'
There is no context in which this conversion is possible

(6) : error C2308: concatenating mismatched wide strings
(6) : error C2440: 'initializing' : cannot convert from 'const char [17]' to 'wchar_t [256]'
There is no context in which this conversion is possible

(7) : error C2308: concatenating mismatched wide strings
(9) : error C2440: 'initializing' : cannot convert from 'const char [11]' to 'TCHAR [256]'
There is no context in which this conversion is possible

(10) : error C2308: concatenating mismatched wide strings
(10) : error C2440: 'initializing' : cannot convert from 'const char [17]' to 'TCHAR [256]'
There is no context in which this conversion is possible

(11) : error C2308: concatenating mismatched wide strings
Concatenation of the mismatched strings gives an error but then continues compilation using the type of the first sub-string.
GCC gives the following:

2:error: char-array initialised from wide string

3:error: char-array initialised from wide string

4:error: char-array initialised from wide string

5:error: int-array initialised from non-wide string

9:error: int-array initialised from non-wide string
Also, removing the lines that error shows that the other lines all produce a fully wide string (e.g. lines 10, 11 and 12 all produce a TCHAR string saying “Helloworld”).
So, it appears that when concatenating strings, if any of the sub-strings are wide then GCC promotes all of the sub-strings to wide ones. This explains why our macros that do this compile on Linux but not on MSVC.
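The promotion seen with GCC matches what C99 and C++11 specify: when a literal with no prefix sits next to a wide one, the unprefixed literal is treated as wide. The sketch below, assuming a C++11 compiler, checks this at compile time and also shows the usual two-level widening macro for putting the L prefix on a narrow literal that arrives through a macro. CAM_WIDEN, CAM_WIDEN2, APP_NAME and sameText are hypothetical names invented for the example.

```cpp
#include <cwchar>

// Mixed concatenation: the unprefixed piece is treated as wide (C++11 rules).
static const wchar_t kMixedA[] = "Hello" L"world";
static const wchar_t kMixedB[] = L"Hello" "world";
static const wchar_t kWide[]   = L"Hello" L"world";

// 10 characters plus the terminator, all of them wchar_t.
static_assert(sizeof(kMixedA) == 11 * sizeof(wchar_t), "mixed literal is fully wide");
static_assert(sizeof(kMixedB) == sizeof(kWide), "order of the pieces does not matter");

// Two-level macro so that the argument is expanded before L is pasted on.
#define CAM_WIDEN2(x) L##x
#define CAM_WIDEN(x)  CAM_WIDEN2(x)

#define APP_NAME "Hello"               // a narrow literal hidden behind a macro
static const wchar_t kFromMacro[] = CAM_WIDEN(APP_NAME) L"world";

// Small helper so callers can compare the results at run time.
inline bool sameText(const wchar_t* a, const wchar_t* b)
{
    return std::wcscmp(a, b) == 0;
}
```

Note that the widening macro only prefixes the first token, so a macro argument that is itself "a" "b" still ends up as a mixed concatenation, which is the case the MSVC errors above complain about.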
Use of %s in C runtime printf type functions
It appears as though on Linux %s always means a narrow char string even in the wide version of the function. To pass a wide string you must specify %ls. However, on Windows, MS have “helpfully” changed the wide string version (e.g. swprintf) so that %s means a TCHAR pointer and to force a narrow string you have to use %hs (you can still use %ls to force a wide string). This can be handled by the PERCENT_S macro (with new PERCENT_Sa and PERCENT_Sw macros for forcing to narrow or wide) but there are quite a lot of calls to printf type functions that pass a %s that need to be changed to work correctly (e.g. at present they will not work correctly on Linux because they use %s but pass a TCHAR*). Also there are quite a few places where narrow strings are used deliberately and this requires that the PERCENT_S macros are not used (as they are defined as wide strings on Unicode builds).
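To make the difference concrete, here is a small sketch of how a pair of narrow/wide format macros could be defined for a wide (Unicode) build. The SKETCH_ names are hypothetical and are not the real PERCENT_Sa and PERCENT_Sw definitions from compatdef.h; the only assumptions are the ones stated above, namely that %hs forces narrow in MSVC's wide printf functions, that %s means narrow in the wide functions elsewhere, and that %ls means wide on both.

```cpp
#include <cstddef>
#include <cwchar>
#include <string>

#if defined(_WIN32)
#define SKETCH_PERCENT_Sa L"%hs"       // MSVC wide printf: %hs forces a narrow string
#else
#define SKETCH_PERCENT_Sa L"%s"        // elsewhere: %s in a wide printf is narrow
#endif
#define SKETCH_PERCENT_Sw L"%ls"       // %ls means a wide string on both

// Formats one narrow and one wide argument into a wide result.
// Returns an empty string if formatting fails or the buffer is too small.
inline std::wstring formatBoth(const char* narrow, const wchar_t* wide)
{
    wchar_t buf[128];
    const int n = std::swprintf(buf, sizeof(buf) / sizeof(buf[0]),
                                SKETCH_PERCENT_Sa L" / " SKETCH_PERCENT_Sw,
                                narrow, wide);
    return n < 0 ? std::wstring()
                 : std::wstring(buf, static_cast<std::size_t>(n));
}
```

The narrow argument is converted using the current locale, so this sketch is only safe to rely on for plain ASCII text unless the locale has been set up appropriately.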
Other printf type functions
XaraLX also implements some of its own printf type functions, _MakeMsg, CCvsprintf etc. These only handle a subset of the standard % codes and all treat %s as a TCHAR* and don’t allow the use of different width strings.
Other problematic string usage
There are various places in the XaraLX code where narrow string functions are specifically used but are passed TCHAR* as parameters. Presumably this sort of thing is being fixed when the files are initially ported.
I think the best solution for this would be for all of our printf type functions to behave the same and preferably not require the use of PERCENT_S type macros but this would involve updating our own functions (e.g. to support floating point) and also making the standard printf type functions on Linux treat %s as TCHAR* (either by a wrapper that rewrites the format string or by writing our own version). Alternatively, we would have to carefully document exactly what each does support and document how they must be used to be portable.
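The "wrapper that rewrites the format string" option could look something like the sketch below, assuming a Unicode build where TCHAR is wchar_t. It turns every plain %s into %ls before handing the format to vswprintf and leaves %%, %hs and %ls alone. The names camRewriteFormat, camFormat and isSpecPrefixChar are hypothetical, the fixed 256-character buffer is an arbitrary choice for the example, and positional arguments and other extensions are not handled.

```cpp
#include <cstdarg>
#include <cstddef>
#include <cwchar>
#include <string>

// True for flag, width and precision characters that may follow '%'.
inline bool isSpecPrefixChar(wchar_t c)
{
    return c != L'\0' && std::wcschr(L"-+ #0123456789.*", c) != nullptr;
}

// Rewrites "%s" (with no length modifier) to "%ls"; everything else is copied.
inline std::wstring camRewriteFormat(const std::wstring& fmt)
{
    std::wstring out;
    out.reserve(fmt.size() + 8);
    for (std::size_t i = 0; i < fmt.size(); ++i) {
        out += fmt[i];
        if (fmt[i] != L'%')
            continue;
        ++i;                                       // character after '%'
        if (i < fmt.size() && fmt[i] == L'%') {    // "%%" is a literal percent
            out += fmt[i];
            continue;
        }
        while (i < fmt.size() && isSpecPrefixChar(fmt[i]))
            out += fmt[i++];                       // copy flags, width, precision
        if (i >= fmt.size())
            break;                                 // dangling '%' at the end
        if (fmt[i] == L's')
            out += L'l';                           // no modifier: make it wide
        out += fmt[i];
    }
    return out;
}

// printf-style helper in which %s always means a wide (TCHAR) string.
inline std::wstring camFormat(const wchar_t* fmt, ...)
{
#if defined(_WIN32)
    const std::wstring f(fmt);                     // %s is already wide here
#else
    const std::wstring f = camRewriteFormat(fmt);
#endif
    wchar_t buf[256];
    va_list args;
    va_start(args, fmt);
    const int n = std::vswprintf(buf, sizeof(buf) / sizeof(buf[0]), f.c_str(), args);
    va_end(args);
    return n < 0 ? std::wstring()
                 : std::wstring(buf, static_cast<std::size_t>(n));
}
```

A call such as camFormat(L"%s has %d items", L"list", 3) would then be written the same way on both platforms, at the cost of one extra pass over the format string on the non-Windows side.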
Gerry

Anything that will remove the frankly hideous stuff in compatdef.h has got to be good. I am slightly worried though that wxStrcpy (and wxStrncpy, wxStrcat and wxStrncat) aren't in the wx documentation, although they are present in the headers. Having said that I can't imagine that they'd be removed.


   Luke
