RE: [XaraXtreme-dev] Discussion of string portability problems

No single solution will fix all of these issues, because they stem from code that started out TCHAR-based but was only ever built in non-Unicode mode.

One way to remove some of the complexity, and to handle some of these issues, would be to drop support for non-Unicode builds. We could then use %ls directly to mean a string parameter, although using %s to write a non-Unicode string into a Unicode one would not work on Windows, and any code that relied on this would need to change. It would remove the need for the _T macro, though we would still have issues in some of our macros if they are called with concatenated strings. We would also make the interface to XarLib support only wide-character strings, making it much easier to share code between XaraLX and XarLib. I was thinking this would be a sensible route, but it would still need a lot of code to be changed (a script could replace most of the _T() and tchar-type functions, but there would be quite a lot of other code to change) and it does nothing to address the difference between our own functions and printf (e.g. you would have to use %ls in printf-type calls and %s in MakeMsg ones).
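To illustrate the %ls point, here is a minimal sketch, assuming a wide-only build. FormatGreeting is a hypothetical helper, not something in XaraLX. In the wide printf family, %ls takes a wchar_t string both under standard C and under MSVC, whereas a plain %s means a narrow string in standard C and a wide one in MSVC's wide functions:

```cpp
#include <cassert>
#include <cwchar>
#include <string>

// Hypothetical helper (not from XaraLX): formats a wide string argument
// using %ls, which means "wchar_t string" on both MSVC and GCC/glibc.
std::wstring FormatGreeting(const wchar_t* name)
{
    assert(name != nullptr);

    wchar_t buffer[64];
    // std::swprintf takes the buffer size and returns the number of wide
    // characters written, or a negative value on failure.
    int written = std::swprintf(buffer, 64, L"Hello %ls", name);
    if (written < 0)
        return std::wstring();

    return std::wstring(buffer, static_cast<std::size_t>(written));
}
```

Calling FormatGreeting(L"Xara") should yield the wide string L"Hello Xara" on either platform. The same format string with %s would not be portable.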

wxWidgets defines its own versions of all the string functions (e.g. wxStrcpy), and the wxchar.h file has very complex conditionals in it to support all sorts of platforms. It actually implements a layer over all the printf-type functions on Linux to make them behave like the MSVC ones, as that is “more useful for us”. We could define our own set of string functions (e.g. camStrcpy) for use by the kernel (or by Oil code that could easily be ported away from wx), which the wxOil layer would simply define as mapping to the wx equivalents. This would let us go back to the original convention of %s always meaning a TCHAR string. XarLib could then share the code from XaraLX and would either just use the wxchar parts of wxWidgets (probably best for now) or define its own wrapper for Linux. I now think this would be a better option than dropping non-Unicode support, as it standardises our %s usage, removes the PERCENT_S macros, and should only require some search-and-replace plus a list of "#define camStrcpy wxStrcpy" type defines. It would also considerably simplify compatdef.h.
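A rough sketch of what such a mapping header might look like follows. Only camStrcpy and wxStrcpy come from the proposal above. CAM_USE_WX, camChar, camStrlen and CopyName are hypothetical names used for illustration, and the fallback branches simply stand in for whatever wrapper XarLib would define when built without wxWidgets:

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <cwchar>

#if defined(CAM_USE_WX)                 // hypothetical switch for the wxOil build
  #include <wx/wxchar.h>
  typedef wxChar camChar;
  #define camStrcpy wxStrcpy            // wxOil maps straight onto wx
  #define camStrlen wxStrlen
#elif defined(UNICODE) || defined(_UNICODE)
  typedef wchar_t camChar;              // stand-alone Unicode build
  #define camStrcpy std::wcscpy
  #define camStrlen std::wcslen
#else
  typedef char camChar;                 // stand-alone non-Unicode build
  #define camStrcpy std::strcpy
  #define camStrlen std::strlen
#endif

// Kernel-style code only ever sees the cam* names, so it compiles
// unchanged whichever branch above was selected.
// The caller must ensure dest is large enough for src plus terminator.
inline std::size_t CopyName(camChar* dest, const camChar* src)
{
    assert(dest != nullptr && src != nullptr);
    camStrcpy(dest, src);
    return camStrlen(dest);
}
```

The printf-type functions would need more than a one-line define on Linux, since that is where the meaning of %s differs. This sketch only covers the simple copy and length cases.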

Does anyone have any other suggestions or other comments on any of this?

Gerry

From: owner-dev@xxxxxxxxxxxxxxxx [mailto:owner-dev@xxxxxxxxxxxxxxxx] On Behalf Of Gerry Iles
Sent: 03 April 2006 13:03
To: dev@xxxxxxxxxxxxxx
Subject: [XaraXtreme-dev] Discussion of string portability problems

I am currently working on creating an open-source version of the XarLib library (see http://www.xara.com/support/docs/webformat/spec/) that will build on Linux, Mac and Windows. This library really needs to build in both Unicode and non-Unicode versions and shouldn’t require wxWidgets. The library consists of the Xar format loading and saving code from XaraLX, with a small wrapper around it providing a simple interface. While trying to update some of the XaraLX files from the latest version, I have come across several problems that really need to be sorted out properly…

Use of _T(), # and ##

I went into this in my message of 29/03, but I’ve looked into it a bit more: the MS compiler reports an error if you try to concatenate a narrow and a wide string literal, and GCC behaves differently. For example:

char testa[256] = "Hello" "world";
char testb[256] = "Hello" L"world";
char testc[256] = L"Hello" "world";
char testd[256] = L"Hello" L"world";
wchar_t wtesta[256] = "Hello" "world";
wchar_t wtestb[256] = "Hello" L"world";
wchar_t wtestc[256] = L"Hello" "world";
wchar_t wtestd[256] = L"Hello" L"world";
TCHAR ttesta[256] = "Hello" "world";
TCHAR ttestb[256] = "Hello" L"world";
TCHAR ttestc[256] = L"Hello" "world";
TCHAR ttestd[256] = L"Hello" L"world";

MSVC gives the following errors (in a Unicode build):

(2) : error C2308: concatenating mismatched wide strings

(3) : error C2308: concatenating mismatched wide strings

(3) : error C2440: 'initializing' : cannot convert from 'const unsigned short [8]' to 'char [256]'