org.mmbase.util.transformers
Class CP1252Surrogator
java.lang.Object
org.mmbase.util.transformers.ReaderTransformer
org.mmbase.util.transformers.ConfigurableReaderTransformer
org.mmbase.util.transformers.CP1252Surrogator
- All Implemented Interfaces:
- CharTransformer, ConfigurableTransformer, Transformer
- public class CP1252Surrogator
- extends ConfigurableReaderTransformer
- implements CharTransformer
Replaces the Windows CP1252 characters that are not valid in ISO-8859-1 with surrogates. It can also
repair wrongly decoded Strings (byte arrays that were actually CP1252 but were treated as ISO-8859-1
when they were converted to a Java String).
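The repair case described above can be illustrated without MMBase. The following is a minimal sketch using only the JDK, not the implementation of this class; the class and method names (Cp1252RepairSketch, decodeWrongly, repair) are made up for the illustration. It relies on the fact that ISO-8859-1 maps every byte to the code point with the same value, so the original bytes can be recovered and decoded again as CP1252.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical illustration class; not part of MMBase.
public class Cp1252RepairSketch {
    private static final Charset CP1252 = Charset.forName("windows-1252");

    /** Simulates the mistake: CP1252 bytes decoded as if they were ISO-8859-1. */
    public static String decodeWrongly(byte[] cp1252Bytes) {
        return new String(cp1252Bytes, StandardCharsets.ISO_8859_1);
    }

    /** Undoes the mistake: recover the original bytes, then decode them as CP1252. */
    public static String repair(String wronglyDecoded) {
        byte[] original = wronglyDecoded.getBytes(StandardCharsets.ISO_8859_1);
        return new String(original, CP1252);
    }

    public static void main(String[] args) {
        // 0x93 and 0x94 are curly double quotes in CP1252, 0x80 is the euro sign.
        byte[] bytes = {(byte) 0x93, (byte) 'h', (byte) 'i', (byte) 0x94, (byte) ' ', (byte) 0x80};
        String wrong = decodeWrongly(bytes);
        String fixed = repair(wrong);
        // The wrongly decoded String holds the control character U+0093 instead of a quote.
        System.out.println(Integer.toHexString(wrong.charAt(0)));
        // After the repair the String holds the intended characters.
        System.out.println(fixed.equals("\u201Chi\u201D \u20AC"));
    }
}
```

This round trip is only lossless when every character of the input String is at most U+00FF. A String that already contains correctly decoded characters outside that range would lose them, which is presumably why the class distinguishes WELL_ENCODED from WRONG_ENCODED input.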
- Since:
- MMBase-1.7.2
- Version:
- $Id: CP1252Surrogator.java,v 1.2.2.4 2005/02/02 10:11:21 michiel Exp $
- Author:
- Michiel Meeuwissen
Method Summary
java.lang.String getEncoding()
- Returns the encoding that is currently active.
static byte[] getTestBytes()
static java.lang.String getTestString()
static void main(java.lang.String[] args)
- For testing only.
java.io.Writer transform(java.io.Reader r, java.io.Writer w)
java.util.Map transformers()
- Returns which transformations can be done by an object of this class.
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
WELL_ENCODED
public static final int WELL_ENCODED
- See Also:
- Constant Field Values
WRONG_ENCODED
public static final int WRONG_ENCODED
- See Also:
- Constant Field Values
CP1252Surrogator
public CP1252Surrogator()
CP1252Surrogator
public CP1252Surrogator(int conf)
transform
public java.io.Writer transform(java.io.Reader r,
java.io.Writer w)
- Specified by:
transform in interface CharTransformer
- Specified by:
transform in class ReaderTransformer
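To show what a Reader-to-Writer surrogate transformation of this shape might look like, here is a self-contained sketch. It is not the MMBase implementation: the class SurrogateSketch, the helper surrogateFor, and above all the chosen replacement strings are assumptions made for the illustration, since this page does not list the substitutions the real class applies.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.StringWriter;
import java.io.UncheckedIOException;
import java.io.Writer;

// Hypothetical illustration class; not part of MMBase.
public class SurrogateSketch {

    /** Assumed substitutions for a few CP1252-only characters; null means "keep as is". */
    public static String surrogateFor(char c) {
        switch (c) {
            case '\u20AC': return "EUR";   // euro sign
            case '\u2018':
            case '\u2019': return "'";     // curly single quotes
            case '\u201C':
            case '\u201D': return "\"";    // curly double quotes
            case '\u2013':
            case '\u2014': return "-";     // en dash and em dash
            case '\u2026': return "...";   // horizontal ellipsis
            default: return null;
        }
    }

    /** Copies r to w, replacing the characters above; returns w like the documented method. */
    public static Writer transform(Reader r, Writer w) {
        try {
            int c;
            while ((c = r.read()) != -1) {
                String s = surrogateFor((char) c);
                if (s != null) {
                    w.write(s);
                } else {
                    w.write(c);
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return w;
    }

    /** Convenience wrapper for Strings. */
    public static String transform(String in) {
        return transform(new StringReader(in), new StringWriter()).toString();
    }

    public static void main(String[] args) {
        System.out.println(transform("\u201Chi\u201D costs \u20AC5\u2026"));
    }
}
```

After this step the output contains only characters that ISO-8859-1 can represent, provided the input used nothing beyond ISO-8859-1 plus the handled characters.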
transformers
public java.util.Map transformers()
- Description copied from interface:
ConfigurableTransformer
- Returns which transformations can be done by an object of this class.
- Specified by:
transformers in interface ConfigurableTransformer
- Specified by:
transformers in class ConfigurableReaderTransformer
getEncoding
public java.lang.String getEncoding()
- Description copied from interface:
ConfigurableTransformer
- Returns the encoding that is currently active
- Specified by:
getEncoding in interface ConfigurableTransformer
- Specified by:
getEncoding in class ConfigurableReaderTransformer
getTestBytes
public static byte[] getTestBytes()
getTestString
public static java.lang.String getTestString()
main
public static void main(java.lang.String[] args)
- For testing only.
Use on a UTF-8 terminal:
java -Dfile.encoding=UTF-8 org.mmbase.util.transformers.CP1252Surrogator
Or, on an ISO-8859-1 terminal (you will see question marks for the CP1252 chars):
java -Dfile.encoding=ISO-8859-1 org.mmbase.util.transformers.CP1252Surrogator
Or, if - may God forbid - you have a CP1252 terminal:
java -Dfile.encoding=CP1252 org.mmbase.util.transformers.CP1252Surrogator
You can simulate this last case with something like this:
java -Dfile.encoding=CP1252 org.mmbase.util.transformers.CP1252Surrogator | konwert cp1252-utf8
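The question marks mentioned for the ISO-8859-1 case follow from how Java encoders treat unmappable characters. The sketch below uses only the JDK and a made-up class name (TerminalEncodingSketch); it prints the bytes produced for the euro sign under the three encodings named above, without claiming anything about what main itself prints.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical illustration class; not part of MMBase.
public class TerminalEncodingSketch {

    /** Encodes text with the given charset and returns the bytes as upper-case hex. */
    public static String hex(String text, Charset cs) {
        StringBuilder sb = new StringBuilder();
        for (byte b : text.getBytes(cs)) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(String.format("%02X", b & 0xFF));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        String euro = "\u20AC";
        // UTF-8 can represent the euro sign: three bytes.
        System.out.println(hex(euro, StandardCharsets.UTF_8));
        // ISO-8859-1 cannot: getBytes substitutes the byte for '?'.
        System.out.println(hex(euro, StandardCharsets.ISO_8859_1));
        // CP1252 has a single byte for it.
        System.out.println(hex(euro, Charset.forName("windows-1252")));
    }
}
```

The three lines printed are E2 82 AC, 3F and 80. The byte 3F is the question mark, which is what an ISO-8859-1 terminal then displays.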
MMBase build 1.7.4.20050922