.\" Hey, Emacs! This is -*-nroff-*- you know...
.\"
.\" gensprep.8: manual page for the gensprep utility
.\"
.\" Copyright (C) 2003 IBM, Inc. and others.
.\"
.TH gensprep 8 "18 March 2003" "ICU MANPAGE" "ICU @VERSION@ Manual"
.SH NAME
.B gensprep
\- compile StringPrep data from files filtered by filterRFC3454.pl
.SH SYNOPSIS
.B gensprep
[
.BR "\-h\fP, \fB\-?\fP, \fB\-\-help"
]
[
.BR "\-v\fP, \fB\-\-verbose"
]
[
.BI "\-c\fP, \fB\-\-copyright"
]
[
.BI "\-s\fP, \fB\-\-sourcedir" " source"
]
[
.BI "\-d\fP, \fB\-\-destdir" " destination"
]
.SH DESCRIPTION
.B gensprep
reads filtered RFC 3454 files and compiles their
information into a binary form.
The resulting
.B .icu
file can then be read directly by ICU, or used by
.BR pkgdata (8)
for incorporation into a larger archive or library.
.LP
The files read by
.B gensprep
are described in the
.B FILES
section.
.SH OPTIONS
.TP
.BR "\-h\fP, \fB\-?\fP, \fB\-\-help"
Print help about usage and exit.
.TP
.BR "\-v\fP, \fB\-\-verbose"
Display extra informative messages during execution.
.TP
.BI "\-c\fP, \fB\-\-copyright"
Include a copyright notice in the binary data.
.TP
.BI "\-s\fP, \fB\-\-sourcedir" " source"
Set the source directory to
.IR source .
The default source directory is specified by the environment variable
.BR ICU_DATA .
.TP
.BI "\-d\fP, \fB\-\-destdir" " destination"
Set the destination directory to
.IR destination .
The default destination directory is specified by the environment variable
.BR ICU_DATA .
.SH ENVIRONMENT
.TP 10
.B ICU_DATA
Specifies the directory containing ICU data. Defaults to
.BR @thepkgicudatadir@/@PACKAGE@/@VERSION@/ .
Some tools in ICU depend on the presence of the trailing slash, so
make sure that it is present whenever
.B ICU_DATA
is set.
.SH FILES
.B gensprep
reads the following files. It looks for the
.B rfc3454_*.txt
files in
.IR source /misc
and for
.B NormalizationCorrections.txt
in
.IR source /unidata.
.TP 20
.B rfc3454_A_1.txt
Contains the list of unassigned code points in Unicode version 3.2.0.\|.\|..
.TP
.B rfc3454_B_1.txt
Contains the list of code points that are commonly mapped to nothing.\|.\|..
.TP
.B rfc3454_B_2.txt
Contains the list of mappings for case folding of code points when normalization form NFKC is specified.\|.\|..
.TP
.B rfc3454_C_X.txt
Contains the list of code points that are prohibited for IDNA.
.TP
.B NormalizationCorrections.txt
Contains the list of code points whose normalization has changed since Unicode version 3.2.0.
.SH VERSION
@VERSION@
.SH COPYRIGHT
Copyright (C) 2000-2002 IBM, Inc. and others.
.SH SEE ALSO
.BR pkgdata (8)