| Abstract |
Tagaligner is a program that segments and aligns corresponding translated sentences contained in two markup-language-based files, and generates a TMX translation memory from them for use in computer-assisted translation.
Tagaligner uses the tag structure of web pages and of documents in other XML-based languages to improve on the results of classic geometrical aligners. The aligner has been tested with XHTML web pages preprocessed with the tidy program and encoded in ISO-8859-1, but it may also work with HTML files and other encodings. |
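
To give a rough idea of the kind of output described above, the following sketch builds a minimal TMX document from sentence pairs that are already aligned. It is not Tagaligner's own code: the function name `build_tmx`, the sample sentences, and the header attribute values are assumptions chosen for illustration, and the alignment step itself is not shown.

```python
import xml.etree.ElementTree as ET

# Clark-notation name that ElementTree serializes as the reserved xml:lang attribute.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def build_tmx(pairs, src_lang, tgt_lang):
    """Return a minimal TMX 1.4 document (as a string) for aligned sentence pairs.

    `pairs` is a list of (source_sentence, target_sentence) tuples.
    Hypothetical helper for illustration only; not part of Tagaligner.
    """
    tmx = ET.Element("tmx", version="1.4")
    # Header attribute values below are placeholders, not Tagaligner's real values.
    ET.SubElement(tmx, "header", {
        "creationtool": "example-sketch",
        "creationtoolversion": "0.1",
        "segtype": "sentence",
        "o-tmf": "none",
        "adminlang": "en",
        "srclang": src_lang,
        "datatype": "plaintext",
    })
    body = ET.SubElement(tmx, "body")
    for src, tgt in pairs:
        # One translation unit per aligned pair, with one variant per language.
        tu = ET.SubElement(body, "tu")
        for lang, text in ((src_lang, src), (tgt_lang, tgt)):
            tuv = ET.SubElement(tu, "tuv", {XML_LANG: lang})
            ET.SubElement(tuv, "seg").text = text
    return ET.tostring(tmx, encoding="unicode")


if __name__ == "__main__":
    sample = [("Hello.", "Hola."), ("Goodbye.", "Adiós.")]
    print(build_tmx(sample, "en", "es"))
```

Each `tu` element holds one aligned sentence pair, which is the unit a computer-assisted translation tool looks up when it reuses a translation memory.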