Contents

 _______________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox ver. 2.46 – Release Notes

________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox ver. 2.45 – Release Notes

________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox ver. 2.44 – Release Notes

________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox ver. 2.43 Update 2 – Release Notes

________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox ver. 2.43 – Release Notes

________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox ver. 2.42 Update 5 – Release Notes

________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox ver. 2.42 Update 4 – Release Notes

________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox ver. 2.42 – Release Notes

________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox ver. 2.40 Update 1 – Release Notes

________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox ver. 2.40 – Release Notes

________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox ver. 2.37 Update 1 - Release Notes

________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox 2.37 Version Release Notes

________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox 2.36 Version Release Notes

________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox 2.35 Version Update 1 on Dec 17th

________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox 2.35 Support files Updates on Dec-16th

________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox 2.35 Support files Updates on Nov-23rd

________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox 2.35 Version Release Notes

________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox 2.34 Version Release Notes

________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox 2.33 Version Release Notes

 

___________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox ver. 2.46 - Release Notes

April 26th, 2012

Contents

New Features

Changes

Enhancements

Defects fixed

Support Files Updates

Core Configuration

Term and Punctuation Rules

Lists

Dictionaries

Documentation

Known Issues 

 

We are very pleased to announce a new release of Linguistic ToolBox ver. 2.46.  In this release, we have rationalized the functionality of the Segment Checker component and we have introduced several new ignorability options in order to enable users to block false positives and to allow more flexibility from the checks that need to be carried out.

 

With this release, we have also included new and updated support files, don’t forget to check these out! Special attention should be paid to the new and updated Microsoft Style Guide rules for these languages: Spanish, German, Portuguese (Portugal), Russian, Korean, Chinese (Traditional) and Chinese (Simplified).

 

If you have any technical questions or in case you identify any bugs, please contact us.

New Features

·         New Ignore options under Check inconsistency and Check reverse inconsistency.

 

o   Don’t check inconsistencies between locked and other segments: enables users to exclude from the checks those inconsistencies (direct and/or reverse) that are produced by locked segments and other segments. This feature is only appropriate for those teams that cannot modify locked segments due to inconsistencies.

 

o   Don’t check inconsistencies between ignored ‘matches equal or above’ and other segments: enables users  to exclude from the checks those inconsistencies (direct and/or reverse) that are produced by ignored ‘matches equal or above’ and other segments. This feature is only appropriate for those teams that cannot modify segments over a certain match percentage due to inconsistencies.

·         New Ignore options for the Check if target is empty.  These two options allow users to easily block false positives.

 

o   Ignore list: enables users to ignore source segments that don’t have an equivalent target by using a one column list of the segments that need to be ignored (Unicode .txt).This option is very convenient if the segments that need to be ignored don’t follow a certain pattern or for users unfamiliar with the creation of rules for Linguistic ToolBox. Note that each list entry must contain the whole segment that is supposed to be ignored, and that these entries are case-sensitive.

 

o   Ignore rule: enables users  to ignore source segments that don’t have an equivalent target by using a rule containing either regular expressions or list entries in a rule format. The rules created for this option, must have content in the Source column and must use the Error mode. This option is very useful to ignore those segments that follow a certain pattern, or where entries need to be case-insensitive

 

·   New Ignore options for the Random Sampling component.  New ignore options enable users  to be able to exclude specific segments from the sample to be generated.

 

o   Ignore duplicated segments Ignores instances of repeated segments.

o   Ignore empty target segments Ignores source segments that don’t have a corresponding target segment

o   Ignore alphanumeric only segments Ignores segments that only contain alphanumeric strings (including Roman numbers)

o   Ignore numeric only segments Ignores segments that only contain numeric strings

Changes

 

·      The Segment Checker tab has changed as follows with a view to make the UI more logical and intuitive.

 

·         The content has been divided into three different areas of functionality:

 

o   Inconsistency

o   Translation

o   Segment length

 

·         The below ignore options now only affect the Check inconsistency and Check reverse inconsistency:

 

o   Ignore case and punctuation

o   Ignore end punctuation

o   Ignore hotkeys

 

·         The below ignore options now only affect the Check if source equals target and Check if target is empty:

 

o   Ignore alphanumeric only segments

o   Ignore CaMel words

 

Enhancements

 

·         The parser has been improved to deal with the following situations:

o   Situations where there is more than one tag within a <ph> group.

o   Situations where there is more than one tag within an internal tag in .rtf files.

o   The pattern <ph><tag>text1</tag><tag2>text2</tag2></ph> is now interpreted as a whole and protected as ^TAG^.

·      The behaviour of the Check if Target is empty regular expression has been made more accurate when detecting unsegmented text that contains line breaks in .rtf files.

 

Defects fixed

 

·         Issues related to incorrect behaviour of the option Ignore case and punctuation in the Segment Checker.

·         Issue in the LocStudio Filter Settings window,  the filter Modified after was not applied to all files once the user clicked on the Apply Filters to all files button.

·         Issue with the parsing of segments containing references to footnotes in bilingual Trados Word files. Previously LTB processed those segments incorrectly and as a consequence bilingual checks could not be performed.

 

Support Files Updates

Core Configuration

 

·         Updated: The file LTB_Core_Config.zip has been updated with these changes:

·         Term and Punctuation:

o   Core ALL STANDARD Literals - CaMel-Alphanumerics [UL].rul (new)

o   New Core FI STANDARD Number Format [UL] EN-FI.rul (new)

o   Core ALL STANDARD Tags Generic.rul (updated)

o   Core ZH-TW STANDARD Duplicated Words [UL] ALL-ZHTW.rul (updated)

o   New Core DE STANDARD Number Format [UL] EN-DE.rul (updated)

o   New Core EN STANDARD Number Format [UL] EN-EN.rul (updated)

o   New Core EN STANDARD Number Format 1,000.0 to 1 000,0 [UL].rul (updated)

o   New Core EN STANDARD Number Format 1,000.0 to 1,000.0 [UL].rul (updated)

o   New Core EN STANDARD Number Format 1,000.0 to 1.000,0 [UL].rul (updated)

o   New Core ES STANDARD Number Format [UL] EN-ES.rul (updated)

o   New Core FR STANDARD Number Format [UL] EN-FR.rul (updated)

o   New Core IT STANDARD Number Format [UL] EN-IT.rul (updated)

o   New Core NL STANDARD Number Format [UL] EN-NL.rul (updated)

o   New Core PT-BR STANDARD Number Format [UL] EN-PTBR.rul (updated)

o   New Core SV STANDARD Number Format [UL] EN-SV.rul (updated)

 

Term and Punctuation Rules

 

·         New rules (23):

 

o  DE CUSTOM Microsoft Style Guide EN-DE.rul

Microsoft Style Guide Rule for German according to the Style Guide from February 2012.

 

o   JA CUSTOM Microsoft Character Spacing (MS Style Guide ver.7).rul

o   JA CUSTOM Microsoft Katakana (MS Style Guide ver.7).rul

o   JA CUSTOM Microsoft MacUI (MS Style Guide ver.7).rul

o   JA CUSTOM Microsoft Standard Expressions (MS Style Guide ver.7).rul

o   JA CUSTOM Microsoft Style Guide ver.7 - WindowsOS.rul

o   JA CUSTOM Microsoft Style Guide ver.7.rul

Microsoft Style Guide Rules for Japanese according to the Style Guide ver. 7. Please replace your ver. 6 rules from your configurations with the latest version of the rules.

 

o   KO CUSTOM Microsoft Style Guide EN-KO.rul

Microsoft Style Guide Rule for Korean according to the Style Guide from February 2012.

 

o   PT CUSTOM Microsoft Style Guide EN-PT.rul

Microsoft Style Guide Rules for Portuguese (Portugal) according to the Style Guide from March 2011.

 

o   RU CUSTOM Microsoft Style Guide EN-RU.rul

Microsoft Style Guide Rule for Russian according to the Style Guide from January 2012.

 

o   ZH-CN CUSTOM Microsoft Style Guide (UA) EN-ZHCN.rul

Rule only applicable for UA.

o   ZH-CN CUSTOM Microsoft Style Guide (UI) EN-ZHCN.rul

Rule only applicable for UI.

o   ZH-CN CUSTOM Microsoft Style Guide EN-ZHCN.rul

Microsoft Style Guide Rules for Chinese (Simplified) according to the Style Guide from February 2012.

 

o   ZH-TW CUSTOM Microsoft Style Guide EN-ZHTW.rul 

Microsoft Style Guide Rules for Chinese (Traditional) according to the Style Guide from January 2012.

   

o  New Core FI STANDARD Number Format [UL] EN-FI.rul

     New core rule for number checks. Checks that numeric values are present with the correct format (with thousands or decimal separators) in the target segment and that any numeric value present in the target segment is also present in the source. This rule has been updated to carry out more exhaustive checks and to include language specific information to make the rules more accurate and to reduce false positives.

 

o  Core ALL STANDARD Literals - CaMel-Alphanumerics [UL].rul

Checks for correspondence of "CaMel" words and alphanumeric expressions between source and target. Uses external lists Camel_Alphanumeric_Blockers.txt and ordinal_suffixes.txt to block matches in both source and target segments.

 

o  ALL STANDARD Punctuation (Parenthesis, Brackets and Braces).rul

Detects punctuation issues related to orphaned or duplicate parentheses/brackets/braces in the target. Note: This rule is source independent. This rule does not check for orphaned Japanese brackets.

 

o  ALL STANDARD Literals - Acronyms [UL].rul

Checks that acronyms are the same in the source and target. Use acronym_blockers.txt to prevent matches. Note: Only use this rule if you don’t have clear requirements on acronyms, if you do, use a custom terminology rule instead.

 

o  ALL STANDARD Strict Literals - Acronyms [UL].rul

Checks that acronyms are the same in the source and target. Use source_strict_acronym_blockers.txt and target_strict_acronym_blockers.txt to prevent matches. Note: Only use the STRICT if you know exactly which acronyms can be blocked, or for situations where there always needs to be a one to one correspondence.

 

o   DE STANDARD Detect Formal Style EN-DE.rul

Detects instances of usage of formal style in German. This rule is applicable for those cases where the guideline is to use the "personal" you (du) in German rather than the "formal" you (Sie). This rule is source-dependent.

o   DE STANDARD Detect Informal Style EN-DE.rul

Detects instances of usage of informal style in German. This rule is applicable for the general cases where the guideline is to use the "formal" you (Sie) in German rather than the "personal" you (du). This rule is source dependent.

o   DE STANDARD Strict Detect Formal Style ALL-DE.rul

Detects instances of usage of formal style in German. This rule is applicable for those cases where the guideline is to use the "personal" you (du) in German rather than the "formal" you (Sie). This rule is more strict than the DE STANDARD Formal Style version.

o   DE STANDARD Strict Detect Informal Style ALL-DE.rul

Detects instances of usage of informal style in German. This rule is applicable for the general case where the guideline is to use the "formal" you (Sie) in German rather than the "personal" you (du). This rule is more strict than the DE STANDARD Detect Informal Style version.

 

 

·         Updated rules (17):

 

o   ES CUSTOM Microsoft Style Guide EN-ES.rul

Microsoft Style Guide Rules for Spanish according to the Style Guide from February 2012.

 

o  DE CUSTOM Microsoft Common Errors EN-DE.rul

Rule updated to remove checks that are now included in the DE CUSTOM Microsoft Style Guide EN-DE.rul.

 

o  New Core FR STANDARD Number Format [UL] EN-FR.rul

o  New Core SV STANDARD Number Format [UL] EN-SV.rul

Rules updated to fix a defect and improve rule ID. 5 to allow using the Version Blockers to prevent matches in decimal numbers without an integer, for example: .5.

o  New Core EN STANDARD Number Format 1,000.0 to 1 000,0 [UL].rul

Rules updated to fix a defect and to remove Finnish from the Target language.

 

o   New Core EN STANDARD Number Format 1,000.0 to 1,000.0 [UL].rul

o   New Core EN STANDARD Number Format 1,000.0 to 1.000,0 [UL].rul

o   New Core EN STANDARD Number Format [UL] EN-EN.rul

o   New Core DE STANDARD Number Format [UL] EN-DE.rul

o   New Core ES STANDARD Number Format [UL] EN-ES.rul

o   New Core IT STANDARD Number Format [UL] EN-IT.rul

o   New Core NL STANDARD Number Format [UL] EN-NL.rul

o   New Core PT-BR STANDARD Number Format [UL] EN-PTBR.rul

 

o  Core ALL STANDARD Tags Generic.rul

Updated to catch up special cases where the tags are different between source and target but were not previously detected because they were internal groups, for example: <tag><ph></ph></tag>.

 

o  Core ZH-TW STANDARD Duplicated Words [UL] ALL-ZHTW.rul

Checks for accidentally duplicated words in Traditional Chinese. Use DupWordsPhrases_Blockers_ZHTW.txt list to define exceptions. (NOTE: Entries in the list file are CASE-SENSITIVE; be sure to enter any common case variants needed.) This rule has been updated to hardcode contextual exceptions.

 

o   ES STANDARD Common Errors ALL-ES.rul

o   ES STANDARD False Friends EN-ES.rul

Updated: new entries have been added and obsolete entries have been deleted.

 

 

·         Deleted rules (3):

 

o   ZH-TW STANDARD Style Check ALL-ZHTW.rul

All the checks contained within the deleted rule are now included in the rule Core ZH-TW STANDARD Spacing ALL-ZHTW.rul.

o   DE STANDARD STRICT Formal Style.rul

This rule has now been renamed to DE STANDARD Strict Detect Informal Style ALL-DE.rul for clarity purposes.

o   DE STANDARD Formal Style.ru.

This rule has now been renamed to DE STANDARD Detect Informal Style EN-DE.rul for clarity purposes.

 

Lists

·         CheckifSourceEqualsTarget_IgnoreList_EN-FR.txt

This list contains an exhaustive lists of words that are spelled the same in both English and French and can be added under the Segment Checker>Check if source equals target to block those segments that would cause false positives in English-French translations. Find this list at this location: ...\MyLTB\Support\Lists

Dictionaries

 

·         Updated dictionaries (5):

o   Assamese.xpl

o   Finnish.xpl

o   French.xpl

o   French (Canada).xpl

o   Sinhalese.xpl

 

·         New dictionaries (1):

o   Bengali.xpl

 

Documentation

·     Help file has been updated with changes related to this release.

·     Information about the functionality and usage of the rules installed with the Term and Punctuation has been updated with changes related to this release, a new field “User Lists” has been added to this table to indicate those .rul files that reference User lists: http://autoupdate.lionbridge.com/ltb/_LTB_Installation_Rules_Description.html  

·     Previous Release Notes are available at: http://autoupdate.lionbridge.com/ltb/PreviousReleaseNotes.htm

Known Issues

http://autoupdate.lionbridge.com/ltb/KnownIssues.htm 

___________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox ver. 2.45 - Release Notes

February 15th, 2012

Contents

New Features

Changes

Defects fixed

Support Files Updates

Core Configuration

Term and Punctuation Rules

Dictionaries

Documentation

Known Issues 

 

We are very pleased to announce a new release of Linguistic ToolBox ver. 2.45.  In this release, we have introduced several new features that are going to have a big impact on the user experience of our internal LTB users.

 

·         Easier way to create multilingual configuration files

·         Easier way to adapt Term and Punctuation rules to specific project requirements by using external user support lists

·         Windows 7 users now can drag and drop files in the Input files tab

 

With this release, we have also included new and updated support files, don’t forget to check these out! Especial attention must be paid to the new checks for number rules and to the character corruption rules for 38 new languages.

 

If you have any technical questions or in case you identify any bugs, please contact us.

New Features

 

·   New LTB default Installation path. LTB is no longer installed under the Program Files folder, the default installation for LTB going forward is:  C:\Users\User_Name\Documents\MyLTB

    Note: When installing this new version from the install.exe, previous executable files from the previous location will be automatically removed, however user specific support lists will remain in the previous installation folder, usually at this location: C:\Program Files\Lionbridge\Linguistic Toolbox

 

·   New Method to create Multilingual Configuration files. In this new version of LTB, all LTB internal users can now easily create multilingual configuration files. To do this users just need to select the Source language and in the Target language field, select [ All ]. Then, add all the relevant checks in LTB and save the multilingual configuration file as usual: File>Manage Configurations>Save.

 

·   New functionality to allow advanced users to easily use project specific support lists and to save project specific support files within a configuration file. In the Term and Punctuation component, now there is a new grid available called User lists. This grid now allows users to add user defined external support lists. This new functionality will allow internal users to be able to easily modify some of the Term & Punctuation rules by using external user support lists instead of having to modify certain rules with regular expressions. All the standard installation rules that have referenced external user support lists contain [UL] in the file name, to find out more about how to use the user support lists for each .rul file, please check the details in the rule Description (in the Linguistic Editor, go to Edit and then, select Properties). 

Changes

 

·      In the Term and Punctuation tab, the button Refresh lists according to selected languages has now been renamed to Refresh rules according to selected languages

 

·      Now, when activating the Term and Punctuation component, LTB does not load any rules in the grid automatically. This new behaviour speeds up the start up of the Term and Punctuation component. To add the required lists for each language, users can use the Add button to select individual rules or click on Refresh rules according to selected languages to see all rules available for the selected source and target languages.

 

·      New languages added: Luxembourgish, Maori (New Zealand), Quechua (Bolivia), Quechua (Ecuador), Quechua (Peru) and Uyghur (China),

Defects fixed

 

·         Issue preventing drag and drop of files in the Input file tab for Windows 7

 

·         Issue with language-neutral dictionaries not loading automatically from within a configuration file This issue has now been fixed and now configuration files including a language-neutral dictionary (a dictionary with Language:

 [ All ])  automatically display these dictionaries in the Spelling tab.

 

·         Issue with QUICK not eliminating substring matches to reduce noise This issue has now been fixed and will reduce substantially the output from the QUICK component. It is very common for SW reference files to contain terms that are substrings of other terms, for example “Print” and “Print All Pages”. For such cases, the QUICK check will ignore the substrings when it matches on the longer string (the “superstring”).

 

·         Issue with parsing double byte characters when processing unformatted segments Previously LTB used to incorrectly remove some double byte characters when processing unformatted segments in a Chinese OS.

 

·         Issue when adding a folder with a period in the name with the Add Folder button

 

·         Issue with LTB not recognizing in the Add Input files tab,  HTML files.

 

 

Enhancements

 

Support Files Updates

Core Configuration

 

·         Updated: The file LTB_Core_Config.zip has been updated with these changes:

·         Term and Punctuation:

o   New Core NL STANDARD Number Format [UL] EN-NL.rul (new)

o   New Core PT-BR STANDARD Number Format [UL] EN-PTBR.rul (new)

o    New Core SV STANDARD Number Format [UL] EN-SV.rul (new)

o   New Core DE STANDARD Number Format [UL] EN-DE.rul (updated)

o   New Core ES STANDARD Number Format [UL] EN-ES.rul (updated)

o   New Core IT STANDARD Number Format [UL] EN-IT.rul (updated)

o   New Core EN STANDARD Number Format [UL] EN-EN.rul (new)

o   New Core EN STANDARD Number Format 1,000.0 to 1,000.0 [UL].rul (new)

o   New Core EN STANDARD Number Format 1,000.0 to 1 000,0 [UL].rul (updated)

o   New Core EN STANDARD Number Format 1,000.0 to 1.000,0 [UL].rul (updated)

o   Core EN STANDARD Literals – Numbers.rul (removed from Core, and now renamed EN STANDARD Literals – Numbers.rul)

o   Core ALL STANDARD URLs [UL].rul (updated rule)

o   Core ALL STANDARD Duplicated Words [UL].rul (updated rule)

o   Core ZH-TW STANDARD Spacing ALL-ZHTW (new rule)

o   Core ZH-TW STANDARD Duplicated Words [UL] ALL-ZHTW.rul (new rule)

 

Term and Punctuation Rules

 

·         New rules (83):

 

o  New Core EN STANDARD Number Format [UL] EN-EN.rul

o  New Core NL STANDARD Number Format [UL] EN-NL.rul

o  New Core PT-BR STANDARD Number Format [UL] EN-PTBR.rul

o  New Core SV STANDARD Number Format [UL] EN-SV.rul

     New core rule for number checks. Checks that numeric values are present with the correct format (with thousands or decimal separators) in the target segment and that any numeric value present in the target segment is also present in the source. This rule has been updated to carry out more exhaustive checks and to include language specific information to make the rules more accurate and to reduce false positives.

 

o  New Core EN STANDARD Number Format 1,000.0 to 1,000.0 [UL].rul (new)

New core rule for number checks. Checks that numeric values are present with the correct format (with thousands or decimal separators) in the target segment and that any numeric value present in the target segment is also present in the source. This rule has been updated to carry out more exhaustive checks but does not include language specific information.

 

o   XX STANDARD Detect Corrupt Chars or Wrong Lang EN-XX.rul (x38 languages)

Detects possible corrupt characters in target (and to a limited extent in source) segments for Afrikaans, Albanian, Amharic, Assamese, Azeri, Bengali, Bosnian (Cyrillic), Bosnian (Latin), Byelorussian, Gaelic (Ireland), Gujarati, Hausa, Igbo, Kannada, Khmer, Kirghiz, Konkani, Macedonian, Malay, Malayalam, Maltese, Maori (New Zealand), Marathi, Nepali, Oriya, Punjabi, Quechua, Sesotho, Sinhalese, Swahili, Tamil, Tatar, Telugu, Tswana, Welsh, Xhosa, Yoruba and Zulu.

 

o   XX STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul (x38 languages)

Detects possible corrupt characters in monolingual files for these languages: Afrikaans, Albanian, Amharic, Assamese, Azeri, Bengali, Bosnian (Cyrillic), Bosnian (Latin), Byelorussian, Gaelic (Ireland), Gujarati, Hausa, Igbo, Kannada, Khmer, Kirghiz, Konkani, Macedonian, Malay, Malayalam, Maltese, Maori (New Zealand), Marathi, Nepali, Oriya, Punjabi, Quechua, Sesotho, Sinhalese, Swahili, Tamil, Tatar, Telugu, Tswana, Welsh, Xhosa, Yoruba and Zulu.

 

o  Core ZH-TW STANDARD Spacing ALL-ZHTW.rul

This rules replaces the previous rule: ZH-TW STANDARD Spacing ALL-ZHTW.rul and includes the functionality of the rule Core (Not French) STANDARD Spacing.rul.

o  Core ZH-TW STANDARD Duplicated Words [UL] ALL-ZHTW.rul

This rule replaces the previous rule ZH-TW STANDARD Literals - Duplicated Words ALL-ZHTW.rul and includes the functionality of the rule Core ALL STANDARD Duplicated Words [UL].rul

 

·         Updated rules (23):

 

o  New Core EN STANDARD Number Format 1,000.0 to 1 000,0 [UL].rul (updated)

o  New Core EN STANDARD Number Format 1,000.0 to 1.000,0 [UL].rul  (updated, previously named Core EN STANDARD Number Format 1,000.0 to 1.000,0.rul (updated)

 

o   EN STANDARD Literals – Numbers.rul (updated, previously named Core EN STANDARD Literals – Numbers.rul)

This rule has now been removed from the Core. Checks that any numeric value (sequence of digits) in the source segment is present in the target segment and that any numeric value present in the target segment is also present in the source. NOTE: This rule does not check for the correct thousand or decimal separators.

 

o   New Core DE STANDARD Number Format EN-DE [UL].rul

o   New Core ES STANDARD Number Format EN-ES [UL].rul

o   New Core FR STANDARD Number Format EN-FR [UL].rul

o   New Core IT STANDARD Number Format EN-IT [UL].rul

New changes have been introduced to make this rule even more accurate and to reduce false positives further.

 

o   Core ALL STANDARD URLs [UL].rul

Checks that any URL or e-mail address in the source segment is present in the target segment. Use external User lists to define Exceptions: 1. Use Whole_String_SOURCE_URL_Blockers.txt and Whole_String_TARGET_URL_Blockers.txt to specify precisely full item matched by the LTB rule that should be excluded. 2. Use Partial_String_SOURCE_URL_Blockers.txt and Partial_String_TARGET_URL_Blockers.txt to specify any portion of a URL/address that should be blocked. This blocks ANY partial match so use with CARE! The SOURCE blocker lists only prevent matches in source segments, and the TARGET blocker lists only prevent matches in target segments; the same rule looks in both segments, so you generally will need to put an entry in a blocker list for both segments to prevent false positives.

 

o   Core ALL STANDARD Duplicated Words [UL].rul

Updated to remove further false positives for French. Also Chinese (Traditional) and Chinese (Hong Kong) have been removed from the rule properties. Use the external user support list DupWordsPhrases_Blockers.txt list to define exceptions. (NOTE: Entries in the list file are CASE-SENSITIVE; be sure to enter any common case variants needed.)

 

o  XX STANDARD Detect Corrupt Chars or Wrong Lang EN-XX.rul (x2 languages)

Detects possible corrupt characters in target (and to a limited extent in source) segments for Armenian and Vietnamese.

o  XX STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul (x2 languages)

Detects possible corrupt characters in monolingual Armenian and Vietnamese.

 

o  ES STANDARD Monolingual Detect Old Spelling.rul

o  ES STANDARD Detect Old Spelling ALL-ES.rul

 Introduced new check on the usage of the % sign.

 

o  ES STANDARD Common Errors ALL-ES.rul

 New checks added.

 

o  FI STANDARD Rude_words_and_vulgarisms ALL-FI.rul

o  FI STANDARD Additional_grammar_and_punctuation_check ALL-FI.rul 

New checks have been added to these rules.

 

o  ZH-TW STANDARD PoliCheck ALL-ZHTW.rul

o  ZH-TW STANDARD Offensive Terms ALL-ZHTW.rul

o  ZH-TW STANDARD Punctuation EN-ZHTW.rul

o  ZH-TW STANDARD Forbidden CHS Terms All-ZHTW.rul

These rules has been reviewed to reduce the number of false positives.

Dictionaries

 

·         Updated dictionaries (6):

 

o   Albanian.xpl

o   English (UK).xpl

o   Finnish.xpl

o   French (Canada).xpl

o   Malay.xpl

o   Norwegian (Bokmal).xpl

 

·         New dictionaries (1):

o   Luxembourgish.xpl

 

Documentation

·     Help file has been updated with changes related to this release.

·     Information about the functionality and usage of the rules installed with the Term and Punctuation has been updated with changes related to this release, a new field “User Lists” has been added to this table to indicate those .rul files that reference User lists: http://autoupdate.lionbridge.com/ltb/_LTB_Installation_Rules_Description.html  

·     Previous Release Notes are available at: http://autoupdate.lionbridge.com/ltb/PreviousReleaseNotes.htm

Known Issues

There are updates for Known Issues related to this release: http://autoupdate.lionbridge.com/ltb/KnownIssues.htm 

___________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox ver. 2.44 - Release Notes

October 31st, 2011

Contents

Changes

Defects fixed

Support Files Updates

Core Configuration

Term and Punctuation Rules

Entity Extactor Rules

Dictionaries

Documentation

Known Issues

 

We are very pleased to announce a new release of Linguistic ToolBox ver. 2.44.  In this release, we are tackling one of the options in LTB that usually has produced a large number of false positives: Segment Checker> Check if source equals target. This is a very useful check that detects instances of untranslated segments, but until now, we have not had any method to avoid false positives for this check other than blocking locked segments or matches equal or above to a certain percentage. In this new release we are introducing for the option Check if source equals target, two new Ignore options to allow users to easily block false positives for those segments that need to stay untranslated as per the instructions.

 

In addition, we have also carried out critical updates to the rules for Punctuation and Spacing, where the number of false positives have now been reduced drastically. We highly recommend our users to update any configuration files that use any of the Punctuation and Spacing rules installed by default with LTB.

 

If you have any technical questions or in case you identify any bugs, please contact us.

Changes

 

·   New Ignore options for Check if source equals Target.  These two options allow users to easily block false positives.

 

o   Ignore list: enables users to ignore segments that must remain unstranslated by using a one column list of the segments that need to be ignored (Unicode .txt).This option is very convenient if the segments that need to be ignored don’t follow a certain pattern or for users unfamiliar with the creation of rules for Linguistic ToolBox. Note that each list entry must contain the whole segment that is supposed to be ignored, and that these entries are case-sensitive.

 

o   Ignore rule: enables users  to ignore segments that must remain unstranslated by using a rule containing either regular expressions or list entries in a rule format. The rules created for this option, must have content in the Source column and must use the Error mode. This option is very useful to ignore those segments that follow a certain pattern, or where entries need to be case-insensitive.

 

·   Segment Checker Ignore options: The Segment Checker Ignore options that are common to the main features of this component have been moved under the Ignore Options box. Note that the functionality remains the same, this is only a cosmetic change in the UI.

 

·   New Language added: Spanish (United States).  Spanish (United States) has now been added to the list of Source and Target languages.

 

Defects fixed

·         Issue with how LTB used to process new line metacharacters. LTB used to replace soft carriage returns, \n and \r by a space when processing the segments that contained these metacharacters, as a consequence any Term and Punctuation checks related to these metacharacters were not working correctly for certain file types. Now, LTB when processing these segments does not change these instances by a space.

 

·         Issue with LTB not processing the translatable text included within an image tag This issue is now fixed and the translatable text is now processed correctly in image tag, below an example:

<img class="shadow" src="//www.lionbridge.com/help//images/example.jpg" alt="This is an example image">

 

·         Issue when a rule referenced an empty list. LTB used to produce an unhandled exception when a rule referenced an empty external list, now this has been fixed.

 

•      Issue when a importing rules from the Linguistic Editor template into a .rul file. Previously LTB added in certain occasions additional quotes when importing from the Linguistic Editor template, these additional quotes prevented the rules from finding any matches during execution.

Support Files Updates

Core Configuration

 

·         Updated: The file LTB_Core_Config.zip has been updated with these changes:

·         Term and Punctuation:

o  Core ALL STANDARD Duplicated Words.rul (updated)

o  CORE ALL STANDARD Punctuation.rul (updated)

o  Core ALL (Not French) STANDARD Spacing.rul (new rule, previously named: Core ALL (Not French) STANDARD Punctuation space before.rul )

o  Core FR STANDARD Spacing ALL-FR.rul (new rule, previously named: Core FR STANDARD Punctuation space before ALL-FR.rul)

o  Core FR-CA STANDARD Spacing ALL-FRCA.rul (new rule, previously named: Core FR-CA STANDARD Punctuation space before ALL-FRCA.rul)

o  Core FR STANDARD Date Time EN-FR.rul (updated rule)

o  New Core DE STANDARD Number Format EN-DE.rul (updated rule)

o   New Core ES STANDARD Number Format EN-ES.rul (updated rule)

o   New Core IT STANDARD Number Format EN-IT.rul (updated rule)

Term and Punctuation Rules

 

·         New rules (46):

 

o   Core ALL (Not French) STANDARD Spacing.rul (new rule, previously named: Core ALL (Not French) STANDARD Punctuation space before.rul )

This rule contains all spacing checks from the previous version of the rule Core ALL STANDARD Punctuation. This rule now uses formatted segments to avoid false positives related to the processing of certain tags.

o   Core FR STANDARD Spacing ALL-FR.rul (new rule, previously named: Core FR STANDARD Punctuation space before ALL-FR.rul)

This rule contains all spacing checks from the previous version of the rule Core ALL STANDARD Punctuation. This rule now uses formatted segments to avoid false positives related to the processing of certain tags.

o   Core FR-CA STANDARD Spacing ALL-FRCA.rul (new rule, previously named: Core FR-CA STANDARD Punctuation space before ALL-FRCA.rul)

This rule contains all spacing checks from the previous version of the rule Core ALL STANDARD Punctuation. This rule now uses formatted segments to avoid false positives related to the processing of certain tags.

o   ALL STANDARD Strict Spacing.rul

This rule contains all spacing checks from the previous version of the rule ALL STANDARD Strict Punctuation. This rule now uses formatted segments to avoid false positives related to the processing of certain tags.

 

o   FR STANDARD Grammar ALL-FR.rul

Detects common grammatical issues in French.

 

o   FR STANDARD Negative-Affirmative Sentences EN-FR 

Detect instances where affirmative sentences may have been translated as negative and vice versa. For example, prevents situations where "yes" may have been translated as "no".

o   FR-CA STANDARD False Friends EN-FRCA.rul

Detects instances of common false  friends for French (Canada).

 

o   FR-CA STANDARD Equivalent Trade and Registered Marks.rul

Checks that registered marks, trademarks, copyright marks, service marks and sound recording copyright marks found in the source have been rendered in the Target with the equivalent symbol for French (Canada). ONLY USE this rule when the requirement for these marks is to use the French (Canada) equivalent in the Target. DO NOT Use in conjunction with the rule: ALL STANDARD Check Trade and Registered Marks.

 

o  NB STANDARD Common English words left in Target EN-NB.rul

Detects instances of common English words left in the translation. In order to avoid duplicated output, this rule should only be used when in the Spelling component the option Ignore untranslated segments is enabled. This rule is case insensitive

 

•         Microsoft Rules

 

o   XX CUSTOM Microsoft Style Guide - Fictitious Information (e-mails) EN-XX.rul (New rule x 29 languages)

Checks for compliance with the Microsoft Fictitious information guidelines regarding e-mails.

 

o   DA CUSTOM Microsoft Style Guide Punctuation EN-DA.rul

Checks for punctuation errors according to the Microsoft Style Guide for Danish for February 2011.

 

o   FI CUSTOM Microsoft Style Guide Capital letter after colon EN-FI.rul

o   FI CUSTOM Microsoft Style Guide Copyright symbols EN-FI.rul

o   FI CUSTOM Microsoft Style Guide Noun case suffixes EN-FI

o   FI CUSTOM Microsoft Style Guide Punctuation EN-FI.rul

Checks for errors on different topics according to the Microsoft Style Guide for Finnish.

 

o   ZH-HK STANDARD Offensive Terms ALL-ZHHK.rul

Checks for instances of offensive terms in Chinese (Hong Kong SAR) segments.

o   ZH-HK STANDARD PoliCheck ALL-ZHHK.rul

Checks for the most common geopolitical errors for Chinese (Hong Kong SAR).

 

·         Updated rules (21):

 

o   Core ALL STANDARD Punctuation.rul

All rules related to spacing have been removed from this rule and moved into the STANDARD Spacing rules. In addition, this rule has been fully reviewed to avoid duplicated output.

o   ALL STANDARD Strict Punctuation.rul

All rules related to spacing have been removed from this rule and moved into the STRICT Spacing rules. All duplicated rules in the ALL STANDARD Strict Punctuation rule and the Core ALL STANDARD Punctuation have been removed from this rule. Thus going forward, this rule needs to be ALWAYS USED IN CONJUNCTION with the Core ALL STANDARD Punctuation.

o  ES STANDARD Punctuation ALL-ES.rul

This rule has been updated to add a Spanish specific check with regards to the conjunction “, y” that was previously placed in the ALL STANDARD STRICT Punctuation.rul.

o FR STANDARD Punctuation ALL-FR.rul

 

o  Core ALL STANDARD Duplicated Words.rul

In this release we have improved the exceptions for French: “vous vous” and have added an exception for Catalan: “a més a més”. We have added to this rule an external list to define exceptions: DupWordsPhrases_Blockers.txt list (NOTE: Entries in the list file are CASE-SENSITIVE; be sure to enter any common case variants needed and keep a copy of this list with a different name in order for this list not to be overwritten with the LTB autoupdate.)

 

o   New Core DE STANDARD Number Format EN-DE.ru

o   New Core ES STANDARD Number Format EN-ES.rul

o   New Core IT STANDARD Number Format EN-IT.rul

New changes have been introduced to make this rule even more accurate and to reduce false positives further.

 

 

o  ALL STANDARD Common English words left in target EN-ALL

Properties updated.

 

o  Core FR STANDARD Date Time EN-FR.rul

This rule has been updated to correct a typo in rule ID. 40

 

 o DA STANDARD Common English words left in Target EN-DA.rul

Detects instances of common English words left in the translation. In order to avoid duplicated output, this rule should only be used when in the Spelling component the option Ignore untranslated segments is enabled. This rule is case insensitive.

 

o   IT STANDARD Common Errors ALL-IT.rul

New entries have been added and new exceptions have been defined to make the rule more accurate.

o   IT STANDARD False Friends EN-IT.rul

Updated with new entries to avoid false positives related to the words: million and billion.

o   IT STANDARD Grammar ALL-IT.rul

Added new common grammar errors (especially to detect concordance typos).

o   IT STANDARD Punctuation EN-IT.rul

 All checks from this rule already included in the Core ALL Standard Punctuation.rul and in the new Core ALL (Not French) STANDARD Spacing.rul have now been removed to avoid duplication.

 

o   JA STANDARD CommonErrors ALL-JA.rul

 

o  ZH-TW STANDARD Common Typos ALL-ZHTW.rul

o  ZH-TW STANDARD PoliCheck ALL-ZHTW.rul

o  ZH-TW STANDARD Forbidden CHS Terms All-ZHTW.rul

o  ZH-TW STANDARD Offensive Terms ALL-ZHTW.rul

o  ZH-TW STANDARD Punctuation EN-ZHTW.rul

These rule has been reviewed to reduce the number of false positives.

Entity Extractor Rules

 

·         New rules (3):

 

o  EN CUSTOM Microsoft Fictitious Company Names and URLs.rul

Detects instances of Microsoft Fictitious Company Names/Product Names or URLs as per the Microsoft Fictitious Company Name list.

o  EN CUSTOM Microsoft Fictitious E-mails

Detects instances of e-mail addresses in the source  as per the guidelines from: Fictitious Email Addresses.xls (Microsoft Fictitious information regarding e-mails).

o  EN CUSTOM Microsoft Fictitious Proper Names.rul

Detects instances of Fictitious people names in Microsoft Content from these two lists:

1.    International fictitious Names List

2.    Fictitious people names secondary list

Localization teams must only use names from the two lists provided.

Dictionaries

·         Updated dictionaries (3):

o   Finnish.xpl

o   French.xpl

o   French (Canada).xpl

Documentation

·     Help file has been updated with changes related to this release.

·     Information about the functionality and usage of the rules installed with the Term and Punctuation has been updated with changes related to this version: http://autoupdate.lionbridge.com/ltb/_LTB_Installation_Rules_Description.html

___________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox ver. 2.43 Update 2 - Release Notes

September 5th, 2011

Contents

Changes

Defects fixed

Support Files Updates

Core Configuration

Term and Punctuation Rules

Dictionaries

Documentation

Known Issues

 

We are pleased to announce a new release of Linguistic ToolBox ver. 2.43 Update 2. This update includes new fields in LTB Statistics tab for the word count excluding Ignored segments, fixes for important defects and a number of new and updated support files.

If you have any technical questions or in case you identify any bugs, please contact us.

Changes

·   New fields in the LTB Statistics tab. The LTB Statistic tab now shows these new fields indicated below. Please note that now you can see the number of segments and word count including and excluding Ignore segments. Ignore segments on this context are those segments that have been ignored by these options: Ignore locked segments or marked as non-translatable and Ignore matches equal or above.

 

o   No. of segments: Including and Excluding Ignored segments.

o   No. of source words: Including and Excluding Ignored segments.

o   No. of target words: Including and Excluding Ignored segments.

o   Ratio False Positives to Source word count: Including and Excluding Ignored segments.

o   Ratio False Positives to Total No. Entries: Including and Excluding Ignored segments.

 

Defects fixed

·         Issue when processing Microsoft LocStudio input files with a Date filter applied. LTB produced this error message: "String was not recognized as a valid Date Time."

·         Issue with LTB duplicating checks when processing Microsoft LocStudio .Lsproj files. These .Lsproj files only contain references to .lcl files and as a consequence when processing these files through LTB the checks on .lcl files used to be incorrectly duplicated.

·         Issue with the Linguistic Editor’s Target Reference Mode and Source Reference Mode not taking into account case-sensitivity with the MCase option.

·         Issue with LTB not using the Default report path set in File>Preferences>Output when loading a configuration file.

Support Files Updates

Core Configuration

 

·         Updated: The file LTB_Core_Config.zip has been updated with these changes:

o   Term and Punctuation:

o   New Core ES STANDARD Number Format EN-ES.rul (new rule)

o   New Core IT STANDARD Number Format EN-IT.rul (new rule)

o   New Core DE STANDARD Number Format EN-DE.rul (updated rule)

o   New Core FR STANDARD Number Format EN-FR.rul (updated rule)

o    Core EN STANDARD Number Format 1,000.0 to 1.000,0.rul (updated rule)

Term and Punctuation Rules

 

·         New rules (3):

 

o   New Core ES STANDARD Number Format EN-ES.rul

o   New Core IT STANDARD Number Format EN-IT.rul

New core rule for number checks. Checks that numeric values are present with the correct format (with thousands or decimal separators) in the target segment and that any numeric value present in the target segment is also present in the source. This rule has been updated to carry out more exhaustive checks and to include language specific information to make the rules more accurate and to reduce false positives.

 

o   DA STANDARD Common English words left in Target EN-DA.rul

Detects instances of common English words left in the translation. In order to avoid duplicated output, this rule should only be used when in the Spelling component the option Ignore untranslated segments is enabled. This rule is case insensitive.

 

·         Updated rules (12):

 

o   ALL STANDARD Spacing.rul

Checks for extra or missing spaces in formatted files.

                                                                          

o   ALL STANDARD Common English words left in Target  EN-ALL.rul

o   FR STANDARD Common English words left in target EN-FR.rul

Updated to be case insensitive.

 

 

o   ES STANDARD Detect Old Spelling ALL-ES.rul (previously named: ES STANDARD Spanish Spelling Reform ALL-ES.rul)

o   ES STANDARD Negative-Affirmative Sentences EN-ES.rul

 

o   New Core DE STANDARD Number Format EN-DE.rul

o   New Core FR STANDARD Number Format EN-FR.rul

 

o   Core EN STANDARD Number Format 1,000.0 to 1.000,0.rul

Updated to remove all variants of Spanish and Italian from the list of languages for this rule.

 

·         Microsoft Rules

 

o    JA CUSTOM Microsoft MacUI (MS Style Guide ver.6).rul

Detects the Windows UI strings that are different from Mac UI strings according to the Microsoft Style Guide ver. 6.

 

o   JA CUSTOM Microsoft Style Guide ver.6.rul

Checks compliance with the Microsoft Style Guide Requirements (Style Guide 6). Note that most unchanged rules are still based on Style Guide 5.

 

o   JA CUSTOM Microsoft Character Spacing (MS Style Guide ver.6).rul

Updated to remove false positives when LTB detects editorial symbol and delimiter.

 

o   JA CUSTOM Microsoft Katakana (MS Style Guide ver.6).rul

Updated to revise the incorrect description in error log written out.

Dictionaries

·         Updated dictionary (1):

o   Italian.xpl

Documentation

·     Help file has not been updated with changes related to this release.

·     Information about the functionality and usage of the rules installed with the Term and Punctuation has been updated with changes related to this version: http://autoupdate.lionbridge.com/ltb/_LTB_Installation_Rules_Description.html

·     Previous Release Notes are available at: http://autoupdate.lionbridge.com/ltb/PreviousReleaseNotes.htm

Known Issues

There are updates for Known Issues related to this release: http://autoupdate.lionbridge.com/ltb/KnownIssues.htm 

___________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox ver. 2.43 - Release Notes

August 3rd, 2011

Contents

New Features

Defects fixed

Support Files Updates

Core Configuration

Entity Extractor Rules

Term and Punctuation Rules

Dictionaries

Documentation

Known Issues

We are very pleased to announce a new release of Linguistic ToolBox ver. 2.43. In this version we have added a new Quality Control Component: the LTB Random Sampling. This new component automatically generates a random bilingual sample of a selected number of words to be used for LQI. This version also includes some fixes for important defects and a large number of new and updated support files. Please pay particular attention to the new updated rules for the Microsoft account.

If you have any technical questions or in case you identify any bugs, please contact us.

New Features

·        New component: LTB Random Sampling. The LTB Random Sampling component automatically generates a bilingual sample of a selected number of words to be used for LQI or other purposes. This functionality can be used either on its own or combined with pre-defined LTB checks that are executed on the automatically generated sample to provide the reviewer with additional information on potential linguistic issues.

Defects fixed

·         Issue with spell-checking on monolingual files. Previously users got this error message when trying to carry out the spell-checking on monolingual files: “Input files, target language [None] is not installed in Microsoft Word.”

·         Issue with LTB temporary dictionaries created to save “correct” and “incorrect” words. These dictionaries were case-insensitive instead of case-sensitive.

·         Issue with LTB not saving the Filter using previous report report within a configuration file.

·         Issue with how  support “lists” were converted to regular expressions.

Support Files Updates

Core Configuration

 

·         Updated: The file LTB_Core_Config.zip has been updated with these changes:

o   Term and Punctuation:

o       New Core DE STANDARD Number Format EN-DE.rul (new rule)

o       New Core FR STANDARD Number Format EN-FR.rul (new rule)

o       Core EN STANDARD Number Format 1,000.0 to 1.000,0.rul (updated rule)

o       Core EN STANDARD Number Format 1,000.0 to 1 000,0.rul (updated rule)

Entity Extractor Rules

 

·         New rules (1):

 

o   ALL STANDARD Segmentation Checker.rul

Checks for the most common type of segmentations issues in files prepared for translation: articles found at the end of the segment, instances of letters in initial uppercase not at the beginning of the sentence and possible sentence fragment (begins with lowercase, ends with period) and highlights situations where potential sub-segmentation may be required.

Term and Punctuation Rules

 

·         New rules (10):

 

o   New Core DE STANDARD Number Format EN-DE.rul

o   New Core FR STANDARD Number Format EN-FR.rul

New core rule for number checks. Checks that numeric values are present with the correct format (with thousands or decimal separators) in the target segment and that any numeric value present in the target segment is also present in the source. This rule has been updated to carry out more exhaustive checks and to include language specific information to make the rules more accurate and to reduce false positives.

 

o   ALL STANDARD Spacing.rul

Checks for extra or missing spaces in formatted files.

 

o   FR STANDARD Common English words left in target EN-FR.rul

Detects instances of common English words left in the translation. This rule has been modified to eliminate false positives caused by words that are the same in English and French.

 

o   IT STANDARD Grammar ALL-IT.rul

Detects common grammar issues for Italian.

 

·         Microsoft Rules

 

o   ALL CUSTOM Microsoft Style Guide TradeMarked Names.rul

Checks that trademarked names and the name Microsoft Corporation remain unlocalized, as per the list of Microsoft trademarks (updated on June 15th, 2011) available here.

 

o   NB CUSTOM Microsoft Style Guide EN-NB.rul

Checks compliance with the Microsoft Style Guide Requirements for Norwegian Bokmal (Updated on March 31st, 2011).

 

o   SV CUSTOM Microsoft Style Guide Fictitious Information EN-SV.rul

Checks for compliance with the Microsoft Style Guide regarding Sev. 1 - Fictitious Information

 

o   SV CUSTOM Microsoft Style Guide Genitive EN-SV.rul

Checks for compliance with the Microsoft Style Guide regarding Sev. 1 - Genitive.

 

o   SV CUSTOM Microsoft Style Guide Negative Affirmative EN-SV.rul

Checks the Swedish translation for compliance to Microsoft Style Guide regarding Sev. 1 - Negative and affirmative sentences.

 

·         Updated rules (19):

 

o   IT STANDARD Common Errors ALL-IT.rul

Specific checks for Grammar issues have been removed from this list and can be found now in the IT STANDARD Grammar ALL-IT.rul. Additional checks for common spelling mistakes have been added.

 

o   ES STANDARD Common Errors ALL-ES.rul

o   ES STANDARD Detect Old Spelling ALL-ES.rul (previously named: ES STANDARD Spanish Spelling Reform ALL-ES.rul)

o   ES STANDARD Grammar ALL-ES.rul

o   ES STANDARD Mistranslations EN-ES.rul

o   ES STANDARD Monolingual Detect Old Spelling.rul (previously named: ES STANDARD Monolingual Spanish Spelling Reform.rul)

o   ES STANDARD Negative-Affirmative Sentences EN-ES.rul

o   ES STANDARD Rude Words and Vulgarisms ALL-ES.rul

These seven rules for Spanish have been updated with feedback from users to remove false positives and new checks have been added.

 

o   FI STANDARD Rude_words_and_vulgarisms ALL-FI.rul

o   FI STANDARD Additional_grammar_and_punctuation_check ALL-FI.rul

 

o   EN STANDARD Detect Corrupt Chars or Wrong Lang ALL-EN.rul

o   ES STANDARD Detect Corrupt Chars or Wrong Lang EN-ES.rul

o   FI STANDARD Detect Corrupt Chars or Wrong Lang EN-FI.rul

These thee rules now check for sequences of corrupt characters and they exclude false positives related to the degree sign or to Greek letters.

 

o   Core EN STANDARD Number Format 1,000.0 to 1.000,0.rul

Updated to remove German from the list of languages for this rule.

o   Core EN STANDARD Number Format 1,000.0 to 1 000,0.rul

Updated to remove French from the list of languages for this rule.

 

·         Microsoft Rules

 

o   JA CUSTOM Microsoft MacUI (MS Style Guide ver.6).rul

Detects the Windows UI strings that are different from Mac UI strings according to the Microsoft Style Guide ver. 6.

 

o   JA CUSTOM Microsoft Style Guide ver.6.rul

Checks compliance with the Microsoft Style Guide Requirements (Style Guide 6). Note that most unchanged rules are still based on Style Guide 5.

 

o   JA CUSTOM Microsoft Character Spacing (MS Style Guide ver.6).rul

Updated to remove false positives when LTB detects editorial symbol and delimiter.

 

o   JA CUSTOM Microsoft Katakana (MS Style Guide ver.6).rul

Updated to revise the incorrect description in error log written out.

 

·         Deleted rules (1):

 

o    JA_CUSTOM_MS_TermStudio_EN_JA.rul

 Rule deleted due to large volume of false positives. Alternative rule will be provided in the near future to check for key Microsoft Terminology for Japanese.

Dictionaries

 

·         New dictionaries (1):

o   EN(UK)_Travel_&_Leisure.xpl

 

·         Updated dictionaries (6):

o   ES_Life Sciences.xpl

o   Finnish.xpl

o   French (Canada).xpl

o   Galician.xpl

o   Italian.xpl

o   Spanish.xpl

Documentation

·      Help file has been updated with changes related to this release.

·      Information about the functionality and usage of the rules installed with the Term and Punctuation is available but has not been updated with changes related to this version: http://autoupdate.lionbridge.com/ltb/_LTB_Installation_Rules_Description.html

·      Previous Release Notes are available at: http://autoupdate.lionbridge.com/ltb/PreviousReleaseNotes.htm

Known Issues

There are no updates for Known Issues related to this release: http://autoupdate.lionbridge.com/ltb/KnownIssues.htm

___________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox ver. 2.42 Update 5- Release Notes

June 2nd, 2011

Contents

Enhancements. 1

Defects fixed. 1

Support files Updates. 1

Core Configuration. 1

Term and Punctuation Rules. 1

Dictionaries. 2

Documentation. 2

Known Issues. 2

We are pleased to announce a new release of Linguistic ToolBox ver. 2.42 Update 5. In this version we have fixed important defects in relation to the parsing of LTB, especially for XLIFF file formats and have also released a large number of new and updated Term and Punctuation rules and dictionaries.

If you have any technical questions or in case you identify any bugs, please contact us once you have checked with your Site Expert through Service Desk.

Enhancements

·         XHTM and XHTML file extensions are now automatically identified in the Add Input files tab, under Type.

Defects fixed

·         Issue with LTB not processing XLIFF files when the group <group id=“x”> appears before <trans-unit>.

·         Issue with LTB not parsing correctly XLIFF from Idiom containing merged segments under certain particular conditions.

·         Issue with LTB parser not working correctly for the Change Report component when input files contains Tracked Changes.

·         Issue in the Segment Checker, the option Ignore CaMelWords was not working correctly when used in conjunction with the option Check if source equals target.

·         Issue with LTB producing an unhandled exception when processing files that contain segments in the header or footer.

Support files Updates

Core Configuration

 

·         Updated: The file LTB_Core_Config.zip has been updated with these changes:

o   Term and Punctuation:

§  CORE TH STANDARD Detect Redundant Punctuation.rul (new rule)

§  Core ALL STANDARD Tags Generic.rul (updated)

 

o   Spelling: New dictionaries have been added for these three languages: Albanian, Malay and Galician.

Term and Punctuation Rules

 

·         New rules (11):

 

o  CORE TH STANDARD Detect Redundant Punctuation.rul

Detects unnecessary punctuation in Thai including instances of a full stop at the end of sentence.

o  ES CUSTOM Microsoft Country Names EN-ES.rul

Checks that country names have been correctly translated from English into Spanish, as per the country names approved by Microsoft in TermStudio (May 3rd, 2011).

o  DE STANDARD Formal Style ALL-DE.rul

Detects instances of usage of formal style in German. This rule is applicable for those cases where the guideline is to use the "personal" you (du) in German rather than the "formal" you (Sie).

o  DE STANDARD STRICT Formal Style.rul

Detects instances of usage of formal style in German. This rule is applicable for those cases where the guideline is to use the "personal" you (du) in German rather than the "formal" you (Sie). This rule is stricter than the DE STANDARD Formal Style version.

o  DE STANDARD Grammar.rul

Detects common grammatical issues in German.

o  BG STANDARD Grammar.rul

Detects common grammatical issues in Bulgarian.

o  ZH-CN STANDARD Space Between Single Byte and Double Byte.rul

Detects instances of space between single byte character and double byte characters. Some accounts requires that there should be NO space between English character and Chinese character.

o  ZH-HK STANDARD TW terms appear in HK translation ALL-ZHHK.rul

Detects instances of terms in Chinese (Traditional) in Chinese (Hong Kong) content.

o  ZH-TW STANDARD HK terms appear in TW translation ALL-ZHTW.rul

Detects instances of Chinese (Hong Kong) terms in Chinese (Traditional) content.

o  ZH-TW STANDARD Forbidden CHS Terms All-ZHTW.rul

Checks for forbidden Chinese (Simplified) Terms in Chinese (Traditional) content.

o  ZH-TW STANDARD Punctuation EN-ZHTW.rul

Checks for common punctuation errors for Chinese (Traditional).

 

·         Updated rules (12):

 

o   Core ALL STANDARD Tags Generic.rul

Updated to remove false positives when the order of the tags has been altered.

o   ALL STANDARD Software Checks.rul

Updated to remove false positives related to ampersands not used as hotkeys.

o   ES CUSTOM Microsoft Style Guide EN-ES.rul

Updated according to the latest changes in the Microsoft Style Guide in March 2011.

o   ES CUSTOM Microsoft Geopolitical Issues EN-ES.rul

Updated as per the latest Policheck output received from Microsoft.

o   ES STANDARD Monolingual Spanish Spelling Reform.rul

o    ES STANDARD Spelling Reform ALL-ES.rul

Updated to add more exhaustive checks for the Spanish Spelling Reform.

o    ES Standard Spelling Errors ALL-ES.rul

Updated to remove the check for typos related to the usage of “se/sé”.

o   ES STANDARD Negative-Affirmative Sentences EN-ES.rul

Small update to deal also with mistranslations related to cannot / cannot / can’t.

o   ES STANDARD Neutral Spanish ALL-ES.rul

Updated to eliminate false positives related to variations of the verb “coger”.

o   ES STANDARD Rude Words and Vulgarisms ALL-ES

Rule related to the term "cas" has been removed.

o   ES STANDARD Common Errors ALL-ES.rul

Updated with new entries.

o   ZH-TW STANDARD Common Typos ALL-ZHTW.rul

Updated with new entries.

Dictionaries

 

·         New dictionaries (3):

o   Albanian.xpl

o   Malay.xpl (Brunei Darussalam)

o   Galician.xpl

 

·         Updated dictionaries (3):

o   Spanish.xpl

o   Hindi.xpl

o   DE_LifeSciences.xpl

Documentation

·      Help file has not been updated with changes related to this release.

·      Information about the functionality and usage of the rules installed with the Term and Punctuation is available but has not been updated with changes related to this version: http://autoupdate.lionbridge.com/ltb/_LTB_Installation_Rules_Description.html

·      Previous Release Notes are available at: http://autoupdate.lionbridge.com/ltb/PreviousReleaseNotes.htm

Known Issues

There are no updates for Known Issues related to this release: http://autoupdate.lionbridge.com/ltb/KnownIssues.htm 

___________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox ver. 2.42 Update 4- Release Notes

March 24th, 2011

Contents

 

Changes

Defects fixed

Support files updates

Documentation

Known Issues

 

 

We are pleased to announce a new release of Linguistic ToolBox ver. 2.42 Update 4. In this version we have added four new languages to the list of languages for Linguistic ToolBox (Bosnian (Latin), Bosnian (Cyrillic), Lingala and Malagasy). In addition, we have fixed important defects in relation to the XLIFF Parser and we have improved the tokenization process for the spelling component to make the Spelling component more efficient and prevent false positives related to parsing issues.

If you have any technical questions or in case you identify any bugs, please contact us once you have checked with your Site Expert.

·         For Lionbridge users, please use Service Desk.

·         For non Lionbridge users, you can contact us at GTLTBSupport@lionbridge.com.

Changes

·         New Languages.  Four new languages have been added to the list of languages of Linguistic ToolBox: Lingala, Malagasy, Bosnian (Latin) and Bosnian (Cyrillic). 

Defects fixed

·         Issue with LTB incorrectly merging segments in the LTB Output reports when processing XLIFF files. This issue happened when groups of segments contain a “merge-trans” value.

·         Issue with LTB not being able to parse particular types of XLIFF files with split/merged segments. When trying to process these files, LTB reported this error message: “Object reference not set to an instance of the object”.

·         Issue with LTB XLIFF parser not parsing correctly Xliff files without header and nested groups. In this situation LTB produced this error message: “Sequence contains no elements”.

·        Issue with LTB parsing for XLIFF files not displaying TAG marks to replace internal XLIFF formatting tags after the parsing has been completed.

·         Issue with LTB not processing specific XLIFF files. In these situations, LTB produced this error message: “Unexpected end tag. Line 1, position 3".

·         Issue with LTB parser for RTF incorrectly removing hyperlinks.

·         Issue with LTB parsers merging incorrectly words when there was no space in between tags. Now the LTB tokenization process for spelling has been changed to follow a double tokenization pass to address this issue and make the spell-check more efficient.

·        Issue in the Input tab, when clicking on the Add Folder button some users got an unhandled exception and were not able to use this functionality.

·         Issue in the Input files tab, when adding a folder containing Microsoft LocStudio files and selecting under the Type column: LCX, LTB did not process any of the files.

·         Issue with LTB rules that contained reserved “XML” characters in the Description field. Whenever Entity Extractor or Term and Punctuation rules contained any of the reserved XML characters in the Rule Description field such as: &, <, >, etc. whenever the rule was added the user would get an error messaging: “Invalid rule or rule not well formed”.

·        Issue in the Source Analysis component, when the path to the exceptions or known cases lists was incorrect, the error message displayed was not correct.

Support files updates

There are no updates to support files with this release.

Documentation

·       Help file has not been updated with changes related to this release.

·       Information about the functionality and usage of the rules installed with the Term and Punctuation is available at: http://autoupdate.lionbridge.com/ltb/_LTB_Installation_Rules_Description.html

·       Previous Release Notes are available at: http://autoupdate.lionbridge.com/ltb/PreviousReleaseNotes.htm

Known Issues

The Known issues have been updated: http://autoupdate.lionbridge.com/ltb/KnownIssues.htm 

___________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox ver. 2.42 - Release Notes

January 20th, 2011

Contents

 

New Features

Enhancements

Defects fixed

Support files updates

Core Configuration

Term and Punctuation Rules

Entity Extractor Rules

Dictionaries

Documentation

Known Issues

 

 

We are pleased to announce a new release of Linguistic ToolBox ver. 2.42. In this version we have implemented a workaround to be able to ignore Idiom ICE Matches and have introduced a new field to filter by date when using Microsoft LocStudio files as input. LTB is now able to deal correctly with hotkeys for Asian languages that follow this format (&H) or (&amp;H), this change will reduce greatly the number of false positives related to hotkeys. We have also made a number of important enhancements to the LTB parsers for these file formats (Idiom, XLIFF, tab-separated file formats, XML file formats and DITA) and also, to the Change Report.

If you have any technical questions or in case you identify any bugs, please contact us once you have checked with your Site Expert.

·         For Lionbridge users, please use Service Desk.

·         For non Lionbridge users, you can contact us at GTLTBSupport@lionbridge.com.

 

 

New Features

·         Idiom ICE Matches are automatically assigned a 200% match in the LTB output reports.  Idiom has an equivalent to our “Context Matches” called ICE (which stands for In-Context Exact match). In some cases, these are set to Locked when exported from Idiom World Server, but that isn’t technically necessary and isn’t always the customer’s preference. Sometimes, there are thousands of these ICE segments that are NOT locked but should be ignored for LTB checks. These ICE Matches are automatically assigned by LTB a 200% match, you can ignore them by using the option Ignore matches equal or above to 200%.

 

·         LocStudio Files – Filter Settings now include a new option to Filter by Date: Modified After.

 

Enhancements

·         Hotkeys.  LTB now also identifies hotkeys also when they follow this format (&H;) or (&amp;H) for these components QUICK, Segment Checker (Ignore hotkeys) and  Term and Punctuation (Ignore hotkeys (unformatted segments) and Speller.

 

·         The items on the LocStudio Files Filter Setting now appear ordered alphabetically.

·         Filter using previous report functionality has been enhanced to also work for situations where the order of the columns Source, Translation, Issue and Ignore? may have been altered.

 

·         The issues reported by the QUICK component have been renamed as follows:

 

o   "Target term not found in Glossary" has been renamed as "Target Term not found in reference"

o   "Multiple Targets found in glossary" has been renamed as "Multiple Targets found in reference"

o   "Source not found in glossary" has been renamed as "Source not found in reference"

 

·        The <menucascade> element from XML DITA files is parsed by LTB in the same way as DITA parsers.

 

·         When processing XLIFF files from Idiom, LTB output used to display the Idiom placeholders, for example {11}, {12}, etc. Now we have enhanced the parsing of LTB to ensure that these placeholders going forward are displayed as tags following the same behavior of standard XLZ or TMX file formats.

 

·         Change Report functionality has been enhanced to automatically ignore differences related to leading and trailing spaces in source and target segments.

 

Defects fixed

·         Issue with LTB displaying this warning message each time that LTB was run without using one or more of the Quality Control Components.

·         The warning message "This workbook contains links to other data sources" each time LTB runs without using one or more of the Quality Control Components, does not appear any longer.

·         Issue with Lionbridge internal users of Windows 7 (64 bits) not being able to use the Linguistic Editor from the Term & Punctuation component.

·         Issue in QUICK when using the option Report unmatched together with a Custom Tag Pattern. When these two options were used together QUICK did not report any unmatched entries.

·         Issue in QUICK when using the option Custom Tag Pattern, in the LTB Output Report the Formatted Source  and Formatted Translation columns did not show any format.

·         Issue with over segmentation in tab separated file formats (.xls, .csv, .txt) due to abbreviations using a period at the end. This issue produced a number of false positives in different components. This change only affects Excel file formats.

·         Issue with LTB not parsing correctly separator tags.  Previously LTB did not manage correctly separator tags (for example, <br>, <cite>, <code> or <div>) if these tags contained attributes.

·         Issue with LTB not parsing correctly <ph> tags for XLIFF files, as a consequence some internal text was incorrectly processed by LTB. Now both the LTB XLIFF Parser and the TMX parser substitute <ph> tags with ^TAG^.

·         Issue with the RTF Parser when dealing with extended characters. When extended characters were found in unsegmented text, the LTB Parser for RTF did not convert these extended characters into their equivalent Unicode character.

·        Issue with LTB not processing segments included in between <tu> tags that don’t have any attributes (for example, <tu id:XX> or <tu srclang:en>)

·         Issue with LTB not parsing correctly a Trados file.

·         Issue with LTB not ignoring segments that appear "crossed out" in the .XLIFF Editor. These "crossed out" segments are segments that have been expanded or shrunk, and should be ignored.

·         When parsing several tags without spaces in between LTB generates some false positives due to concatenations in certain situations.

·         Issue with LTB not ignoring &nbsp; entities when parsing Idiom XLZ files.

·         Issues in Change Report:

o   Align file names functionality has been restored.

o   Error messages related to added or removed segments in relation to the original file set have been rewritten for clarity purposes.

o   When selecting the option Ignore matches equal or above to: 101%, Change Report only applied this ignorability option to the reviewed files and not to the original files.

·                In the XML to TMX Converter, the tag  <nbsp /> was incorrectly removed by LTB when converting Helium files into TMX. With this fix, LTB will replace this tag with a space in the TMX.

·                Issue with LTB falsely reporting some entries as inconsistent with the Clean for MT (Beta) tool.

 

Support files updates

Core Configuration

·      Updated: The file LTB_Core_Config.zip has been updated with these changes:

 

o   In the Term and Punctuation:

o   CORE ALL STANDARD Punctuation.rul has been updated to remove these two checks below to reduce the output:

 

§  Punctuation: Extra or missing space at start of segment (Rule ID: 24)

§  Punctuation: Extra or missing space at end of segment (Rule ID: 25)

 

These two checks (Rule ID: 24 and 25) are still available in the ALL STANDARD Strict Punctuation.rul.

 

o   Core FR STANDARD Punctuation space before ALL-FR.rul

We have added a new check to ensure that there is always a space before the % sign for French. Previously named: Core FR STANDARD Punctuation ALL-FR.rul

o   Core FR-CA STANDARD Punctuation space before ALL-FRCA.rul

Rule ID: 2 has been modified as it reported incorrectly as issues situations when there was no space before semi colon (;). Previously named: Core FR-CA STANDARD Punctuation ALL-FRCA.rul

 

o   In the Segment Checker, the options below have been enabled:

§  Ignore case and punctuation

§  Ignore end punctuation

§  Ignore alphanumeric only segments

Term and Punctuation Rules

 

·      New rules:

 

o   ALL STANDARD Common English words left in target EN-ALL.rul

 

Detects instances of common English words left in the translation.

 

o   ES STANDARD Negative-Affirmative Sentences EN-ES.rul

o   NL STANDARD Negative-Affirmative Sentences EN-NL.rul

 

Both rules above, detect instances where affirmative sentences may have been translated as negative and vice versa. For example, prevents situations when "yes" may have been translated as "no".

 

o   ES STANDARD Spanish Spelling Reform ALL-ES.rul

o   ES STANDARD Monolingual Spanish Spelling Reform ALL-ES.rul

 

Checks for adherence to the changes in the Spanish Spelling Reform in bilingual and monolingual files. The Spanish Spelling Reform was approved by the 22 Academies of the Spanish Language on 28th November 2010.

 

o   ES STANDARD Passive Voice ALL-ES.rul

Extracts passive verb forms from target segments.

o   JA CUSTOM Microsoft Style Guide ver. 6.rul

Japanese custom list for Microsoft, containing the style rules adherent to Japanese Style Guide 6. Note that most unchanged rules are still based on SG5.

 

o   JA CUSTOM Microsoft MAC UI (Microsoft Style Guider ver. 6).rul

Detects the Windows UI strings that are different from Mac UI strings.

 

o   JA CUSTOM Microsoft Katakana (Microsoft Style Guide ver. 6).rul

Japanese custom list for Microsoft, containing rules of the elongation mark usage and the spaces within a compound noun for Katakana letters adherent to Japanese Style Guide 6. Note that most rules have not been changed since SG5 and this newly includes the new cho-on rules.

 

o   JA CUSTOM Microsoft Character Spacing (MS Style Guide ver. 6).rul

Japanese custom list for Microsoft, containing the character spacing rules adherent to Japanese Style Guide 6.Fully compatible with Style Guide 5.Note that in most case you cannot apply this rule against software strings containing such as placeholders and tagged RTF files, unless you turn on "Use Formatted Segments" option in the properties window of the rule. This rule replaces JA_CUSTOM_MS_CharSpacing_SG5.rul.

 

o   RU STANDARD DateTime EN-RU.rul

Checks that dates/times in RU target segment correspond to EN source and that they follow the correct format. Please note: If date format should not match Russian date format, exclude this rule (i.e. if 11/9/07, acc. to style guide, should be 9 ноября 2007 года, not 09/11/2007). 1-го октября and suchlike, ranges like 5 или 6 июня are not accounted for. Jan 18 should be 18 января, else an error is added to LTB report. 'May' in source should be capitalized to match (to avoid false positives with the modal verb 'may').

 

o   Core FR STANDARD Punctuation space before ALL-FR.rul

We have added a new check to ensure that there is always a space before the % sign for French. Previously named: Core FR STANDARD Punctuation ALL-FR.rul

o   Core FR-CA STANDARD Punctuation space before ALL-FRCA.rul

Rule ID: 2 has been modified as it reported incorrectly as issues situations when there was no space before semi colon (;). Previously named: Core FR-CA STANDARD Punctuation ALL-FRCA.rul

 

o   DE CUSTOM Microsoft Common Errors EN-DE.rul

Renamed rule. Previously named: DE_Microsoft_Errors_en-de.rul

 

·      Updated rules:

o   ALL STANDARD Strict Punctuation.rul

These two new checks have been introduced as they have been removed from the rule CORE ALL STANDARD Punctuation.rul:

§  Punctuation: Extra or missing space at start of segment (Rule ID: 24)

§  Punctuation: Extra or missing space at end of segment (Rule ID: 25)

 

o   CORE ALL STANDARD Punctuation.rul

These two checks have been removed in order to reduce the output produced by the Core configuration:

§  Punctuation: Extra or missing space at start of segment (Rule ID: 24)

§  Punctuation: Extra or missing space at end of segment (Rule ID: 25)

These two checks (Rule ID: 24 and 25)  are still available in the ALL STANDARD Strict Punctuation.rul.

o   IT STANDARD Common Errors ALL-IT.rul

o   IT STANDARD False Friends EN-IT.rul

These two rules have been updated with a view to reduce false positives.

 

o   RO STANDARD Detect Corrupt Chars or Wrong Lang EN-RO.rul

o   RO STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

These two rules have been modified to add Romanian (Romania) target language.

 

o   SV STANDARD Detect Corrupt Chars or Wrong Lang EN-SV.rul

o   SV STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

These two rules have been modified to accept the character é as a valid Swedish character.

 

·      Deleted rules:

o   Core FR STANDARD Punctuation ALL-FR.rul

o   Core FR-CA STANDARD Punctuation ALL-FRCA.rul

o   DE_Microsoft_Errors_en-de.rul

The three rules above have been deleted as they have been renamed for clarity purposes.

o   JA_CUSTOM_MS_Katakana.rul

o   JA_CUSTOM_MS_StyleGuide5.rul

The three rules above have been deleted as they have been updated to comply with the Microsoft Style Guide for Japanese ver. 6.

 

Entity Extractor Rules

 ·      New rules:

 o   EN STANDARD Cross References.rul

Detects instances of inconsistent cross references.

 

o   EN STANDARD Procedural Sentences.rul

Detects instances of procedural sentences.

o   ES STANDARD Passive Voice.rul

Extracts passive verbs from source segments.

Dictionaries

·      New dictionaries:

o   EN(US)_Life Sciences.xpl

 

·      Updated dictionaries:

o   ES_Life Sciences.xpl

o   Finnish.xpl

o   French (Canada).xpl

o   FR_Life_Sciences_Medical_devices_&_specialities.xpl

Documentation

·       Help file has been updated with changes related to this release.

·       The document with information about the functionality and usage of the rules installed with the Term and Punctuation component have been updated: http://autoupdate.lionbridge.com/ltb/_LTB_Installation_Rules_Description.html

·       Previous Release Notes are available at: http://autoupdate.lionbridge.com/ltb/PreviousReleaseNotes.htm

Known Issues

The Known issues have been updated: http://autoupdate.lionbridge.com/ltb/KnownIssues.htm 

___________________________________________________________________________________________________________________________________________

 

Lionbridge Linguistic ToolBox ver. 2.40 Update 1 – Release Notes

October 21, 2010

Contents

Enhancements

Defects fixed

Support files updates

Term & Punctuation Rules

Entity Extractor Rules

Dictionaries

Documentation

 

We are pleased to announce a new release of Linguistic ToolBox ver. 2.40 Update 1. In this version we have implemented fixes for several defects and we have added new and updated rules for both the Term & Punctuation component and the Entity Extractor.

If you have any technical questions or in case you identify any bugs, please contact us once you have checked with your Site Expert.

·         For Lionbridge users, please use Service Desk.

·         For non Lionbridge users, you can contact us at GTLTBSupport@lionbridge.com.

 

Enhancements

 

·         In the LTB output report there has been some changes in the QUICK component and in the Spelling Component.  In the QUICK tab the column “Comments” has been renamed to “Issues” and in the Spelling tab, there have been two changes, the column “Misspelled word” has been renamed to “Match” and a new column has been created called “Issue” to contain Spelling errors and Grammar Errors.

·         LTB Helper has been updated with changes related to the above enhancement.

 

Defects fixed

·         Issue with some specific LocStudio filters not appearing in the Settings file.

·         Issue with non-Latin characters not displaying correctly in LocStudio filters in the Settings file.

·         Issue with .Lspkg files when the first string was empty an unhandled exception used to occur

·         Issue with the data from the formulas in the LTB Statistics tab not displaying correctly for Office 2003 users.

·         Issue with LocStudio locked strings being incorrectly reported as Segments with empty target.

·         Issue with the Segment Checker option: Ignore Camel Words incorrectly identifying some words that were not CaMel words. This was a consequence of mixing up LogoportMarkup styles with normal text within a translation unit.

·         Issue with LTB incorrectly processing a type of locked segments in Idiom XLIFF files when the Segment Checker option: Ignore locked segments or marked as non translatable was enabled. This issue only happened for files that used the attribute node: “lock_status” for each unit.

·         Issue with LTB incorrectly considering legacy segments for the Segment Checker checks when the option Ignore matches equal or above to was enabled.

 

Support files updates

Term and Punctuation Rules

 

·      New rules:

o  ALL STANDARD Detect Wrong Variable Order.rul

Checks that the variables/placeholders in the translation follow the original order in the source. This rule works for these types of variables: %s1, %(w)d and  %1.

o  RU STANDARD Country Names EN-RU.rul

Checks that country names have been correctly translated from English into Russian. PLEASE NOTE: Use the option: Ignore untranslated segments in the Term and Punctuation Checker to use in documents with addresses where country names could have been left in English.

·      Updated rules:

o   NB CUSTOM Microsoft EN-NB.rul 

Entity Extractor Rules

 ·      New rules:

o  EN STANDARD Chat and Text Abbreviations.rul

Detects instances of common chat and text abbreviations, such as: LOL, TTYL, FYI, etc.

o  EN STANDARD Rude Words.rul

Detects instances of rude words and vulgarisms in English.

 

o   ALL STANDARD Monolingual Duplicated Words.rul

Checks for duplicated words in monolingual files or in the source for bilingual files.

·      Updated rules:

o   ALL STANDARD Expanded Acronyms.rul

Dictionaries

 

·      Updated dictionaries:

o   Norwegian (Bokmål).xpl

Documentation

·       Help file has not been updated with changes related to this release.

·       Previous Release Notes are available at: http://autoupdate.lionbridge.com/ltb/PreviousReleaseNotes.htm

Known Issues

The most up-to-date information on Known issues can be found at: http://autoupdate.lionbridge.com/ltb/KnownIssues.htm

 

___________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox ver. 2.40 – Release Notes

September 21, 2010

Contents

New Features

Changes

Enhancements

Defects fixed

Support files updates

Core Configuration

Rules

Dictionaries

Documentation

 

We are pleased to announce a new release of Linguistic ToolBox ver. 2.40. In this version we have implemented support for the new Microsoft® Localization Studio file formats.  We have also included new ignorability options and functionality with a view to reduce the number of false positives. And in relation to our efforts to adapt LTB even more for Machine Translation tasks, we have created two brand new tools to assist with MT preparation tasks.

In addition, as usual, there are many other enhancements and bug fixes as part of this release; see below for more details.

IMPORTANT NOTE:  The installation requirements have changed and LTB users must install .Net Framework 4 prior to installing/updating to ver. 2.40. This is a mandatory requirement. Please note that .Net Framework 4.0 has not been tested internally yet to ensure compatibility with other applications and this may cause unexpected issues.

If you have any technical questions or in case you identify any bugs, please contact us once you have checked with your Site Expert.

·         For Lionbridge users, please use Service Desk.

·         For non Lionbridge users, you can contact us at GTLTBSupport@lionbridge.com.

 

New Features

·         Support for the new Microsoft LocStudio file formats (LCL, LCX and Lspkg). Thanks to this development, there is no need to export to.CSV file format to use the new LocStudio file formats with LTB. As part of this new release, we have also included a new feature to enable users to apply filters to the LocStudio files (Input files tab>Settings). Please note that you can also use LocStudio new file formats as QUICK glossaries for ensuring consistency between UI and UA and in the Change Report to compare two versions of the same LocStudio files.

·         New Ignorability options in the Quality Control components:

o   Ignore untranslated segments (option added to the QUICK component and Term & Punctuation)

o   Ignore segments with empty target (option added to the Term & Punctuation component)

o   Ignore hotkeys (option added to the Segment Checker and Term & Punctuation component). This option only deals with the standard type of hotkeys, indicated with &, such as, for example: &Organizer. This option does not deal with the new LocStudio hotkeys indicated with an underscore, for example: Move Dow_n.

o   Ignore end punctuation (option added to the Segment Checker). The overall performance of this option has been improved.

Please note that the previous Segment Checker option>Ignore hotkeys and end punctuation has now been replaced with two separated options: Ignore hotkeys and Ignore end punctuation. Both of these options only affect the inconsistency checks.

·         Dictionaries for multiple language variants included within multilingual configuration files now will behave as follows: For example if you have a multilingual configuration file containing dictionaries for Spanish and Spanish (Mexico), the behavior will be as follows:

o   If you select Spanish (Argentina) as your Target Language under Project options, the dictionary loaded by default in the Spelling tab would be Spanish.xpl (as there is no matching dictionary for the Spanish (Honduras) language variant.

o   If you select Spanish (Mexico) as your Target Language under Project options, the dictionary loaded by default in the Spelling tab would be Spanish (Mexico).xpl (as there is a match with the language variant of the dictionary).

·         Clean for MT Tool (Beta). This tool cleans-up and prepares bilingual translation memories with a view to use these resources to train Statistical Machine Translation Systems (SMTS). This tool automatically performs the most common type of checks required to prepare legacy TMs for Machine Translation, for example: segment length, direct and reverse consistency, tag checks, etc. With the Clean for MT tool, we obtain two different types of output:

o   "correct" output (correct.tmx): this is the output with potentially correct bilingual material for its direct use for SMTS.

o   "incorrect" output (inconsistent.tmx, revinconsistent.tmx and rule_name.tmx ): this is the output with wrong alignments or potentially wrong segments, so these can be reviewed, edited or discarded if necessary.

·         Extract TMX Tags Tool (Beta). This tool extracts from translation memories in TMX 1.1 and TMX 1.4b file formats internal tags and internal group tags, reporting the following information for each tag: Tag name, Count, Type of Tag, Content surrounded by the tag and Examples. In addition, the report provides a summary of the total number of tags and the total number of unique tags in both source and target segments.

Changes

·         Installation requirements have changed: LTB users must install .Net Framework 4 prior to installing/updating to this new version. This is a mandatory requirement.

·         User Interface of LTB has been made larger in order to support fonts in different languages. This new UI should be used in conjunction with a DPI Setting of 96.

·         In the LTB Installer references to the Logoport Word Client have been modified with references to Translation Workspace Microsoft Word plug-in.

·         Languages.xml installation file has been updated to ensure consistency with the ISO Language codes used by the LTB Glossary Converter script.

·         Two new input languages have been added to LTB: English (Singapore) and English (India).

 

 

Enhancements

·         In the LTB output report now you can identify under the Status column, segments locked or marked as non-translatable. In the LTB output report these segments are identified following this convention: Blocked – match percentage, for example: “Blocked-100%”.

·         LTB Statistics tab is now available by default in the generated LTB output report. Previously this tab was only visible in the LTB output report once the LTB Helper had been activated. Now, this tab is available by default and the statistical information is automatically populated by either using the LTB Helper or by entering manually Ignore or Mark under the Ignore column in the LTB output report.

·         Show additional Source/Translation columns in the report. This new option under File>Preferences>Output displays in the LTB output the Formatted Source and Formatted Translations columns for all the Linguistic ToolBox quality control components, independently on whether the option Use formatted segments is enabled or not.

·         The LTB Helper’s overall performance in relation to Excel and other add-ins has been improved. To work with the LTB Helper, once you have opened the LTB output report in Excel, click Start/Stop LTB Helper button and when you have finished, click Start/Stop LTB Helper.

·         LTB now does not report inconsistent translations when the segments that produce the inconsistency are segments with empty target. Please note that going forward the only way to detect these issues are by using the Segment Checker option, Check if target is empty.

·         The overall performance of the Segment Checker options: Ignore alphanumeric only segments and Ignore case and punctuation have been improved.

·         In the Input files tab, when you click on Add Folder, now you can also copy and paste to the folder path. This enhancement is particularly useful for Windows 7 users, as the option to drag and drop folders is not available.

·         New option in the XLS Properties window: Use these settings on all workbooks: This option applies the same settings to all the Excel files in the Input Files tab, QUICK tab and Change Report tab

 

Defects fixed

·         Issue with LTB not processing Idiom .xlz files containing a .tbx glossary file within. Now LTB processes these files correctly without producing the error message "Data descriptor signature not found".

·         Issue with LTB producing sometimes noise when carrying out spell check using the option Ignore Internet and file addresses.

·         Issue when working with multilingual configuration files in Windows 7. When working with very large configuration files not all the relevant support files were loaded according to the languages selected.

·         Issue with the QUICK component causing noise entries when dealing with entries in the QUICK glossary that contains brackets that are not in Latin encoding.

·         Issue in the XLS Properties dialog, when scrolling up and down the files producing an unhandled exception error.

·         Issue with the clean-up of the Source and Translation columns in the LTB output report not being done consistently for the Segment Checker.

·         Issue with LTB output report not showing for some components the columns Formatted Source and Formatted Translation.

·         Issue with the LTB Helper automatically saving LTB output reports under My Documents folder. After using the LTB Helper, if you selected Save in Excel (instead of Save As), the LTB output report saved under My Documents instead of being saved at the default location.

Support files updates

Core Configuration

·      LTB_Core_Config.zip

 

The Core Configuration file has been updated to include this change: under Term & Punctuation, the new options Ignore segments with empty target and Ignore untranslated segments are enabled by default.

 

Rules

 

·      New rules:

o   ALL STANDARD Capitalization at the beginning of the segment or string EN-ALL.rul

Checks for capitalization mismatches between source and target at the beginning of the segment or string. This rule is valid for Latin script and Cyrillic script languages. Please note that this rule is not applicable for those languages for which capitalization does not exist.

o   ALL STANDARD Check Trade and Registered Marks.rul

Checks that the number of registered marks ®, trademarks ™, copyright marks ©, service marks℠ and sound recording copyright marks ℗ found in the source and target match. It also checks if there are any of these marks found in source and not in the target and the opposite.

o   DA STANDARD Spanish-Danish Translation Errors ES-DA.rul

Detects common translation errors including many false friends from Spanish into Danish. Applicable to the Travel and Tourism domain.

 

 o   DA STANDARD Common Errors ALL-DA.rul

Checks for common errors such as use of one/two words, use of uppercase, general style in texts with Danish target.

 

o   FR CUSTOM Microsoft Style Guide EN-FR.rul

Checks for common Microsoft stylistic guidelines for French, according to the Microsoft Style Guide ver. 1.

·      Updated rules:

o   ES CUSTOM Microsoft EN-ES.rul (Previously named: ES STANDARD MICROSOFT EN-ES.rul)

o   ES STANDARD Grammar Errors ALL-ES.rul

o   ES STANDARD Common Errors ALL-ES.rul

o   ES STANDARD Spelling Errors ALL-ES.rul

 

o   NB CUSTOM Microsoft EN-NB.rul (Previously named: NB_CUSTOM_Microsoft_EN-NB.rul)

o   NB STANDARD Common Errors EN-NB.rul (Previously named: NB_STANDARD_General_EN-NB.rul)

 

o   NN STANDARD Common Errors EN-NN.rul (Previously named: NN_STANDARD_General_EN-NN.rul)

 

Dictionaries

 

·      Updated dictionaries:

o   French (Canada).xpl

o   Norwegian (Bokmål).xpl

o   Slovak.xpl

Documentation

·       Help file has been updated with changes related to this release.

·       Previous Release Notes are available at: http://autoupdate.lionbridge.com/ltb/PreviousReleaseNotes.htm

Known Issues

The most up-to-date information on Known issues can be found at: http://autoupdate.lionbridge.com/ltb/KnownIssues.htm

___________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox ver. 2.37 Update 1 - Release Notes

April 22nd, 2010

Contents

Changes

Defects fixed

Support files updates

Core Configuration

Rules

Dictionaries

Documentation

Known Issues

 

We are pleased to announce a new release of Linguistic ToolBox ver. 2.37 Update 1. In this version we have implemented several bug fixes. If you have any technical questions or in case you identify any bugs, please contact us once you have checked with your Site Expert.

·         For Lionbridge users, please use Service Desk.

·         For non Lionbridge users, you can contact us at GTLTBSupport@lionbridge.com.

 

Changes

·    In the Spelling tab, under Ignore Options, the option Acronyms and words in all capitals now appears disabled by default.

 

Defects fixed

·     Unexpected error when executing Change Report with the option Report orphan segments enabled.

 

·     Unexpected error when adding a batch of .txt files through the Input files tab.

 

·     Previously LTB did not detect automatically .DOCX and .XLSX file types as .DOC and .XLS under the Type column in the Input files tab: as a consequence these files were not processed correctly unless the file type was manually changed from UNK into .DOC or .XLS; this is now done automatically by LTB. In addition, when users load input files that show UNK under the Type column in the Input files tab and try to execute LTB , LTB shows a warning message.

 

·    When creating multilingual configuration files, LTB did not allow you to save configuration files containing dictionaries or Term & Punctuation rules for languages different to the Source-Target language combination.  In this version, if you try to execute LTB when you have rules and dictionaries that don't match the source/target languages, LTB displays an error message. However, LTB allows you to save configuration files containing Term & Punctuation rules and dictionaries for languages others that those specified in the source/target language, this functionality is implemented in order to allow internal Lionbridge users to create multilingual configuration files.

 

·     When adding .xls or .xlsx input files with an empty first column, LTB only displayed the first and second columns but not the third and subsequent columns.

 

Support files updates

Core Configuration

·      LTB_Core_Config.zip

 

The Core Configuration file has been updated to include the French (Canada) dictionary and to include the updated Core rules below.

 

Rules

 

·      Updated rules:

 

The rules below have been updated to also detect instances of all uppercase words concatenated, for example: ALL_CAPS and to detect instances of combinations of uppercase and lowercase letters separated by underscores, for example: CaMel_Words. This rule has also been improved to fix a rare bug that made LTB to stop responding when these rules were used under specific circumstances and to solve an issue related to leading spaces.

 

§  Core ALL STANDARD Literals - CaMel.rul (Term & Punctuation)

§  ALL STANDARD CaMelWords.rul (Entity Extractor)

 

          The Language Properties of the Term & Punctuation rules below have been updated to include new languages recently added to LTB or to reflect changes in language naming conventions. The functionality of the rules below remains the same:

 

§  ALL (Not French) STANDARD Punctuation space before.rul

§  ALL STANDARD Strict Punctuation.rul

§  AR STANDARD Detect Corrupt Chars or Wrong Lang EN-AR.rul

§  BG STANDARD Detect Corrupt Chars or Wrong Lang EN-BG.rul

§  CA STANDARD Detect Corrupt Chars or Wrong Lang EN-CA.rul

§  CA STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

§  Core ALL (Not EN) STANDARD Literals - Numbers.rul

§  Core ALL (Not French) STANDARD Punctuation space before.rul

§  Core DE STANDARD Date Time EN-DE.rul

§  Core EN STANDARD Date Time EN-All.rul

§  Core EN STANDARD Literals - Numbers.rul

§  Core EN STANDARD Number Format 1,000.0 to 1 000,0.rul

§  Core EN STANDARD Number Format 1,000.0 to 1.000,0.rul

§  Core ES STANDARD Date Time EN-ES.rul

§  Core FR STANDARD Date Time EN-FR.rul

§  Core FR STANDARD Punctuation ALL-FR.rul

§  Core IT STANDARD Date Time EN-IT.rul

§  Core PT STANDARD Date Time EN-PT.rul

§  CS STANDARD Detect Corrupt Chars or Wrong Lang EN-CS.rul

§  DA STANDARD Detect Corrupt Chars or Wrong Lang EN-DA.rul

§  DE STANDARD Detect Corrupt Chars or Wrong Lang EN-DE.rul

§  EL STANDARD Detect Corrupt Chars or Wrong Lang EN-EL.rul

§  EN STANDARD Date Time EN-All.rul

§  EN STANDARD Detect Corrupt Chars or Wrong Lang EN-ENGB.rul

§  EN STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

§  EN-UK_STANDARD_CheckTargetforAmericanEnglish.rul

§  ES STANDARD Common Errors ALL-ES.rul

§  ES STANDARD Detect Corrupt Chars or Wrong Lang EN-ES.rul

§  ES STANDARD False Friends EN-ES.rul

§  ES STANDARD Grammar Errors ALL-ES.rul

§  ES STANDARD MICROSOFT Geopolitical Issues EN-ES.rul

§  ES STANDARD Mistranslations_EN-ES.rul

§  ES STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

§  ES STANDARD Neutral Spanish ALL-ES.rul

§  ES STANDARD Punctuation ALL-ES.rul

§  ES STANDARD Rude Words and Vulgarisms ALL-ES.rul

§  ES STANDARD Spelling Errors ALL-ES.rul

§  ET STANDARD Detect Corrupt Chars or Wrong Lang EN-ET.rul

§  EU STANDARD Detect Corrupt Chars or Wrong Lang EN-EU.rul

§  FA STANDARD Detect Corrupt Chars or Wrong Lang EN-FA.rul

§  FI STANDARD Detect Corrupt Chars or Wrong Lang EN-FI.rul

§  FIL STANDARD Detect Corrupt Chars or Wrong Lang EN-FIL.rul

§  FR STANDARD Detect Corrupt Chars or Wrong Lang EN-FR.rul

§  GL STANDARD Detect Corrupt Chars or Wrong Lang EN-GL.rul

§  HE STANDARD Detect Corrupt Chars or Wrong Lang EN-HE.rul

§  HI STANDARD Detect Corrupt Chars or Wrong Lang EN-HI.rul

§  HR STANDARD Detect Corrupt Chars or Wrong Lang EN-HR.rul

§  HU STANDARD Detect Corrupt Chars or Wrong Lang EN-HU.rul

§  HY STANDARD Detect Corrupt Chars or Wrong Lang EN-HY.rul

§  ID STANDARD Detect Corrupt Chars or Wrong Lang EN-ID.rul

§  IS STANDARD Detect Corrupt Chars or Wrong Lang EN-IS.rul

§  IT STANDARD Detect Corrupt Chars or Wrong Lang EN-IT.rul

§  JA STANDARD Detect Corrupt Chars or Wrong Lang EN-JA.rul

§  KK STANDARD Detect Corrupt Chars or Wrong Lang EN-KK.rul

§  KO STANDARD Detect Corrupt Chars or Wrong Lang EN-KO.rul

§  LT STANDARD Detect Corrupt Chars or Wrong Lang EN-LT.rul

§  LV STANDARD Detect Corrupt Chars or Wrong Lang EN-LV.rul

§  MS STANDARD Detect Corrupt Chars or Wrong Lang EN-MS.rul

§  NB STANDARD FalseFriends EN-NB.rul

§  NB STANDARD FalseFriends SV-NB.rul

§  NL STANDARD Detect Corrupt Chars or Wrong Lang EN-NL.rul

§  NO STANDARD Detect Corrupt Chars or Wrong Lang EN-NO.rul

§  PL STANDARD Detect Corrupt Chars or Wrong Lang EN-PL.rul

§  PT STANDARD Detect Corrupt Chars or Wrong Lang EN-PT.rul

§  RO STANDARD Detect Corrupt Chars or Wrong Lang EN-RO.rul

§  RU STANDARD Detect Corrupt Chars or Wrong Lang EN-RU.rul

§  SK STANDARD Detect Corrupt Chars or Wrong Lang EN-SK.rul

§  SL STANDARD Detect Corrupt Chars or Wrong Lang EN-SL.rul

§  SR (Cyrillic) STANDARD Detect Corrupt Chars or Wrong Lang EN-SR (Cyrillic).rul

§  SR (Latin) STANDARD Detect Corrupt Chars or Wrong Lang EN-SR (Latin).rul

§  SV STANDARD Detect Corrupt Chars or Wrong Lang EN-SV.rul

§  SV STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

§  TGL STANDARD Detect Corrupt Chars or Wrong Lang EN-TGL.rul

§  TH STANDARD Detect Corrupt Chars or Wrong Lang EN-TH.rul

§  TR STANDARD Detect Corrupt Chars or Wrong Lang EN-TR.rul

§  UK STANDARD Detect Corrupt Chars or Wrong Lang EN-UK.rul

§  UR STANDARD Detect Corrupt Chars or Wrong Lang EN-UR.rul

§  VI STANDARD Detect Corrupt Chars or Wrong Lang EN-VI.rul

§  ZH-CN STANDARD Detect Corrupt Chars or Wrong Lang EN-ZHCN.rul

§  ZH-TW STANDARD Detect Corrupt Chars or Wrong Lang EN-ZHTW.rul

 

·      Deleted rules:

 

The rule below has been deleted as its functionality was duplicated in this rule: Core ALL STANDARD Literals - CaMel.rul.

§  ALL STANDARD Literals – CaMel.rul

Dictionaries

 

·      New dictionary: French (Canada).xpl

Documentation

·       Help file has been updated with changes related to this release.

·       Previous Release Notes are available at: http://autoupdate.lionbridge.com/ltb/PreviousReleaseNotes.htm

Known Issues

The most up-to-date information on Known issues can be found at: http://autoupdate.lionbridge.com/ltb/KnownIssues.htm

___________________________________________________________________________________________________________________________________________

 

Lionbridge Linguistic ToolBox 2.37 Version Release Notes

March 30th, 2010

Contents

New Features. 1

Enhancements. 1

Defects. 2

Support files updates. 2

Term & Punctuation Rules. 2

Dictionaries. 2

Documentation. 3

Known Issues. 3

 

We are pleased to announce a new release of Linguistic ToolBox ver. 2.37. In this version we have implemented support for S-Tagger files from FrameMaker and now, we have made it possible for LTB to also process text within a text box in Word documents.

Together with this release, we are automatically installing for you the LTB Helper add-in. LTB Helper has been developed in order to reduce the time and effort required for Linguistic ToolBox users to distinguish valid issues from false positives in the Linguistic ToolBox output report. This tool automatically groups similar types of issues together in the LTB output reports to allow users to automatically add a comment under the Ignore? column of the reports to designate each entry as either Mark or Ignore. Ignore is used for the entries that are not real errors and therefore don’t need to be corrected in the translation. On the other hand, Mark is used for the entries that do need to be corrected in the translation. For more information about the LTB Helper add-in, please refer to the LTB Helper User Guide available in the Help folder of your LTB Installation.

In addition, we have developed some new enhancements and corrected several defects, more details below.

If you have any technical questions or in case you identify any bugs, please contact us once you have checked with your Site Expert.

·         For Lionbridge users, please use Service Desk.

·         For non Lionbridge users, you can contact us at GTLTBSupport@lionbridge.com.

New Features

·         Support for S-Tagger files from FrameMaker.

 

·         The LTB Helper add-in to help review the LTB output report is now automatically installed with the standard LTB installation. Any updates to the LTB Helper add-in will also be done through the LTB auto-update process. The LTB Helper User Guide can be found in the Help folder of the LTB Installation, usually at this location: C:\Program Files\Lionbridge\Linguistic ToolBox\Help

 

·         New tab in the LTB output report: LTB Statistics. This tab is only visible in the LTB output report once you have activated the LTB Helper add-in and used the Go to next Keys (blank) button. In this tab, you can see a summary of the total number of issues per component that have been commented as either Mark or Ignore.

 

·         CSV Properties dialog box detects automatically whether the text in .CSV or tab delimited .TXT input files is enclosed in between double quotes, single quotes or nothing under Field Enclosed In.

Enhancements

·         LTB now automatically processes text found within a text box in Word file formats.

 

·         LTB now only loads the correct generic dictionaries as per the Source/Target Language selected in the left hand panel when loading multilingual LTB configuration files with several generic .xpl dictionaries.

 

·         Lionbridge internal users can now save configuration files including the option Use MS Word spell checker for any language even if the Microsoft Word proofing tools for that language are not installed. Note that when executing the configuration file if users don’t have MS Word proofing tools for the relevant language installed, a warning message will pop up.

 

·         Logoport internal mark-up styles are no longer reported as “Target is empty - Unsegmented text” by the Segment Checker option Check if target is empty.  Please note that text in internal Logoport styles won’t be reported either by any of the other LTB components.

 

·         In the General Information tab, there's new information displayed: Total number of input files, Total number of segments, Total source words and Total target words.

 

·         New language added to the list of Source and Target languages: Montenegrin.

 

·         Once users install the Translation Workspace Client, they will see a new LTB Login screen: the Translation Workspace Server Login. This screen contains a new field called Tenant where users should enter the name of their Translation Workspace tenancy.

 

·         The Logoport tab has been renamed to Translation Workspace (File>Manage Configurations or File>Load Configurations).

 

Defects

·         Issue with bilingual checks in the Term & Punctuation component generating noise for unsegmented text or source segments with empty targets has been corrected. The only situation when source segments with empty targets will be taken into account by the Term & Punctuation checks is when the rule mode is Error and the check is to be carried out for the source segment.

 

·         Issue with the Segment Checker not detecting inconsistencies when the Diff word is repeated in both target segments but there is a different number of occurrences of the Diff word between each of the equivalent target segments. For example:

 

Source:    You can find more information about our tool in the Help files.

Target 1:  Puede encontrar más información sobre nuestra herramienta en los archivos de Ayuda.

Target 2:  Puede encontrar más información sobre nuestra herramienta en los archivos nuestra herramienta de Ayuda.

 

·          Issue with LTB producing sometimes noise when carrying out spell check for  XLF/XLZ, TTX and XML files because of the way LTB parses text surrounded by internal tags.  In the example below, LTB used to report a non-valid issue for Spelling:

 

Source: Select the customer and<cf italic="on">1 </cf>1select the <cf bold="on">2Functions</cf>2 Button and take the <cf bold="on">3Open Transaction Editing </cf>3option

Misspelled word: andselect

 

·         When executing the Term & Punctuation component, in some occasions Windows 7 users have received this error message: “Unhandled exception thrown. Input string was not in correct format”, this behavior is now corrected.

 

·         Configuration files previous to version 2.35 don’t contain the options Ignore matches equal or above, Use formatted segments and Ignore locked segments or marked as non-translatable, from version 2.37 these old configuration files will take these values from File>Preferences or Project Options.

 

·         When downloading LTB configuration files from Logoport some users received this message even though their permissions were correct: "You don't have permission to perform this action", this issue has now been corrected.

 

Support files updates

Core Configuration

·      LTB_Core_Config.zip

 

The Core Configuration file now contains all generic dictionaries. Remember that LTB now only loads the correct generic dictionaries as per the Target language selected in the left hand panel.

Term & Punctuation Rules

 

·       Updated rules:

 

§         ES STANDARD Mistranslations_EN-ES.rul

 

New entries have been added to this rule to detect common mistranslations in translations from English into Spanish.

Dictionaries

·       Updated Dictionaries:

§         Bengali.xpl

§         German.xpl

§         Spanish.xpl

 

·       New Dictionaries:

§         ES_Life Sciences.xpl

§         IT_Life Sciences.xpl

§         DE_Life Sciences.xpl

 

Custom dictionaries for the Life Sciences domain.

Documentation

·       Help file has been updated with changes related to this release.

·       Previous Release Notes are available at: http://autoupdate.lionbridge.com/ltb/PreviousReleaseNotes.htm

Known Issues

The most up-to-date information on Known issues can be found at: http://autoupdate.lionbridge.com/ltb/KnownIssues.htm

 

 ___________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox 2.36 Version Release Notes

February 2nd, 2010

Contents

Enhancements. 1

Defects. 1

Support files updates. 2

Documentation. 2

Known Issues. 2

 

We are pleased to announce a new release of Linguistic ToolBox ver. 2.36. In this version we have implemented some major improvements to the LTB tokenization in relation to spell-checking and have corrected many defects. If you have any technical questions or in case you identify any bugs, please contact us once you have checked with your Site Expert.

·         For Lionbridge users, please use Service Desk.

·         For non Lionbridge users, you can contact us at GTLTBSupport@lionbridge.com.

Enhancements

·   The LTB output report file extension has changed from .xls to .xml. Please note that the file format still remains XML as per in previous versions, only the extension has changed. Please remember that when working with large volumes of output it may be more convenient to resave the output report as .xls or .xlsx, as .xml file format is slower. However, when using your commented output report for the option Filter using previous report, ensure that the report is always saved as .XML Spreadsheet file format.

 

·   The options Report source sentences longer than and Report target sentences longer than [] bytes in the Segment Checker component, now allows you to specify the type of encoding of your files. You can select from the drop-down list the following options for byte counting:

·         bytes UTF-8

·         bytes UTF-7

·         bytes UTF-16 (Big Endian)

·         bytes UTF-16 (Unicode)

·         bytes UTF-32

 

·   When using LTB to carry out spell-checking using the option: Use Microsoft Word spell

checker with the Ignore Options: Acronyms and words in all capitals, Alphanumerics and Internet and file addresses. The LTB settings will be the ones considered by LTB independently of the equivalent settings that you may have selected for these options in Microsoft Word.

 

Defects

·   Issue with monolingual rules not working when used in the Term & Punctuation component:  This issue has been fixed in this version. However, please note that there will be additional noise in the reports for bilingual checks, as any unsegmented text or any source segments with empty target will be taken into account by the bilingual T&P checks now. This will be addressed in the next release.

 

·   Issues related to the option Filter using previous report. Previously LTB sometimes used to close unexpectedly while executing when the option Filter using previous report was selected. In addition, the functionality of this option sometimes did not work as expected. Both issues have now been corrected on this release. Now, LTB does not take into account file name when using this option or the information contained in the General Settings tab of the output report.

 

Please remember, when using the option Filter using previous report, ensure that your previous report is always saved in .XML Spreadsheet file format.

 

·      Issue with LTB displaying a message when processing Word files from a network location for every file asking whether you would like to open a Read-only copy is corrected.

 

·      Issue with unsegmented text in .TTX files not detected by the option Check if target is empty is corrected.

 

·   Issue with the alignment of segments in the Change Report component. Previously one target segment was aligned to all the reviewed segments that had the same source (one to many alignment), now the alignment of one target segment is only done with the first reviewed segment that has the same source (one to one alignment).

 ·   Incorrect message appearing for non-Lionbridge users: Previously in the non-Lionbridge user installation of LTB, after selecting File>New, users where prompted with this message: Do you want to save the changes made to the configuration before proceeding? This behavior was incorrect as non-Lionbridge users cannot save configuration files by design. If the user selected Yes, LTB crashed.
 

·   Issue with Entity Extractor Statistics tab of the LTB output report not showing any data is corrected.

 

·   The following tokenization issues related to spelling have been corrected:

 

·   Tokenization issue with hyphenated words - in the previous version each word part of a hyphenated term would be displayed as separated word entries in the Speller Missing words tab, now LTB reports hyphenated words as single entries in the output report.

 

This issue affected German spell-checking specially. In order to correct this issue we have changed the sequence of the Spelling component checks for hyphenated words.

 

·   LTB tokenization now takes into account the period at the end of an abbreviation as the period can be orthographically significant in certain cases. For example, previously LTB reported an issue when spell-checking the entry below, as the period was considered as a separator for our tokenization. Now, the period is only considered as separator when it indicates end of a segment.

 

Source: transportation departments

Target: Kfz-Zulassungsstellen bzw. Straßenverkehrsämter

Misspelled word: bzw

 

·   Tokenization issue with words that contain apostrophe – in the previous version, a word that contained an apostrophe would be incorrectly reported as a spelling mistake. For example, LTB used to report the following when performing spell-check for the word below:

 

Source: dell'altra parte

Misspelled word: dell

 

This issue affected Italian and French spell-checking specially. In order to correct this issue we have changed the sequence of the Spelling component checks for words that contain apostrophes.

 

·   Tokenization issue with the interpunct character – in the previous version, the interpunct was considered incorrectly as separator for tokenization. For example, LTB used to report the following when performing spell-check for words containing interpunct in Catalan:

 

Source: col·locar-lo

Misspelled word: col·locar

 

Support files updates

Term & Punctuation Rules

 

§       New rule for ALL languages:

 

§         ALL STANDARD Strict Literals - Acronyms.rul

 

This is a new stricter rule to check acronyms. Checks that acronyms found in the source and target match. It also checks if there are acronyms found in source and not in the target and the opposite.

 

·         Updated rules:

 

§         PT-BR STANDARD Detect Old Spelling ALL-PT-BR 1-12.rul

§         PT-BR STANDARD Monolingual Detect Old Spelling 1-12.rul

 

The two rules above have been updated with the most up to date vocabulary from the Brazilian Spelling Reform.

 

§         IT STANDARD Common Errors - ALL-IT.rul

 

IT STANDARD Rude Words-Vulgarisms ALL-IT.rulfrom rule ID: 22 “Common error: check correct spelling of the word approvvigionamento|i, approvvigionare,ecc”.

 

§         IT STANDARD Rude Words-Vulgarisms ALL-IT.rul

§         IT STANDARD Punctuation EN-IT.rul

 

The two rules above have been improved to eliminate noise.

 

§         HR STANDARD Detect Corrupt Chars or Wrong Lang EN-HR.rul

§         HR STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

§         SR (Cyrillic) STANDARD Detect Corrupt Chars or Wrong Lang EN-SR (Cyrillic).rul

§         SR (Cyrillic) STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

§         SR (Latin) STANDARD Detect Corrupt Chars or Wrong Lang EN-SR (Latin).rul

§         SR (Latin) STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

 

These six rules above for Croatian, Serbian (Cyrillic) and Serbian (Latin) have been improved to include support for the characters ô and â.

 

·         Deleted rules for ALL languages:

 

§         PT-BR STANDARD Detect Old Spelling ALL-PT-BR 1-11.rul

§         PT-BR STANDARD Monolingual Detect Old Spelling 1-11.rul

 

The two rules above have been deleted, as these rules have been replaced now with version 1-12.

 

Dictionaries

·       Updated Dictionary:

§         Spanish.xpl

·       New Dictionary:

§         RU_Computing_Software.xpl

 

Custom dictionary for the Computing Software domain for Russian.

Documentation

·       Help file has been updated with changes related to this release.

·       The documentation describing the functionality and usage of the rules from the Term & Punctuation component has also been updated: http://autoupdate.lionbridge.com/ltb/_LTB_Installation_Rules_Description.html

·       Previous Release Notes are available at: http://autoupdate.lionbridge.com/ltb/PreviousReleaseNotes.htm

Known Issues

The most up-to-date information on Known issues can be found at: http://autoupdate.lionbridge.com/ltb/KnownIssues.htm

 

___________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox 2.35 Version Update 1 on Dec 17th

We are pleased to announce a new release of Linguistic ToolBox ver. 2.35 Update 1. In this version, we have implemented the fixes below and in addition, we have radically improved the speed of Linguistic ToolBox when processing .XLZ files.

The issue below with .XLZ files has been corrected:

 

Sometimes, when executing LTB to process .XLZ files, LTB displays this error message in the Log tab: The remote server returned an error: (403) Forbidden.

Fixed

Issue corrected in this release.

The issue in the Linguistic Editor when you typed rule mode entries instead of selecting them from the drop-down list has been corrected.

 

For example, in the Linguistic Editor if you typed the mode: literal or LITERAL instead of Literal, the rule did not work.

Fixed

Issue corrected in this release

ALL STANDARD Software Checks.rul

Updated

This rule has been updated to fix the issue detected on this rule for Japanese: missing hotkeys and missing variables were not identified by the rule correctly.

___________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox 2.35 Support files Updates on Dec-16th

 

LTB_Core_Config.zip

Updated

·    Core ALL STANDARD Punctuation.rul has been updated to remove the check: Punctuation: Target ends with punctuation, but source doesn't.

·     New Rule for Japanese and Chinese: Core East Asian STANDARD Reverse Spell Check EN-CCJ.rul has been included in the Core (more details below).

·         Core ALL STANDARD Entity References.rul has been improved.

 

Russian.xpl

Updated

Updated dictionary (case-sensitive, 45454 words).

 

Russian_Computing.xpl

Deleted

Deleted from LTB installation due to duplication of content with generic dictionary.

 

Turkish.xpl

Updated

Updated dictionary (426600 words).

 

Gujarati.xpl

Updated

Updated dictionary (3727 words).

 

East Asian STANDARD STRICT Reverse Spell Check EN-CCJ.rul

New

Checks "English" words or phrases in target segment against source. STRICT check expects identical phrases, capitalization and spacing. Words with less than 2 letters are excluded from this check. This rule is valid for these languages: Chinese (Simplified), Chinese (Traditional) and Japanese. This rule is mutually exclusive with rule: Core East Asian STANDARD Reverse Spell Check EN-CCJ.rul; please use only one.

 

Core East Asian STANDARD Reverse Spell Check EN-CCJ.rul

New

Checks individual "English" words in target segment against source. CORE check is NOT case-sensitive, and allows missing final 's' in plurals.  Words with less than 2 letters are excluded from this check. This rule is valid for these languages: Chinese (Simplified), Chinese (Traditional) and Japanese. This rule is mutually exclusive with rule: East Asian STANDARD STRICT Reverse Spell Check EN-CCJ.rul; please use only one.

 

EA Standard Target English_EN-EA.rul

Deleted

This rule has been substituted with the two new rules above which have improved functionality: East Asian STANDARD STRICT Reverse Spell Check EN-CCJ.rul & Core East Asian STANDARD Reverse Spell Check EN-CCJ.rul.

 

ALL STANDARD STRICT Punctuation.rul

New

A new improved check to detect these instances: “Punctuation: Target ends with punctuation, but source doesn't” have been added to the previous functionality of the ALL STANDARD Punctuation.rul. This rule is mutually exclusive with the Core ALL STANDARD Punctuation.rul.

 

Core ALL STANDARD Punctuation.rul

Update

Rule ID: 42 (Punctuation: Target ends with punctuation, but source doesn't) has been removed in order to reduce the number of false positives. This check has now been to the ALL STANDARD Strict Punctuation. This rule is mutually exclusive with the ALL STANDARD Strict Punctuation.rul.

 

ALL STANDARD Punctuation.rul

Deleted

This rule has been replaced by ALL STANDARD STRICT Punctuation.rul.

 

Core ALL STANDARD Entity References.rul

Updated

This rule has been improved.

 

ALL STANDARD Entity References.rul

Deleted

This rule has been deleted as its functionality is duplicated in the rule Core ALL STANDARD Entity References.rul.

 

ALL STANDARD Tags.rul

Deleted

This rule has been deleted as its functionality is duplicated in the rule Core ALL STANDARD Punctuation.rul.

 

ALL STANDARD Tags HTML.rul

New

Checks that all tags are HTML tags and target corresponds to source (Note: this in no way replaces the need for engineering-level check)

 

ALL STANDARD Tags S-Tagger.rul

New

Checks that all tags are STF tags and target corresponds to source (Note: this in no way replaces the need for S-Tagger Verifier check)

 

ALL STANDARD Tags XHTML.rul

New

Checks that all tags are XHTML tags and target corresponds to source (Note: this in no way replaces the need for engineering-level check)

 

AR STANDARD Detect Corrupt Chars or Wrong Lang EN-AR.rul

Updated

Detects possible corrupt characters in bilingual Arabic files in target (and to a limited extent in source) segments.

 

AR STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

Updated

Detects possible corrupt characters in monolingual Arabic files.

 

CA STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

New

Detects possible corrupt characters in monolingual Catalan files.

 

EN STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

New

Detects possible corrupt characters in monolingual English files.

 

EU STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

New

Detects possible corrupt characters in monolingual Basque files.

 

FA STANDARD Detect Corrupt Chars or Wrong Lang EN-FA.rul

Updated

Detects possible corrupt characters in bilingual Farsi files in target (and to a limited extent in source) segments.

 

FA STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

New

Detects possible corrupt characters in monolingual Farsi files.

 

FIL STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

New

Detects possible corrupt characters in monolingual Filipino files.

 

GL STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

New

Detects possible corrupt characters in monolingual Galician files.

 

HE STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

Updated

Detects possible corrupt characters in monolingual Hebrew files.

 

HE STANDARD Detect Corrupt Chars or Wrong Lang EN-HE.rul

Updated

Detects possible corrupt characters in bilingual Hebrew files in target (and to a limited extent in source) segments.

 

HI STANDARD Detect Corrupt Chars or Wrong Lang EN-HI.rul

Updated

Detects possible corrupt characters in bilingual Hindi files in target (and to a limited extent in source) segments.

 

HI STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

Updated

Detects possible corrupt characters in monolingual Hindi files.

 

HY STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

New

Detects possible corrupt characters in monolingual Armenian files.

 

IS STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

New

Detects possible corrupt characters in monolingual Icelandic files.

 

KK STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

New

Detects possible corrupt characters in monolingual Kazakh files.

 

KK STANDARD Detect Corrupt Chars or Wrong Lang EN-KK.rul

New

Detects possible corrupt characters in bilingual Kazakh files in target (and to a limited extent in source) segments.

 

MS STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

New

Detects possible corrupt characters in monolingual Malay files.

 

TH STANDARD Detect Corrupt Chars or Wrong Lang EN-TH.rul

Updated

Detects possible corrupt characters in bilingual Thai files in target (and to a limited extent in source) segments.

 

TH STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

Updated

Detects possible corrupt characters in monolingual Thai files.

 

UR STANDARD Detect Corrupt Chars or Wrong Lang EN-UR.rul

Updated

Detects possible corrupt characters in bilingual Urdu files in target (and to a limited extent in source) segments.

 

UR STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

New

Detects possible corrupt characters in monolingual Urdu files.

 

VI STANDARD Detect Corrupt Chars or Wrong Lang EN-VI.rul

Updated

Detects possible corrupt characters in bilingual Vietnamese files in target (and to a limited extent in source) segments.

 

VI STANDARD Monolingual Detect Corrupt Chars or Wrong Lang.rul

New

Detects possible corrupt characters in monolingual Vietnamese files.

___________________________________________________________________________________________________________________________________________

 

Lionbridge Linguistic ToolBox 2.35 Support files Updates on Nov-23rd

ES STANDARD MICROSOFT EN-ES.rul

Updated

Incorrect Target languages in Properties have been removed.

Estonian.xpl

New

New dictionary (35001 words).

ZH-TW STANDARD Punctuation EN-ZHTW.rul

Updated

Checks punctuation problems in Traditional Chinese.

ZH-TW STANDARD Literals - Duplicated Words ALL-ZHTW.rul

Updated

Checks for duplicated words in Traditional Chinese, exceptions have been included to reduce false positives.

ZH-TW STANDARD Style Check ALL-ZHTW.rul

Updated

Checks common style problems (spacing and some others) in Traditional Chinese.

 ___________________________________________________________________________________________________________________________________________

 

Lionbridge Linguistic ToolBox 2.35 Version Release Notes

We are pleased to announce a new release of Linguistic ToolBox ver. 2.35. In this version we have implemented some major improvements to the parsing of LTB in relation to tags for many file formats, this will reduce considerably the amount of noise produced by the tool. We have also extended our support for files converted from SDLX (ITD files) and Idiom.

If you have any technical questions or in case you identify any bugs, please contact us once you have checked with your Site Expert.

·          For Lionbridge users, please use Service Desk.

·          For non Lionbridge users, you can contact us at GTLTBSupport@lionbridge.com

 

New Features

·      New User Interface:  The options Ignore matches equal or above, Use formatted segments and Ignore locked segments or marked as non-translatable now appear in the left-hand pane of Linguistic ToolBox. These options can also be found under File>Preferences, where you can set your preferred settings by default for these three options.

·      LTB Configuration files now store ignorability options: The options: Ignore matches equal or above, Use formatted segments and Ignore locked segments or marked as non-translatable now are saved within the LTB Configuration files.  LTB Output reports now show the LTB configuration file name used and also include the ignore options.

·      Description for rules will appear automatically now without having to access the T&P Linguistic Editor:  The description will appear in a pop-up window on the grid that displays when the user moves the mouse icon over the Term & Punctuation rules in the T&P tab.

·      Report sentence length based on bytes:  In the Segment Checker tab, new functionality has been added to report sentence length limits based on bytes.

·      New Languages added: English (Malaysia), Dari and Catalan (Valencia).

 Enhancements

·      LTB loads dictionaries from a configuration downloaded from Logoport: Now LTB generic dictionaries will be automatically referenced in the downloaded Logoport configuration. Customer-specific and domain-specific dictionaries will be stored in the LTB configuration.

 

·      *LTB support for .TTX files exported from SDLX (.ITD file format): Now we can process ITD files converted into .TTX format without any tagging issues.

 

·      Tagging issues with TMX exported from Idiom: Idiom placeholders are now handled correctly by the LTB parser.

 

·      SourceReference and TargetReference Modes issues:  Special RegEx characters are now escaped in the match text, before replacing the placeholders with them. Previously these two modes only used to report the first instance of a match, now they report all of them.

 

·      Issue on how entries under the Issue column in the report are highlighted under new tokenization: Previously, under the new tokenization rules, whole tokens were painted in the segments and included in the "Issue" column for each component. This was confusing users, as they saw strange entries in the output reports and consider them to be noise, when they were real errors. For example: for the following text: “<:crmk 9>Netsnoer “ LTB used to output this error:  “9>Netsnoer”.

 

·      *XML to TMX Converter supports Thai language.

 

·      Issue on the LTB Output Report on the Source Analysis tab: Previously, in the Entity Extractor Output tab, all the XML tags for font and color were not escaped and as a consequence, appeared in the report.

 

·      Parsing issues related to tags:

 

·       The parsing of HTML tags has been improved overall

 

·       Issue with <BR> tags:  <BR> tags were previously removed by LTB in the output reports and as a consequence, many words appeared incorrectly concatenated causing noise. For example: “<b>Musical Events<br>Branding at the Event</b>” was interpreted by LTB as: “Musical EventsBranding at the Event”. Now, LTB is substituting the <BR> tags with a space in the output reports.

 

·       Issue with space added in between consecutive tags: For all tagged formats, whenever a closing tag was immediately followed by an opening tag with no space in between, LTB wrongly added a space, creating false errors.

 

·       Issues with soft returns: LTB previously removed soft returns when parsing, going forward soft returns will be substituted with a space.

 

·       Extra space is added to source segment: In some circumstances, RTF files saved from Word 2003 add the \v code and additional spaces before and after it. The RTF parser was not prepared to handle this situation, and when the \v code was removed the spaces were left behind. Now, LTB RTF parser handles this situation correctly.

 

·       *<HR/> tags are now interpreted as a separator tag

  

Support files updates

Term & Punctuation Rules

·      Updated rule for ALL languages

 

§  ALL STANDARD Literals - Camels.rul

 

o   The functionality of this rule has been improved and now it searches for instances of initial lowercase followed by uppercase within the same word and for instances of more than 2 uppercase letters followed by lowercase within the same word.

 

§  Core ALL STANDARD Punctuation

 

o   A check to detect double period has been added to this rule.

 

§  Core EN STANDARD Number Format 1,000.0 to 1.000,0.rul

§  Core EN STANDARD Number Format 1,000.0 to 1 000,0.rul

§  Core ALL (Not EN) STANDARD Literals - Numbers.rul

§  Core EN STANDARD Literals - Numbers.rul

 

The four rules above have been improved to include the functionality of the rule CT STANDARD Numbers ALL-CT.rul that used to check instances of numbers found in target but not in the source segment. This rule has deleted from the current LTB installation.

 

·      Deleted rules for ALL languages:

 

§  CT STANDARD Numbers ALL-CT.rul

 

o   Rule deleted as this functionality is now included in the four core numeric rules described above.

 

§   ALL STANDARD Literals – Numbers.rul

 

o    Rule deleted as its functionality is already included in the four core numeric rules described above.

 

 

·      Updated rules for Finnish: All the rules below have been improved to reduce the number of false positives.

 

§  FI STANDARD Additional_grammar_and_punctuation_check ALL-FI.rul

§  FI STANDARD Rude_words_and_vulgarisms ALL-FI. Rul

 

 

·      Updated rule for Italian:

 

§  IT STANDARD Common Errors ALL-IT.rul

 

o   This rule has been improved to reduce the number of false positives.


 

·      Updated rule for Korean:

 

§  KO STANDARD Style ALL-KO.rul

 

o   A new check has been added to detect occurrences of em dash combining words in Korean; this usage of em-dash in Korean is not recommended.

 

 

·      Updated rules for Dutch: All the rules below have been improved to reduce the number of false positives.

 

§  NL STANDARD Common Errors ALL-NL.rul

§  NL STANDARD False Friends EN-NL.rul

§  NL STANDARD Fixed Expressions ALL-NL.rul

§  NL STANDARD Style Issues ALL-NL.rul

 

·      Deleted rules for Dutch: These rules have been removed as the functionality is included already I the Core rules.

 

§  NL STANDARD Grammar ALL-NL.rul

§  NL STANDARD Punctuation ALL-NL.rul

§  NL STANDARD Symbol & Numbers ALL-NL.rul

 

·      Renamed rules for Chinese (Simplified) and (Traditional) to follow standard language name conventions:

 

§  ZH-TW STANDARD Common Typos ALL-ZHTW

§  ZH-TW STANDARD Offensive Terms ALL-ZHTW

§  ZH-TW STANDARD PoliCheck ALL-ZHTW

§  ZH-TW STANDARD Punctuation EN-ZHTW

§  ZH-TW STANDARD Style Check ALL-ZHTW

§  ZH-TW STANDARD Literals - Duplicated Words ALL-ZHTW

§  ZH-TW STANDARD Literals - Variable ALL-ZHTW.rul

§  ZH-CN STANDARD Literals - Typo|Policheck Terms|English Term in Target EN-ZHCN.rul

 Entity Extractor Rules    

·      Updated rules: Previous rule ALL STANDARD CamelWords and Acronyms.rul has been divided in two and improved.

§  ALL STANDARD CamelWords

§  ALL STANDARD Acronyms

·      Deleted rules:

§  ALL STANDARD CamelWords and Acronyms.rul

 

Dictionaries

New (19):

Updated ( 28):

 

Arabic

Bengali

Catalan

Danish

Dutch

English (UK)

English (US)

Finnish

French

German

Greek

Hebrew

Italian

Kannada

 

Korean

Malayalam

Marathi

Norwegian (Bokmal)

Norwegian (Nynorsk)

Polish

Portuguese (Brazil)

Portuguese (Portugal)

Punjabi

Slovak

Spanish

Swedish

Tamil

Telugu

 

Documentation

Help file has been updated with changes related to this release.

Known Issues

The most up-to-date information on Known issues can be found at: http://autoupdate.lionbridge.com/ltb/KnownIssues.htm

___________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox 2.34 Version Release Notes

We are pleased to announce a new release of Linguistic ToolBox. In this release we have completely revamped the functionality of the Spelling component to make it quicker, more efficient and more flexible.

You will now be able to use multiple dictionaries at the same time. For example, if you need to spell-check bilingual files for an automotive project from German into English, you will be able to use the generic English dictionary in conjunction with the specific dictionary for automotive. Our LTB dictionaries will be case-sensitive from this new release onwards, and therefore will be more accurate.

If you have any technical questions or in case you identify any bugs, please contact us once you have checked with your Site Expert.

·         For Lionbridge users, please use Service Desk.

·         For non Lionbridge users, you can contact us at GTLTBSupport@lionbridge.com

 

=== New Features ===

* Mutiple Dictionaries allowed: LTB now allows the usage of multiple dictionaries at the same time with the Spelling component. This way we can easily manage “custom” dictionaries that are only applicable to specific projects/domains or subject matters. Users are now able to check/uncheck dictionaries from a list of dictionaries and to add more than one dictionary for each language.

* Language independent spell-check: LTB now allows to set the dictionary language value as [ All ]. For example, this can be useful for situations where we would like to spell-check Latin terms or Proper nouns lists (Company names, Product names, etc) that may be used in any language.

* XMLà TMX Converter now also allows to convert XML files into .RTF file format: There is a new option now in the XMLà TMX Converter to generate .rtf files as output in addition to .TMX. You can also select whether the generated output will contain or not leading and trailing spaces.

 We have improved this Converter to work as follows: does not export empty segments and if Source and Target paragraphs from the XML files contain the same number of segments, the Converter splits the paragraphs and aligns the segments. With regard to locked segments, the Converter considers as an internal group the tags that do not contain any condition in the //locked/tags/tag" node without analyzing the conditions within the "//locked/tags/tag" nodes.

=== Enhancements ===

* Spelling component’s execution is faster: LTB only spell-checks one instance of a repeated word.

 

*New tokenization: Previously LTB identified words from within segments by using as delimiters numbers, symbols, control characters, spaces and certain punctuation characters. Going forward, the only separator to identify words from within a segment is a space with or without punctuation characters around it. EXCEPT in the case where there is a period to the right of a space and to the left of a string, such as for example:  “ .Net Framework”.

 

* LTB Dictionaries are now case-sensitive: The previous LTB spell-check was not sufficiently case-sensitive. Specifically, words were converted to all-lower-case before they were looked up in the .XPL file, and they were converted to all-lower-case when imported from a TXT file into an XPL file.


* Use of MS Word Spell Checker option no longer takes into account MS Custom dictionaries: This way we can have more control over spelling when sharing LTB configuration files that include spelling.

 

* The Speller tab of the LTB Output Report has been improved by adding an additional column to clearly identify spelling mistakes from the source and the target: This will be applicable to situations where spell-check has been carried out on both source and target simultaneously.

 

* LTB displays a warning message if users try to run spell-check using an empty dictionary with the option Use MS Word spellchecker disabled.

 

=== Changes === 

* New user interface: The Spelling component User Interface has been modified to enable users to select multiple dictionaries at the same time.

Also, the option Ignore capitalized, CaMeL and alphanumeric expressions has been replaced by these individual options under Ignore Options:

o             Acronyms and words in all capitals

o             CaMel words

o             Alphanumerics

We have also introduced a new option, under Ignore Options called: Internet and file addresses to ignore URLs, FTP addresses, local and network file paths, file names and local drive units.

 

=== Service Desk tickets fixed in this release ===

* Issue with LTB not identifying target segments in some cases when using Word bilingual files.

* After removing the "\r\n" entities, the parser concatenated the words before and after them, producing false positives in the output.

* When importing an .xpl into an existing dictionary, you were not able to find any of the new imported words using the Search button.

* After importing a list of entries in .txt file format into an existing dictionary, the updated dictionary would not save within the configuration file, only the original dictionary.

* The Ignore capitalized, CaMeL and alphanumeric expressions option only used to recognize Latin characters. The regular expressions used to test whether words qualify for exclusion under this option (and equally, qualify for handling as case-sensitive spellings) need to be internationalized.

* When the option Ignore capitalized, CaMeL and alphanumeric expressions in the Spelling component was enabled, LTB still reported issues related to alphanumeric expressions even though they should have been ignored.

* In some cases, when a LTB Dictionary is edited using Dictionary Editor, it becomes corrupted and cannot be reused.

 

=== Support files updates ===

* Updated rules for Portuguese (Brazilian): Rules to detect instances of old spelling in both monolingual and bilingual files.

·         PT-BR STANDARD Monolingual Detect Old Spelling 1_11.rul

·         PT-BR STANDARD Detect Old Spelling 1_11 ALL-PT-BR.rul

 

* Updated rule for Chinese, Japanese and Korean: Checks for literal instances of untranslated words in Chinese, Japanese and Korean and reports an issue if there's any difference. This rule is especially important for Chinese and Japanese as these languages cannot be spell-checked and with this rule we can detect instances where there are spelling mistakes in untranslated text in the target.

·         EA Standard Target English_EN-EA.rul

 

=== Documentation ===

Help file has been updated with changes related to this release.

=== Known Issues ===

Description

Workaround

Particularity

Some translators are having problems installing the last version of Linguistic ToolBox. The message that they receive is that the Logoport Client is not installed although they are regular users of Logoport.

This issue is linked to using a 64 bits OS (Windows XP or Windows Vista). In these OS, the Windows path to the Logoport template does not update until you have either logged off or restarted your PC. Users must re-start Logoport once installed.

64 bits OS

When starting Linguistic ToolBox, the splash window appears to be frozen.

Move the mouse.

Multiuser environment

When using LTB to process large numbers of input files on a network, LTB displays a message for every file asking whether you would like to open a Read-only copy (when the files are not opened by any other users).

Copy input files locally to work with LTB.

Using input files on a network

When using LTB to process large numbers of input files on a network, LTB displays a message for every file indicating an invoke error on the file.

This is usually related to having Acrobat PDF Maker add-in for Word installed. The problem is solved if you uninstall the add-in.

PDF Maker add-in installed

When running Linguistic ToolBox using .doc or .rtf as input files, sometimes you get this error message: Object  Does not Match Target Type

  1. In Add/Remove Programs, look for Microsoft Office 2003 Professional Edition (or whichever edition you have installed) and click Change.
  2. Select Add or Remove Features and then click Next.
  3. Select Choose advanced customization of applications and then click Next.
  4. Expand the Microsoft Office Word node, select .NET Programmability Support, select Not Available, and then click Update.

Word files as input files containing tables or graphics

On the Term & Punctuation tab, the check-boxes beside each of the rules are missing. 

1. In the Windows Desktop right click and select Properties.

2. On the Appearance tab, ensure the Font size value is set to Normal.

3. On the Settings tab, click on the Advanced button and ensure the DPI Setting value is set to "Normal size (96 DPI).
4. Click OK.

This issue is related to the Font size.

Issues loading a configuration file correctly in v. 2.33 can be due to the configuration file having been created in an old version of LTB previous to 2.32.  

The old LTB cfg files were plain text, and did not allow for the introduction of new features in existing components, as the configuration values were read sequentially. Hence, the .cfg file format was moved to XML format a while ago to allow the introduction of any new features in future developments, as is the case now for some Preferences values. While no new features have been introduced to LTB till version 2.33, the old text files could be read and used in newer LTB versions. However, in LTB 2.33 new options have been added to the Preferences dialog, and some others removed from the LTB components, so old LTB configuration files are no longer valid. This is why the old plain-text cfg configuration files are failing. They are missing values that LTB expects to find.

To solve this issue, open the config file in the previous LTB 2.32 version, and save it again. Version 2.32 saves the file as XML, so you will be able to open them later in the new 2.33 version without problems.

 These are the steps:

1. Go to File>Manage Configurations

2. Browse to the location of your old configuration file and select the configuration file you wish to resave.

3. Click on the Load button.

4. Once the old configuration file has been loaded, go back to File>Manage configurations and click on the Save button.

Once everyone starts using v. 2.33 if you encounter this issue, send us a mail to: !GTLTBSupport@lionbridge.com

Configuration files created before v. 2.32

At the moment, LTB does not support Microsoft LocStudio EDB files as an input file format. In order to work with EDB files or LSG files with LTB, these file formats need to be converted into .CSV file format.

You can easily batch convert Microsoft LocStudio files into CSV file format by using the EDB2CSV Converter at this location:

 http://sp2.intranet.lionbridge.net/Dept/GT/LanguageTechnology/LinguisticToolbox/LTB%20Tools/EDB2CSV.zip

 Alternatively, you can export individual files to .CSV file format from LocStudio itself, by using either of these options:

  • For Software Files (EDBs): select Software in the menu bar, and then select Export Software to CSV.
  • For Glossary Files (LSGs): select Glossary in the menu bar, then select Export>Export Glossary to CSV.  

Microsoft LocStudio

When creating or saving your Linguistic ToolBox Configuration files the options from File>Preferences are not recorded as part of the configuration file.  This is especially an issue for options such as Ignore matches equal or above to or Ignore locked segments or marked as non-translatable.

When sharing LTB Configuration files, please specify in your instructions the options that need to be manually set in File>Preferences.

Configuration files and options in File>Preferences

When using LTB to carry out spell-checking using the option: Use Microsoft Word spell

checker and you wish to disable the Ignore Options: Acronyms and words in all capitals, Alphanumerics and Internet and file addresses, so that these items are taken into account by the spell-check; make sure that these settings are also disabled in Microsoft Word.

In order to do this, in Microsoft Word, go to Review>Spelling & Grammar>Word Options and make sure these options are disabled: Ignore words in UPPERCASE, Ignore

words that contain numbers and Ignore Internet and file addresses.

Spellchecking using the option Use Microsoft Word spell checker

 

The most up-to-date information on Known issues can be found at: http://sp2.intranet.lionbridge.net/Dept/GT/LanguageTechnology/LinguisticToolbox/Lists/Known%20Issues2/FinalView2.aspx?ShowInGrid=True&View=%7B84E4B81E%2D402F%2D43AB%2DBE66%2D30E9E1C2FFBC%7D

 . This link is only available for Lionbridge users.

___________________________________________________________________________________________________________________________________________

Lionbridge Linguistic ToolBox 2.33 Version Release Notes

We are pleased to announce a new release of Linguistic ToolBox. In this release we have added new functionality to ignore 100% matches from our checks when we don't have to review 100% matches and we have also introduced new functionality to detect unsegmented text. For more details, please see below.

On the next release we will work on improving the Spelling component to make it faster and more efficient.

If you have any technical questions or in case you identify any bugs, please contact us once you have checked with your Site Expert.

·         For Lionbridge users, please use Service Desk.

·         For non Lionbridge user, you can contact us through GTLTBSupport@lionbridge.com

 

=== New Features ===

* Ignore matches equal to or above a specified segment match: The option Ignore matches equal or above is now available from File>Preferences. This option is applicable to all the Quality Control components using bilingual files. Use this option when you don’t need to review legacy segments. Please refer to the "Changes” section for details on the implementation of this option for the consistency check features.

* Unsegmented text detection: In the Segment Checker component, we are now able to detect and report unsegmented text by selecting the option Check if Target is empty. This functionality is only available for .XLF, .TTX and .RTF files. Unsegmented text is now reported in the Linguistic ToolBox Output report under the Issue column as: Target is empty - Unsegmented text.

Note that unsegmented text is not taken into account for the QUICK, Term & Punctuation, or Change Report checks.

* Ignore locked segments or marked as non-translatable feature (File > Preferences): This new option allows you to ignore from all Linguistic Toolbox components the following data:

·         all segments flagged as translatable="no" in .XLZ/.XLF files

·         all locked segments in .RTF/.DOC files


=== Enhancements ===

* Installation:. The new install.exe available from http://autoupdate.lionbridge.com/ltb/ ensures that the PC meets the Linguistic Toolbox Installation requirements:

·         Windows XP or Windows Vista

·         .Net Framework 2 or later

·         Logoport Client

·         Office 2003 or 2007

 

=== Changes === 

* Quality Control components: The option Ignore Perfect match segments is not available any longer from the Quality Control components. This option has been replaced with the option Ignore matches equal or above, available from File>Preferences. Users will get a warning each time they click on the Start button if the value selected in the Ignore matches equal or above option is set to 100% or below.

*New behavior for the consistency checks in the Segment Checker: To ensure direct and reverse consistency between legacy and new segments, the options Check inconsistency and Check reverse inconsistency do not take into account the options Ignore matches equal or above and Ignore locked segments or marked as non-translatable selected in File>Preferences.

* Spelling component: The option Use MS Word spell checker is now enabled by default.

 

* Ignore segments marked as non-translatable: is now available from the option Ignore locked segments or marked as non-translatable

 

=== Support files updates ===

* Updated generic rule:

·         ALL STANDARD Literals - Acronyms.rul

 

* Updated rules for Portuguese (Brazilian):

·         PT-BR STANDARD Monolingual Detect Old Spelling 1_10.rul

·         PT-BR STANDARD Detect Old Spelling 1_10 ALL-PT-BR.rul

 

* Updated rules for French (Canada):

·         FR-CA STANDARD Punctuation ALL-FRCA.rul


* Updated rules for Norwegian (Bokmal) and Norwegian (Nynorsk):

·         NN STANDARD AutoCorrect All-NN.rul (Detects common typos for Norwegian Nynorsk)

·         NB NB STANDARD AutoCorrect All-NB.rul (Detects common typos for Norwegian Bokmål)

·         NB CUSTOM Microsoft EN-NB.rul (Microsoft rules for Norwegian Bokmål. Mainly style and spelling)

·         NN CUSTOM Microsoft EN-NN.rul (Microsoft rules for Norwegian Nynorsk. Mainly style and spelling)

 

* Updated rules for Spanish:                                                                                                                                                     

·         ES STANDARD Spelling Errors ALL-ES.rul

·         ES STANDARD Grammar Errors ALL-ES.rul

·         ES STANDARD MICROSOFT EN-ES.rul

·         ES STANDARD Neutral Spanish ALL-ES.rul

·         ES STANDARD Common Errors ALL-ES.rul

 

* New rules for English:

_ These monolingual and bilingual rules detect instances of English (US) wordings in English (UK) texts and vice versa.

·         EN-UK_STANDARD_CheckSourceforAmericanEnglish.rul

·         EN-UK_STANDARD_CheckTargetforAmericanEnglish.rul

·         EN-US_STANDARD_CheckSourceforBritishEnglish.rul

·         EN-US_STANDARD_CheckTargetforBritishEnglish.rul

 

_ Checks for UK-specific automotive terms in target segments and propose to replace them with a correct US equivalent.)

·         EN-US_Standard_Automotive_checkforUKterms_ALL-ENUS.rul

 

* Updated Dictionaries:

·         Polish.xpl

·         Norwegian (Nynorsk).xpl

·         Norwegian (Bokmal).xpl

 

=== Documentation ===

 Help file has been updated to include the new features and functionality.

=== Known Issues ===

Description

Workaround

Particularity

Some translators are having problems installing the last version of Linguistic ToolBox. The message that they receive is that the Logoport Client is not installed although they are regular users of Logoport.

This issue is linked to using a 64 bits OS (Windows XP or Windows Vista). In these OS, the Windows path to the Logoport template does not update until you have either logged off or restarted your PC. Users must re-start Logoport once installed.

64 bits OS

When starting Linguistic ToolBox, the splash window appears to be frozen.

Move the mouse.

Multiuser environment

When using LTB to process large number of input files on a network, LTB displays a message for every file asking whether you would like to open a Read-only copy (when the files are not opened by any other users).

Copy input files locally to work with LTB.

Using input files on a network

When using LTB to process large number of input files on a network, LTB displays a message for every file indicating an invoke error on the file.

This is usually related to having Acrobat PDF Maker add-in for Word installed. The problem is solved if you uninstall the add-in.

PDF Maker add-in installed

When running Linguistic ToolBox using .doc or .rtf as input files, sometimes you get this error message: Object  Does not Match Target Type

1.        In Add/Remove Programs, look for Microsoft Office 2003 Professional Edition (or whichever edition you have installed) and click Change.

2.        Select Add or Remove Features and then click Next.

3.        Select Choose advanced customization of applications and then click Next.

4.        Expand the Microsoft Office Word node, select .NET Programmability Support, select Not Available, and then click Update.

Word files as input files containing tables or graphics

On the Term & Punctuation tab, the check-boxes besides each of the rules are missing. 

1. In the Windows Desktop right click and  select Properties.

2. On the Appearance tab, ensure the Font size value is set to Normal.

3. On the Settings tab, click on the Advanced button and ensure the DPI Setting value is set to "Normal size (96 DPI).
4. Click
OK.

This issue is related to the Font size.

Issues loading a configuration file correctly in v. 2.33 can be due to the configuration file having been created in an old version of LTB previous to 2.32.  

The old LTB cfg files were plain text, and did not allow for the introduction of new features in existing components, as the configuration values were read sequentially. Hence, the .cfg file format was moved to XML format a while ago to allow the introduction of any new features in future developments, as is the case now for some Preferences values. While no new features had been introduced to LTB till version 2.33, the old text files could be read and used in newer LTB versions. However, in LTB 2.33 new options have been added to the Preferences dialog, and some others removed from the LTB components, so old LTB configuration files are no longer valid. This is why the old plain-text cfg configuration files are failing. They are missing values that LTB expects to find.

 

To solve this issue, open the config file in the previous LTB 2.32 version, and save it again. Version 2.32 saves the file as XML, so you will be able to open them later in the new 2.33 version without problems.

 

These are the steps:

1. Go to File>Manage Configurations

2. Browse to the location of your old configuration file and select the configuration file you wish to resave.

3. Click on the Load button.

4. Once the old configuration file has been loaded, go back to File>Manage configurations and click on the Save button.

Once everyone starts using v. 2.33 if you encounter this issue, send us a mail to: !GTLTBSupport@lionbridge.com

Configuration files created before v. 2.32

At the moment, LTB does not support Microsoft LocStudio EDB files as an input file format. In order to work with EDB files or LSG files with LTB, these file formats need to be converted into .CSV file format.

You can easily batch convert Microsoft LocStudio files into CSV file format by using the EDB2CSV Converter at this location:

 http://sp2.intranet.lionbridge.net/Dept/GT/LanguageTechnology/LinguisticToolbox/LTB%20Tools/EDB2CSV.zip

 Alternatively, you can export individual files to .CSV file format from LocStudio itself, by using either of these options:

Microsoft LocStudio

 

The most up-to-date information on Known issues can be found at: http://sp2.intranet.lionbridge.net/Dept/GT/LanguageTechnology/LinguisticToolbox/Lists/Known%20Issues2/FinalView2.aspx?ShowInGrid=True&View=%7B84E4B81E%2D402F%2D43AB%2DBE66%2D30E9E1C2FFBC%7D