Difference between revisions of "The TM/Glossary editor Wordfast Classic"

From Wordfast Wiki
Jump to: navigation, search
(Filtering)
(No difference)

Revision as of 15:47, 26 September 2017

Click the "TM/Glossary editor" TM Glossary editor icon.png icon in Wordfast's main toolbar, or the last icon in any of the glossary toolbars to start the TM/Glossary editor. Outside a translation session, glossary toolbars can be opened using the Ctrl+Alt+Right shortcut (and closed using the Ctrl+Alt+Left shortcut). During a translation session, the Ctrl+Alt+G shortcut pressed on a word or selection will open the glossary toolbar(s) of the glossary(ies) where the term was found. Glossary toolbars open only on glossaries that were specified in Wordfast/Terminology. Wordfast's TM/Glossary editor is intended to make maintenance easy and intuitive, and offers practically identical methods for TMs and glossaries. Once the editor is opened, you can scroll up/down the data, edit/delete/add entries.

Shortcut Effect
Space bar mark/unmark entries
Ctrl+A mark/unmark all entries
Shift+Ctrl+A reverse the current marking
Ctrl+X cut all marked entries
Ctrl+Y undo the previous cut operation
Ctrl+C copy all marked entries to Wordfast's own clipboard
Ctrl+V paste Wordfast's clipboard's contents to the end of the file
Ctrl+D

(right-click)

Toggle the display of all, or only marked, TUs.
F7 or Click the

column header area

Open the "Filter or sort" dialog box.
Ctrl+O Open another file (another glossary or antoher TM).

Note: cutting (deleting) a single line (or entry, or TU) is a soft operation, meaning it can be reversed or undone (press Delete twice on an entry to see the toggling effect). When an entry is cut (or soft-deleted), it appears as a blank line, but when it is selected, the source and target data appears in the editor's bottom blue/green display. Ctrl+Delete will permanently erase cut entries by "packing", i.e. rewriting, the entire TM or glossary.

The editor's Filter or Sort dialog box (Press F7 or click the column header are) gives access to three types of operation on data: Filter, Sort and Special filters.

Filtering

Filtering means you define a condition with a Field Condition Argument format. For example:

SourceText & "MyText

where & means "contains", or

Counter = 0

See more examples in the Filter or Sort dialog box' Help.

When Argument is made of text, it must be enclosed in straight quotes like this: "MyText".

The effect of a filter is that only the entries that conform to the filter's condition(s) will be made visible in the glossary editor. When a filter has been set, using the Mark methods (mark, unmark, copy, paste, cut) will operate only on visible entries. Use the F8 shortcut to cancel a filter.

Sorting

Sorting can take some time, because the entire file is actually (physically) sorted, not just the display of the file. Sort when necessary. Wordfast adds the convenience of being able to sort source or target text on word or character number. This can be useful for terminology extraction.

Special filters

Special filters are meant to perform operations that would be difficult or impossible to perform with just filtering and sorting. These operations are:

Mark redundant entries (there are various types of definition for a redundant entry, depending on whether you use a TM or a glossary). This feature marks entries that are considered duplicates. Once the marking is done, you can review them, then delete them all by using the Cut shortcut (Ctrl+X) followed by a hard-delete command (Ctrl+Delete). Of course, with a TM, such entries are grouped if the TM is sorted on the source segment.

Reverse source and target This will rewrite the current file and reverse source and target fields.

Export to Unicode Exports the current file to a unicode format.

Export to TMX (TM only) Exports the current file to the TMX format. The TM is not overwritten - a new file is created, and it has a .tmx extension.

Remove tags This special filter removes tags from a TM. This is recommended after finishing a project with tagged files. The leverage of TUs with tags is precious within the scope of a particular project. Tagged leveraged outside a project is an extreme rarity. This is why it is recommended to remove tags from a TM that will be used on different translation projects. Tags bloat TMs to a ridiculous extent.

Rewrite Entries with a Mask This powerful feature is used to replace a particular field, or many fields, with some given value, or erase the content of the fields, in all visible entries. Visible entries are those that are displayed in the editor. If a filter is set, only some entries are visible.

You are first presented with an empty entry (a mask). You can:

  • enter an equal sign (=) followed by some text in any field, in which case, the text after the equal sign will replace whatever is found is the corresponding fields in all visible entries in the file (TM or glossary);
  • enter "=null" in a field to erase the content of that field.

All fields that are left blank in the mask will remain untouched in the file.

The following mask would replace all User fields with "FOO", and erase Attribute fields 2, 3, 4 in the entire TM:

Practical example: "I have that older, bulky TM that combines TUs from various translators. I want these entries grouped by user (translator) name. I want to delete all entries that have a usage counter of less than 2, and that are older than August 31, 2004. Then I want to review them one by one and perhaps have some entries not marked for deletion if I think they're useful after all. Only then will I erase all marked entries that remain".

  1. Start the TM/Glossary editor, click the Tools button.
  2. Sort on "User".
  3. Set the following filter: Counter < 2 AND Date < 20040831 .
  4. Press Ctrl+D to view only marked entries.
  5. Review marked entries, un-mark the ones you wish to keep.
  6. Press Ctrl+X to cut all marked entries.
  7. Press Ctrl+Delete to permanently erase all marked entries.
  8. Sort on Date to revert to a "natural" order in the TM.

Note that all operations except #7 can be undone.

TMs and glossaries must be created for one language pair only. I also advise keeping separate TMs for different subject (domain) and client, and having them in dedicated folders so that keeping track of them, and especially backing them up, remains easy.

TMs keep growing all the time. Simple statistics show that a majority of TUs will never be re-used (or are very unkikely to be re-used), while a minority of them will. Since Wordfast keeps track of how many times a TU is re-used in the usage counter field, it is advised, when a TM reaches a large size (over 100,000 TUs), or when finishing a large translation project, to perform a compression by eliminating all TUs that have never been re-used. As a result, the TM's size will be considerably reduced, while its overall efficiency will be preserved. To do so:

  1. Start the TM/glossary editor on the required TM.
  2. Press F7, and set the following filter: Counter = 0 . Click OK.
  3. Mark all (Ctrl+A). Cut marked (Ctrl+X). Hard-delete (Ctrl+Delete).

Creating a startup TM. Create one single, large TM by combining all the TMs you have. Delete all TUs that have a usage counter of less than 3. To compress further, you can visually review the TM and delete TUs that are unlikely to pop up again. To do so, sort the TM on "SourceWords", go to the end of it and review the TUs that are the longest, where there are likely "ghost" candidates, longish TUs that are unikely to show up again. Delete them. This TM can then be used as a primer - if you need to create a new, empty TM, better use a copy of that TM instead, because it contains a "Top 50" or perhaps a "Top 1000" of your previous work. It's like priming a pump with a cup of water.

A Wordfast TM may contain TUs where the first figure of the date (normally "2", but it can be "1" for TMs created in the previous millenium) is replaced with "x", and which, as a consequence, appear to be "cut" in the editor. This is because, in the course of a translation session, the TU was proposed as 100% match on a green background, but the target segment was edited, so Wordfast has deleted the original version of the TU in the TM and has re-written the TU's edited version at the end of the TM. This is normal. Do not "resurrect" or un-delete such TUs: their correct version appears further down in the TM. During translation sessions, Wordfast is blind to TUs that are marked "x". As a rule of thumb, perform a "Reorganisation" of the TM before working on it. This is done with the Wordfast > Translation memory > TM "Reorganise" button and it erases all TUs that were marked as "Deleted" with an "x" mark in the course of previous translation sessions.

Sharing TMs with other Wordfast users, or with other CAT tools. Sharing TMs with other Wordfast users: always reorganise (use the Translation memory/TM/Reorganise button in Wordfast) before sharing a TM with another Wordfast user. Sharing TMs with other CAT tools: open the TM with the TM/Glossary editor, click Tools, apply the "Export TM as TMX" special filter. The TM will be re-written as TMX and the file's extension will be changed to .tmx.

  Back to Wordfast Classic User Manual