Converter to display word file correctly. Selecting text encoding when opening and saving files

In day-to-day IT tasks such as network administration and user support, a variety of files are often used, especially documents written with word processing. Unfortunately, the built-in Windows tools allow you to work with documents only as with files; standard tools do not handle internal word dataeg document type conversion.

I wrote a WSH (Windows Script Host) script called ConvertWord, which is used as a command shell for Microsoft Word and makes it easier to work with documents. In addition, the script can be useful for testing invalid documents.

ConvertWord requirements

To use ConvertWord, you must have Word 97 or a later text editor installed on your computer. The complete ConvertWord source can be downloaded from our magazine's Web site. Snippets of the ConvertWord script are shown below. The convertword.wsf and convertword.cmd files should be saved in the same folder.

ConvertWord can automatically invoke any converter file formatsimplemented in Word. A basic set of file format converters for common documents comes with Word. However, this set does not include special converters such as those for Microsoft Works or WordPerfect documents. To obtain these and other optional converters, you must perform a custom installation of Word.

Standard Word converters included in the package Microsoft Office Resource Kits, can be downloaded from the Office 2003 Editions Resource Kit page at http://www.microsoft.com/office/ork/2003/default.htm ... The resource pack converters are compatible with Word 97 and later versions of the editor. After installing the resource pack, go to the created directory (by default, \\% programfiles% orktools) and locate the file with the set of converters (oconvpck.exe), and then run oconvpck.exe on all computers on which you want to deploy the converters.

Purpose of ConvertWord

The original goal of ConvertWord was to perform some tasks that were not possible with the Batch Conversion Wizard in Word editor. The Batch Conversion Wizard is a useful addition to any administrator's toolbox. The master is word templatewhich converts one input format to one output format. For more information on this conversion, see the Microsoft article "How to automatically convert many documents to Word 2002 format" at http://support.microsoft.com/?kbid\u003d313714.

The Batch Conversion Wizard performs many tasks, but is not optimized for some of them, such as remote administration or automating simple conversions for end users who collaborate on documents on separate network sites. ConvertWord can help you solve these kinds of distributed transformation tasks by performing the following basic operations.

  • Sends a request to the system about the version of Word it has.
  • Automatically opens lists of documents of mixed types of arbitrary length.
  • Guaranteed to save documents with unique names in Word (default) or other formats.
  • Tests documents looking for formatting problems and incorrect user passwords.

How ConvertWord works

ConvertWord's conversion process is a four-step process. In the first step, the script generates an instance of Word as shown in the snippet listing 1 labeled A. Part of the script source is intended to minimize the number of dialog boxes. For example, the original text labeled B blocks dialog boxes, when it's possible.

In the second step, ConvertWord opens each document. The Word object contains a set of Documents; when calling the Open method of this set (the fragment labeled A in listing 2) the document is retrieved. If you know the name of the document and want Word to automatically detect its format, you can call the method with only the name of the document as an argument.

Alternatively, you can specify the document format as another parameter to the Open method. Unfortunately, depending on the version of Word, the Open method requires up to 16 parameters. Since the format-controlling parameter is in tenth place, the previous nine parameters must be specified. The result is a long, unwieldy line. Parameter information can be obtained at http://msdn.microsoft.com/library/default.asp?url\u003d/library/ enus / dv_wrcore / html / wrconwordobjectmodeloverview.asp or in Word Help.

The ConvertWord parameters are FileName, ConfirmConversions, ReadOnly, AddToRecentFiles, PasswordDocument, PasswordTemplate, Revert, WritePasswordDocument, WritePasswordTemplate, and Format. The FileName parameter is the file name of the Word document. The ConfirmConversions parameter displays a dialog box when Word converts an open document. ConvertWord always sets this parameter to False to ease automation.

The ReadOnly parameter controls the process of opening a read-only document; ConvertWord always sets this parameter to True to keep the original document unchanged. AddToRecentFiles determines whether the open document is added to the current user's RecentFiles list. The document can be one of tens or even hundreds, therefore it is not recommended to add it to the list and the parameter is set to False.

PasswordDocument is the password for opening protected documents, and PasswordTemplate is the password for templates. These values \u200b\u200bare not useful for non-Word documents, so instead of any parameter, you can specify two double quotes ("") Denoting an empty string. The Revert parameter determines whether the script reverts to the currently open version of the document if the document being converted is already open. ConvertWord sets this parameter to True to avoid losing changes and only activate an open instance of the document.

The WritePasswordDocument and WritePasswordTemplate parameters specify the passwords required to save an open document or template. For the purposes of this article, these parameters are optional because ConvertWord does not overwrite the original document; therefore, the script specifies "" for each of these arguments.

Finally, the Format parameter is a number that indicates the method that Word uses to determine the format of an open document. It is not easy to get the correct number because the numbers and the methods they represent depend on the version of Word installed, additional document converters, and the order in which they were installed. Suppose we need to open and convert an RTF (Rich Text Format) document with format open code 3. To open an example document using a standard RTF converter, use the following procedure:

Set doc \u003d Word.Documents._

Open ("c: my.rtf", False, _

True, False, "", "", _

True, "", "", 3)

Some lines of source code in this article have been split over multiple lines due to space constraints. A list of additional document converters with corresponding numbers and standard extensions can be found using the FileConverters object set. The source code for Listing 3 shows a list of these converters. There are no standard Word converters listed. A list of standard Word converters can be found at tab. 1 and in Word Help.

The CreateFormatCollections subroutine of the ConvertWord script displays a list of Word converters. While the script makes it somewhat easier to define the open and save formats, the format used to open or save a document depends on the version of Word and the order in which the converters are installed.

After opening the document, it a new version saved using the SaveAs method (the fragment labeled A in listing 4). The SaveAs method accepts up to 16 parameters, but we only need two, since the required SaveFormat parameter is the second. As with the OpenFormat parameters, you must specify the format codes for the document being opened in the SaveFormat parameter. To specify the save format - for example, to save the document in a plain text file C: my.txt - you must enter the command

doc.SaveAs "C: my.txt", 2

After saving the document, ConvertWord closes it using the Close method (label B in Listing 4). False indicates that Word should discard changes if the document has been modified after saving. When the script has sequentially opened, saved, and closed all the documents, the final step is to exit Word by calling the Quit method of Word ( listing 5).

Using ConvertWord

Before starting ConvertWord for the first time, it is useful to review the local version of Word information by running the command

convertword / version

This command displays important information, including the version of Word installed on the machine. Microsoft has stopped listing the version number in the product name since Office 95 (which would be called Office 7), but the internal version number increases by 1 with each major update. The same numbering scheme is used in Word as in a component of the Office suite. The internal version numbers are 8 (Word 97), 9 (Word 2000), 10 (Word 2002), and 11 (Word 2003).

By default, ConvertWord automatically opens files, making educated guesses about their format (e.g. Word, Plain Text, WordPerfect, RTF) and saves them as word documentsby giving them unique names consisting of a filename, an underscore, and a number. ConvertWord provides several ways to name documents. The file name can be entered as an argument to the following command:

convertword unicode.txt plain.txt

Otherdocscorel.wps

As a result of this approach, the output files in Word format are saved as unicode.doc, plain.doc, and otherdocscorel.doc. Another option is to configure ConvertWord to read files from a standard source, for example:

convertword

You can pipe the results of a command that creates a list of files to ConvertWord as follows:

dir / s / b c: inbox * .txt

| convertword

If no input is specified, then ConvertWord asks for the names of the input documents until the Ctrl + C key combination is pressed twice.

ConvertWord has a simple method that avoids overwriting files with the same name. Suppose you want to save your Word file as a text file named mylist.txt. If a file with the same name already exists, ConvertWord starts iterating over the sequence of derived names - mylist_1.txt, mylist_2.txt, and so on - until an unused name is found. This name is then assigned to the saved file. Typically, it takes less time to find the file name than to manually open and save the document.

Changing the storage location and file name

ConvertWord saves the files in the same folder as the original file with the same base name. Thus, when converting files for many users or groups of users, the new files will be placed next to the old ones. Usually users know "their" files and remember their names.

However, you can change the directory for storing converted documents. To do this, it is enough to specify the / d switch with a full name, which can be absolute or specified relative to the path pointing to the folder in which the script is running. ConvertWord expands the path to the full format and creates the appropriate directory if it doesn't already exist.

convertword / d: c: empexports

You can change the base name (filename without extension) using the / b switch. If ConvertWord detects multiple files with the same name, then ConvertWord changes the file names as explained above. You can also use the / x switch to specify a file extension other than the standard extension of the exported file type.

Create documents other than Word

By default, ConvertWord automatically generates Word documents. If you want to create a document other than Word, you can use the / sa parameter of the ConvertWord utility to change the default save format. The formats in which files can be saved vary depending on the version of Word and additional converters available on the system on which ConvertWord is running. The first step when saving a file in a certain format is to start Word with the / cnv switch to view the installed converters; the converter number corresponds to the type in which you want to save the new file. If all files need to be saved in a specific format, such as RTF (number 6), then the / sa: 6 switch should be added to the ConvertWord arguments. For example, to convert all WordPerfect files in the current folder to RTF, run the command

dir / s / b * .wpd

| convertword / sa: 6

Depending on the version of Word and the converters installed, the number of available formats can be large. You should always check the types before converting files, as their numbers will be different on different machines. The only exception to this annoying rule is the standard built-in Word converters. Word 97 and later have the same values \u200b\u200bfrom 0 to 6, and the standard type numbers increase as new versions are added. For Word 2003, the numbers 0 through 11 will be the same on all machines. The exception to the standard values \u200b\u200bis pin number -1. This value does not match the Word converter, but is used as a ConvertWord command to write data from a document file to the console. It can be set using the / sa - / sa: -1 or / sa + switch.

Error processing

During large-scale conversion operations, problems may occur with some files. You need a way to track documents that have failed to convert. If the file cannot be converted, ConvertWord sends the file name and descriptive information to the standard error stream (StdErr); an administrator can track bounces by watching the filenames scroll across the screen, or by redirecting error data to a file for later analysis, for example:

Errors.txt

By default, ConvertWord shows errors by listing only the filename and error number:

c: demo.rtf FAILED: 2

Using the / v + (verbose output) switch, you can get more detailed information about the error:

convertword
/v+\u003eerrors.txt

The / v- switch does not display error numbers; instead, the filename is simply passed to StdErr to facilitate subsequent processing.

The last error found by ConvertWord is always taken as the final error level; when the script finishes running, this value is available in the command environment and can be read by another script, which will determine whether the call to ConvertWord succeeded or failed.

To catch potential errors without converting documents, you can run ConvertWord with the / w (what if) switch. This switch forces ConvertWord to open all documents without saving them. If any files fail, for example, internal data is corrupted, the usual error message will be displayed.

Solving the password problem

Passwords are especially problematic in batch processing because they can be different for different documents. By default, ConvertWord uses the space character as a password, which opens all documents that do not have passwords, and documents with passwords generate an error that does not stop further processing.

This behavior can be changed using the / p (password) switch. If you specify an empty argument (for example, / p: ""), then Word asks for a password for all protected documents. With the / p switch, you can specify a specific password. However, you will not be able to open documents without a password or with a password other than the specified one.

Practical use of ConvertWord

I have done approximately 30K conversions using ConvertWord and have encountered a few common problems. Unusual crashes were almost invariably caused by Word automation bugs; the error number and message came from Word in most cases. Most errors (such as an incorrect password) are easy to fix or understand. The next three mistakes were repeated fairly regularly.

The first is the Word pop-up dialog box for documents containing macros. By default, ConvertWord blocks macros in documents to protect the user from dangerous code. However, when Word opens documents that contain macros, a dialog box appears with a message about blocking macros. The only way I know to eliminate this window is to allow macros to execute. You can do this by running ConvertWord and specifying the / as (automation security) switch with a value of 0 (/ as: 0). This is the default value for programmatically opened Word documents. Before using the / as switch, you need to make sure that the document being opened does not contain dangerous program code.

The second error is related to some RTF documents that cannot be opened successfully, but which nevertheless display correctly in WordPad. They are usually incorrectly formatted and cannot open correctly in Word. ConvertWord is unable to eliminate this problemtherefore, you cannot use ConvertWord to convert these files.

The third error occurs because Word identifies Unicode text documents by the Byte Order Mark in the file. If there is no mark, then Word treats the document as plain text, and when opening the converted document, the user will see voids after each visible character (in reality, voids correspond to null characters). The only way to solve the problem is to convert files with the / oa (OpenAs) switch set to Encoded or Unicode text (/ oa: 5 for Word 97 and later).

Fortunately, these errors are relatively rare. ConvertWord will be extremely useful for processing a large number of documents and will help you open and convert Word documents without tedious manual work.

Indiana Consulting Network Specialist. He is MCSE, MCP + I and MVP certified.

Overview

Microsoft Office XP and 2003 Word, Excel, or PowerPoint users: Before downloading the Compatibility Pack, install high priority updates from Microsoft Update before downloading the Compatibility Pack.

By installing the Compatibility Pack as an add-on to Microsoft Office 2000, Office XP, or Office 2003, you can open, edit, and save files in the new formats used in latest versions programs Word, Excel and PowerPoint. The Compatibility Pack can also be used with Microsoft Office Word 2003, Excel 2003, and PowerPoint 2003 viewers to view files saved in new formats. For more information about the Compatibility Pack, see KB article 924074.

Note... If you are using Microsoft Word 2000 or Microsoft Word 2002 to read or write documents that contain complex characters, then to correctly display Word documents in newer versions of the application, refer to the information in article 925451.

Administrators: You can download the Administrative Template for Word, Excel, and PowerPoint Converters included in the Compatibility Pack.

Update... The Microsoft Office Compatibility Pack has been updated to include Service Pack 2 (SP2). Now, if DOCX or DOCM files contain custom XML tags, then the tags are removed when the file is opened in Word 2003. For more information, see KB978951

System requirements

Operating Systems:

Windows 2000 Service Pack 4, Windows Server 2003, Windows Vista, Windows Vista Service Pack 1, Windows XP Service Pack 1, Windows XP Service Pack 2, Windows XP Service Pack 3
Windows 7; Windows Server 2008

Microsoft Word 2000 Service Pack 3 (SP3), Microsoft Excel 2000 Service Pack 3 (SP3), and Microsoft PowerPoint 2000 Service Pack 3 (SP3)

Microsoft Word 2002 Service Pack 3 (SP3), Microsoft Excel 2002 Service Pack 3 (SP3), and Microsoft PowerPoint 2002 Service Pack 3 (SP3)

Microsoft Office Word 2003 with Service Pack 1 (SP1) or later, Microsoft Office Excel 2003 with Service Pack 1 (SP1) or later, and Microsoft Office PowerPoint 2003 with Service Pack 1 (SP1) or later

Microsoft Office Word 2003 Viewer.

Microsoft Office Excel 2003 Viewer

Microsoft Office PowerPoint 2003 Viewer

Instructions

Installing the update
Make sure your system is up to date by installing High Priority Updates and Mandatory Updates downloaded from the Microsoft Update website (required for Microsoft Office XP and 2003 users).

After installing high priority and required updates from the Microsoft Update website, download the compatibility pack by clicking the Download button above and saving the file to your hard drive.

Double-click the FileFormatConverters.exe executable file saved on your hard disk to run the installer.

Follow the instructions on the screen to complete the installation.

Deleting the download file

From the Windows Start menu, click the Control Panel icon.

Select Add or Remove Programs.

In the list of installed programs, select the 2007 Office Compatibility Pack, and then click Remove or Add or Remove. If a dialog box appears, follow the instructions to uninstall the program.

Click Yes or OK to confirm the removal of the application.


For the full text of the Microsoft Office 2010 Compatibility Pack for Word, Excel, and PowerPoint File Formats, see the download.
The page shows a snippet.

FOR THOSE WHO HAVE AN OLD MICROSOFT OFFICE LIKE 97, 2003, 2007 and 2010

FOR COMPATIBILITY OF OFFICE FORMATS, INSTALL THE CORRECT SOFTWARE.

so that new formats open in old versions

Overview

Microsoft Office XP and 2003 Word, Excel, or PowerPoint Users: Before Downloading the Compatibility Pack install high priority updates from the websiteMicrosoft Update before downloading the compatibility packBy installing the Compatibility Pack as an add-on to Microsoft Office 2000, Office XP, or Office 2003, you can open, edit, and save files in the new formats that are used in the latest versions of Word, Excel, and PowerPoint. The Compatibility Pack can also be used with Microsoft Office Word 2003, Excel 2003, and PowerPoint 2003 viewers to view files saved in new formats. For more information about the Compatibility Pack, see the Knowledge Base article.

Note. If you are using Microsoft Word 2000 or Microsoft Word 2002 to read or write documents that contain complex characters, you should refer to the information in this article to correctly display Word documents in newer versions of the application.

Administrators: You can download the Administrative Template for Word, Excel, and PowerPoint Converters that is included in the Compatibility Pack.

Update.The Microsoft Office Compatibility Pack has been updated to include Service Pack 2 (SP2). Now, if DOCX or DOCM files contain custom XML tags, then the tags are removed when the file is opened in Word 2003. For more information, see KB978951

System requirements

  • Operating Systems:Windows 2000 Service Pack 4, Windows Server 2003, Windows Vista, Windows Vista Service Pack 1, Windows XP Service Pack 1, Windows XP Service Pack 2, Windows XP Service Pack 3
    Windows 7; Windows Server 2008
  • Recommended microsoft programs Office:
    • Microsoft Word 2000 Service Pack 3 (SP3), Microsoft Excel 2000 Service Pack 3 (SP3), and Microsoft PowerPoint 2000 Service Pack 3 (SP3)
  • Microsoft Word 2002 Service Pack 3 (SP3), Microsoft Excel 2002 Service Pack 3 (SP3), and Microsoft PowerPoint 2002 Service Pack 3 (SP3)
  • Microsoft Office Word 2003 with Service Pack 1 (SP1) or later, Microsoft Office Excel 2003 with Service Pack 1 (SP1) or later, and Microsoft Office PowerPoint 2003 with Service Pack 1 (SP1) or later
  • Microsoft Office Word 2003 Viewer.
  • Microsoft Office Excel 2003 Viewer
  • Microsoft Office PowerPoint 2003 Viewer

Instructions

Installing the update

  1. Make sure your system is up to date by installing High Priority Updates and Mandatory Updates downloaded from the Microsoft Update website (required for Microsoft Office XP and 2003 users).
  1. After installing high priority and required updates from Microsoft Update, download the Compatibility Pack by clicking the button above and save the file to your hard drive.
  1. Double-click the executable file saved on your hard drive to run the installer FileFormatConverters.exe.
  1. Follow the instructions on the screen to complete the installation.

Deleting the download file

When you open a text file in Microsoft Word or another program (for example, on a computer whose operating system language differs from the one in which the text in the file is written), the encoding helps the program determine how the text should be displayed on the screen in order to could be read.

In this article

Understanding text encoding

The text that appears as text on the screen is actually saved as numeric values \u200b\u200bin a text file. The computer translates numeric values \u200b\u200binto visible characters. For this, a coding standard is used.

An encoding is a numbering scheme whereby each text character in a set has a specific numeric value. The encoding can contain letters, numbers, and other characters. Different languages \u200b\u200boften use different character sets, so many of the existing encodings are designed to display the character sets of the respective languages.

Different encodings for different alphabets

The encoding information saved with the text file is used by the computer to display the text on the screen. For example, in Cyrillic (Windows) encoding, "Y" corresponds to the numeric value 201. When you open a file containing this character on a computer that uses Cyrillic (Windows) encoding, the computer reads 201 and displays sign "Y".

However, if the same file is opened on a computer that uses a different encoding by default, a character corresponding to the number 201 in this encoding will be displayed on the screen. For example, if the computer uses the encoding "Western European (Windows)", the character "Y" from the original text file based on the Cyrillic alphabet will be displayed as "É", since this character corresponds to the number 201 in this encoding.

Unicode: a single encoding for different alphabets

To avoid problems with encoding and decoding text files, you can save them in Unicode. This encoding includes most characters from all languages \u200b\u200bthat are commonly used on modern computers.

Since Word is based on Unicode, all files in it are automatically saved in this encoding. Files in Unicode can be opened on any computer with an operating system on english language regardless of the language of the text. In addition, on such a computer, you can save files in Unicode that contain characters that are not in Western European alphabets (for example, Greek, Cyrillic, Arabic, or Japanese).

Choosing an encoding when opening a file

If in open file the text is garbled or appears as question marks or squares, perhaps Word has incorrectly detected the encoding. You can specify the encoding that should be used to display (decode) text.

    Open the tab File.

    Click the button Parameters.

    Click the button Additionally.

    Go to section Generaland check the box Confirm file format conversion on open.

    Note: If this check box is checked, Word displays a dialog box File conversion each time you open a file in a format other than Word (that is, a file that does not have the DOC, DOT, DOCX, DOCM, DOTX, or DOTM extension). If you often work with these files, but you usually do not need to choose an encoding, do not forget to disable this option so that this dialog does not appear.

    Close and then reopen the file.

    In the dialog box File conversion select item Encoded text.

    In the dialog box File conversion set the switch Other and select the desired encoding from the list.

    In the region of Sample

If most of the text looks the same (such as squares or dots), your computer might not have the correct font installed. In this case, you can install additional fonts.

To install additional fonts, do the following:

    Click the button Start and select item Control Panel.

    Do one of the following:

    On Windows 7

    1. In the control panel, select the item Removing programs.

      Edit.

    In Windows Vista

      In the control panel, select the section Removing a program.

      In the list of programs, click Microsoft Office or Microsoft Word if it was installed separately from Microsoft Office, and then click Edit.

    On Windows XP

      In the control panel, click The installing and deleting of programms.

      In the list Installed programs click Microsoft Office or Microsoft Word if it was installed separately from Microsoft Office, and then click Edit.

    In Group Change your Microsoft Office installation press the button Add or remove components and then press the button Proceed.

    In section Installation options expand the item Common Office Tools, and then - Multilingual support.

    Select the font you want, click the arrow next to it, and then click Run from my computer.

Advice: When opening a text file in a particular encoding, Word uses the fonts defined in the dialog box Web Document Options... (To bring up a dialog box Web Document Options, click microsoft button Officethen click Word options and select a category Additionally... In section General press the button Web Document Options.) Using the options on the tab Fonts dialog box Web Document Options you can customize the font for each encoding.

Selecting encoding when saving a file

If you do not select an encoding when saving the file, Unicode will be used. Unicode is generally recommended as it supports most characters in most languages.

If you plan to open the document in a program that does not support Unicode, you can select the desired encoding. For example, in operating system in English, you can create a Chinese (Traditional) document using Unicode. However, if such a document will be opened in a program that supports Chinese but does not support Unicode, the file can be saved in Traditional Chinese (Big5) encoding. As a result, the text will display correctly when you open the document in a Chinese (Traditional) program.

Note: Since Unicode is the most comprehensive standard, some characters may not be displayed when storing text in other encodings. For example, suppose a Unicode document contains Hebrew and Cyrillic text. If you save the file in Cyrillic (Windows) encoding, the Hebrew text will not be displayed, and if you save it in Hebrew (Windows) encoding, the Cyrillic text will not be displayed.

If you choose an encoding standard that doesn't support some characters in the file, Word will mark them in red. You can preview the text in the selected encoding before saving the file.

When you save a file as encoded text, the text for which the Symbol font is selected and the field codes are removed.

Choosing an encoding

    Open the tab File.

    In field File name enter a name for the new file.

    In field File type select Plain text.

    If a dialog box appears Microsoft Office Word - Compatibility Checker, press the button Proceed.

    In the dialog box File conversion choose the appropriate encoding.

    • To use standard encoding, select the option Windows (default).

      To use MS-DOS encoding, select the option MS-DOS.

      To set a different encoding, select the switch Other and select the desired item from the list. In the region of Sample you can view the text and check if it is displayed correctly in the selected encoding.

      Note: To increase the display area of \u200b\u200bthe document, you can resize the dialog box File conversion.

    If the message "The text in red cannot be saved correctly in the selected encoding" appears, you can choose another encoding or check the box Allow character substitution.

    If character substitution is enabled, characters that cannot be displayed will be replaced by the nearest equivalent character in the selected encoding. For example, ellipsis are replaced with three dots, and corner quotation marks are replaced with straight ones.

    If the selected encoding does not have equivalent characters for the characters highlighted in red, they will be saved as out of context (for example, as question marks).

    If the document will open in a program that does not wrap text from one line to another, you can include hard line breaks in the document. To do this, check the box Insert line breaks and enter the desired break symbol (carriage return (CR), line feed (LF), or both) in the field End lines.

Find encodings available in Word

Word recognizes several encodings and supports encodings that come with the system software.

Below is a list of scripts and associated encodings (code pages).

Writing system

Encodings

Font used

Multilingual

Unicode (big endian UCS-2, UTF-8, UTF-7)

Standard font for the Normal style of the localized version of Word

Arabic

Windows 1256, ASMO 708

Chinese (Simplified)

GB2312, GBK, EUC-CN, ISO-2022-CN, HZ

Chinese (Traditional)

BIG5, EUC-TW, ISO-2022-TW

Cyrillic

Windows 1251, KOI8-R, KOI8-RU, ISO8859-5, DOS 866

English, Western European and others based on the Latin alphabet

Windows 1250, 1252-1254, 1257, ISO8859-x

Greek

Japanese

Shift-JIS, ISO-2022-JP (JIS), EUC-JP

Korean

Wansung, Johab, ISO-2022-KR, EUC-KR

Vietnamese

Indian: Tamil

Indian: Nepali

ISCII 57002 (Devanagari)

Indian: Konkani

ISCII 57002 (Devanagari)

Indian: Hindi

ISCII 57002 (Devanagari)

Indian: Assamese

Indian: Bengali

Indian: Gujarati

Indian: Kannada

Indian: Malayalam

Indian: Oriya

Indian: Marathi

ISCII 57002 (Devanagari)

Indian: Punjabi

Indian: Sanskrit

ISCII 57002 (Devanagari)

Indian: Telugu

    Using Indian languages \u200b\u200brequires operating system support and appropriate OpenType fonts.

    Only limited support is available for Nepali, Assamese, Bengali, Gujarati, Malayalam and Oriya.