Please Note: This article is written for users of the following Microsoft Word versions: 2007, 2010, 2013, and 2016. If you are using an earlier version (Word 2003 or earlier), this tip may not work for you. A version of this tip written specifically for earlier versions of Word is available under the same title, "Understanding ASCII and ANSI Characters."
Written by Allen Wyatt (last updated August 9, 2022)
Virtually everyone knows that a computer doesn't understand characters; it understands numbers. Thus, each character you see on the screen in a program such as Word is maintained internally as a number. The "mapping" of characters to numbers is known as a character set.
For the most part, Word relies on the character set used by whatever version of Windows you are using. Don't confuse the character set used by Windows with the character set used by the computer itself, as they are not the same. For instance, when you first boot your computer, you may see some start-up information on the screen. This information uses a character set maintained internally by the computer in ROM. Since Windows is not running when this information is displayed, the character set used by Windows cannot be in use. Once Windows is up and running, the computer's own character set is set aside and Windows relies on the one it maintains itself.
This may sound confusing, but it is not meant to be. In the relatively short history of computers, several different character sets have been used. The first character set used in small computers was ASCII, which is an acronym for American Standard Code for Information Interchange. It started as a code of 128 characters, using seven bits to represent all the characters. (A bit is a binary digit; it can have either of two values: on or off. Thus, seven bits can have 2^7 or 128 possible unique values.)
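The mapping of characters to seven-bit codes can be sketched in a few lines of Python. This is only an illustration, not something Word itself exposes; the names ASCII_SIZE and ascii_entry are made up for this example.

```python
ASCII_SIZE = 2 ** 7  # seven bits give 128 possible codes


def ascii_entry(ch):
    """Return (code, seven-bit binary string) for one ASCII character.

    Hypothetical helper for illustration only.
    """
    code = ord(ch)  # ord() gives the number behind a character
    if code >= ASCII_SIZE:
        raise ValueError(f"{ch!r} is outside the 7-bit ASCII range")
    return code, format(code, "07b")


# Show the number and the seven bits behind each letter of "Word".
for ch in "Word":
    code, bits = ascii_entry(ch)
    print(ch, code, bits)

# chr() goes the other way, from number back to character.
print(chr(65))
```

A character such as é falls outside the 128 ASCII codes, so ascii_entry rejects it.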
ASCII was first developed for machines that used only seven bits of each byte (such as teletypes). Early personal computers, however, used eight bits, and thus could utilize 2^8 or 256 possible values for a character code. This led to what was known as extended ASCII, where the first 128 characters matched those in ASCII, but the second 128 were left up to the computer manufacturer. In early IBM PC models, the extended ASCII character set included some foreign-language symbols and many line-drawing characters, used for rudimentary graphics.
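As a rough sketch of how the upper 128 values were used, the Python snippet below decodes a few byte values with the "cp437" codec, which is Python's name for the code page generally associated with the original IBM PC. Treating cp437 as "the" extended ASCII set is an assumption made for this example; other manufacturers filled the upper half differently. The function name decode_oem is hypothetical.

```python
def decode_oem(byte_values):
    """Decode a sequence of 8-bit values using code page 437.

    Assumption: cp437 stands in for the early IBM PC character set.
    """
    return bytes(byte_values).decode("cp437")


# Byte values above 127 that cp437 assigns to double-line box characters.
top = decode_oem([0xC9, 0xCD, 0xCD, 0xCD, 0xBB])
middle = decode_oem([0xBA, 0x20, 0x20, 0x20, 0xBA])
bottom = decode_oem([0xC8, 0xCD, 0xCD, 0xCD, 0xBC])
print(top)
print(middle)
print(bottom)

# The lower 128 values decode exactly as plain ASCII does.
lower_half = bytes(range(128))
print(lower_half.decode("cp437") == lower_half.decode("ascii"))
```

The three printed lines form a small box, the kind of rudimentary graphics that text-mode programs drew with these characters.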
Microsoft calls the character set utilized by your computer (as pointed out earlier in this tip) the OEM character set. (OEM means "original equipment manufacturer.") Early versions of Windows, through the Windows 95 family (Windows 95, 98, and Me), utilize what is called an ANSI character set. This is a single-byte character set that can represent up to 256 characters. The original ASCII character set occupies the first 128 characters of the ANSI set used in Windows. Windows NT and the versions descended from it (Windows 2000, Windows XP, and everything since), on the other hand, utilize the Unicode character set, which is described in other issues of WordTips.
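The difference between the OEM and ANSI sets shows up when the same byte is interpreted both ways. The sketch below assumes code page 437 for the OEM set and code page 1252 (the Western European Windows code page) for the ANSI set; your system may use different code pages. The function name interpret is hypothetical.

```python
OEM = "cp437"    # assumption: OEM code page of the original IBM PC
ANSI = "cp1252"  # assumption: Western European Windows "ANSI" code page


def interpret(byte_value):
    """Return the character one byte stands for in each character set."""
    raw = bytes([byte_value])
    return raw.decode(OEM), raw.decode(ANSI)


# Below 128, both sets agree with ASCII.
print(interpret(0x41))

# Above 127, the same number means different characters.
print(interpret(0xE9))

# The same character is stored as a different number in each set.
print("\u00e9".encode(OEM), "\u00e9".encode(ANSI))

# Unicode is not limited to one byte per character; in UTF-16,
# the encoding Windows uses internally, this character takes two bytes.
print(len("\u00e9".encode("utf-16-le")))
```

This is why text saved under one character set can display incorrect accented letters or symbols when opened under another, even though plain English letters and digits come through unchanged.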
Remember that this discussion of what the various versions of Windows use refers to what they use internally. Externally, for a typical Word user, there isn't much effect.
This tip (11330) applies to Microsoft Word 2007, 2010, 2013, and 2016.
2022-08-09 07:25:15
Jamies
I believe the article would have been vastly improved by at least a mention of the recent change MS made to the default character set used within Notepad. That change means many ASCII- and ANSI-based text readers and data transfer apps and utilities now translate data poorly. This includes some punctuation characters being converted into three-character sequences, some of which cause readers to insert a new line or new page instead of showing the appropriate punctuation!
Maybe mention replacements for Notepad, and the (probable) need to specify an alternative code page to the now-current default one.
Copyright © 2024 Sharon Parq Associates, Inc.