Understanding Big and Little Endian Byte Order (2024)

There are two different methods for describing the order in which a sequence of bytes are stored in digital systems:

Big Endian: places the most significant byte first (also known as network byte order)
Little Endian: places the least significant byte first

The term endian comes from the novel Gulliver’s Travels by Jonathan Swift. In this fictitious world there were two island nations, Lilliput and Blefuscu. They were mortal enemies because the emperor of Lilliput had decreed that boiled eggs were to be cracked at the “little end”, whereas on Blefuscu they had always cracked their eggs at the “big end”. This seemingly trivial controversy had led to ongoing war between the two empires during which many thousands had been killed. It illustrates the fact that something quite simple can be done in two completely different ways.

Danny Cohen wrote a technical publication in 1980 entitled “On Holy Wars and a Plea for Peace” in which he reused the terms in the context of computing and telecommunication. He discussed the order that individual bytes within a larger ‘word’ (unit of data) can be stored and transmitted. He explained the issue of deciding whether the little end or the big end should come first. He ended his article with:

It may be interesting to notice that the point which Jonathan Swift tried to convey in Gulliver’s Travels is exactly the opposite of the point of this note. Swift’s point is that the difference between breaking the egg at the little-end and breaking it at the big-end is trivial. Therefore, he suggests, that everyone does it in his own preferred way. We agree that the difference between sending eggs with the little, or the big-end first is trivial, but we insist that everyone must do it in the same way, to avoid anarchy. Since the difference is trivial we may choose either way, but a decision must be made.

To fully understand and interpret data correctly, it is important to understand how data is stored. Digital forensic examiners have to understand the byte order concept so that they can correctly interpret the data they encounter during a forensic examination. Unfortunately, it is highly likely that both formats will be encountered on a regular basis.

The term ‘endian’ as derived from ‘end’ may lead to confusion. The end denotes which end of the number comes first rather than which part comes at the end of the sequence of bytes. The basic endian layout can be seen in the table below:

To understand this and get a handle on endianness, we will start with number base ten. Our decimal number system is typically written in big endian format. Numbers are placed so that the most significant (largest) values are located to the left and the least significant (smallest) to the right; therefore, when moving across the number from left to right, the most significant values are encountered first. As the number increases in value, we move from the least significant digit on the right to the left, each digit we add is worth ten times the previous digit. These positions are commonly known as units, tens, hundreds, thousands and so on. The weighted values for each position (up to one million) is as follows:

The number 3265 is easy to understand in decimal terms. It is made up as follows:

(3 x 1000) + (2 x 100) + (6 x 10) + (5 x 1)

When creating a number, we start with the units and add further digits as needed to create the number we want. For more information on number bases, see our introductory article: Introduction to Number Systems.

Big Endian

In hex, using 16 bits, the weighted value for each position is as follows:

In big endian format, the number 0123₁₆ would be calculated as follows:

(0 x 4096) + (1 x 256) + (2 x 16) + (3 x 1) = 291₁₀

In hex, this number would be represented as 123₁₆ (or 0x0123).

Little Endian

In hex, using 16 bits, the weighted value for each position is as follows:

In little endian format, the value would be calculated as follows:

(0 x 16) + (1 x 1) + (2 x 4096) + (3 x 256) = 8961₁₀

In hex, this number would be represented as 2301₁₆ (or 0x2301).

Big Endian

In hex, using 32 bits, the weighted value for each position is as follows:

In big endian format, the number 1230000₁₆ would be calculated as follows:

(0 x 268435456) + (1 x 16777216) + (2 x 1048576) + (3 x 65536) = 19070976₁₀

In hex, this number would be represented as 1230000₁₆ (or 0x01230000).

Little Endian

In hex, using 32 bits, the weighted value for each position is as follows:

In little endian format, the value would be calculated as follows:

(0 x 16) + (1 x 1) + (2 x 4096) + (3 x 256) = 8961₁₀

In hex, this number would be represented as 2301₁₆ (or 0x00002301). The leading zeros would normally be dropped as their presence makes no difference to the numeric value; however, they do help to indicate the value is stored as a 32 bit integer.

Another aspect of endianness which seems to cause confusion are the different methods for encoding and storing multi-byte characters. In the following examples, we will use the Notepad application to save a small text file in big and little endian UTF-16 format and then examine the file in a hex viewer. For a primer on character encoding, see our introductory article: Character Encoding: A Quick Primer.

Little Endian UTF-16 Text

In the image below, we can see the text represented by different UTF-16 code points, displayed in hex format. In this case, the text is stored in little endian format.

The first two bytes FF FE represent a byte order mark (BOM). Byte order marks describe the endianness of a text stream and the encoding used. With UTF-16, each character uses two or more bytes.

With standard ASCII encoded as UTF-16, we can see that each character in the text above only requires two bytes. In the case of little endian format, the least significant byte appears first, followed by the most significant byte. The letter ‘T’ has a value of 0x54 and is represented in 16 bit little endian as 54 00.

Big Endian UTF-16 Text

In the image below, we can see the text represented by different UTF-16 code points, displayed in hex format. In this case, the text is stored in big endian format.

As with the previous example, the text stream starts with a byte order mark FE FF. This indicates a UTF-16 stream in big endian format. As this text is stored in big endian format, the most significant byte is encountered first in each two byte character. The letter ‘T’ has a value of 0x54 and is represented in 16 bit big endian as 00 54.

The image below shows the bytes used in a sequence of two byte characters. Each two digit hex number represents a byte in the stream of text. You can see that the order of the two bytes that represent a single character is reversed for big endian vs. little endian storage. The byte order mark indicates which order is used so that applications can decode the content.

The following list highlights the endianness of some common file formats:

BMP – Little Endian
GIF – Little Endian
JPEG – Big Endian
MPEG-4 – Big Endian
PNG – Big Endian
TIFF – Both, Endian identifier encoded into file

Other articles in our core knowledge series for learning digital forensics:

Introduction to Number Systems
Character Encoding: A Quick Primer

FAQs

Understanding Big and Little Endian Byte Order? ›

In a big-endian machine, the big end of the data is stored first. In the case of multiple bytes, the biggest byte is the first one with the lowest address. On the other hand, little endian machines store data little-end first, with the first byte being the smallest in case of multiple bytes.

Learn More ›

What do you mean by little endian byte order and big-endian byte order? ›

A big-endian system stores the most significant byte of a word at the smallest memory address and the least significant byte at the largest. A little-endian system, in contrast, stores the least-significant byte at the smallest address.

What do you understand by the big-endian and the little endian configurations? ›

Differences between big and little endian

A big endian representation has a multibyte integer written with its most significant byte on the left; a number represented thus is easily read by English-speaking humans. A little endian representation, on the other hand, places the most significant byte on the right.

Discover More Details ›

How to know little endian or big-endian? ›

If the output starts with a 1 (least-significant byte), it's a little-endian system. If the output starts with a higher digit (most-significant byte), it's a big-endian system.

Learn More Now ›

What is big-endian and little endian assignment with example? ›

Big-endian stores the most significant bytes first, whereas little-endian stores the least significant bytes first. For example, in a memory space starting from 1010, the 16-bit word 123A stored via little-endian would be assigned with 3A going into 1010 and 12 going into 1011.

Learn More Now ›

What is the difference between little-endian and big-endian binary? ›

There are two sensible possibilities: Big-endian is when the most significant bit is considered to be first. Little-endian is when the least significant bit is considered to be first.

View Details ›

What is big-endian and what is little-endian in terms of how to store the instructions in the memory what is the default endian used in ARM processors? ›

There are two basic ways of viewing bytes in memory - little-endian and big-endian. On big-endian machines, the most significant byte of an object in memory is stored at the least significant (closest to zero) address. On little-endian machines, the most significant byte is stored at the highest address.

What is the difference between little endian and big-endian 16 bit? ›

Big-endian is an order in which the "big end" (most significant value in the sequence) is stored first (at the lowest storage address). Little-endian is an order in which the "little end" (least significant value in the sequence) is stored first.

How do you read bytes in little endian? ›

Little-endian: In little-endian format, the least significant byte is stored at the lowest memory address, and the most significant byte is stored at the highest memory address. It is as if the data is read from right to left.

What is the most significant byte? ›

In particular, the leftmost (first) byte is the most significant (containing the most significant eight bits of the corresponding bit string), and the rightmost (last) byte is the least significant (containing the least significant eight bits of the corresponding bit string).

Get More Info ›

Why does endianness matter? ›

If my computer reads bytes from left to right, and your computer reads from right to left, we're going to have issues when we need to communicate. Endianness means that the bytes in computer memory are read in a certain order. We won't have any issues if we never need to share information.

What is an example of a little endian byte? ›

For example, an 8-byte Data Element with VR of FD, might be written in hexadecimal as 68AF4B2CH, but encoded in Little Endian would be 2C4BAF68H.