What does a 64-bit processor mean? This is not a marketing ploy

7. 10. 2013

Mike Ash dedicated on his blog the practical implications of switching to 64-bit architecture in the iPhone 5S. This article draws on his findings.

The reason for this text is mainly due to the large amount of misinformation being spread about what the new iPhone 5s with a 64-bit ARM processor actually means for users and the market. Here we will try to bring objective information about the performance, capabilities and implications of this transition for developers.

"64 bit"

There are two parts of a processor that the "X-bit" label can refer to - the width of the integer registers and the width of the pointers. Fortunately, on most modern processors these widths are the same, so in the case of the A7 this means 64-bit integer registers and 64-bit pointers.

However, it is equally important to point out what "64bit" does NOT mean: RAM physical address size. The number of bits to communicate with RAM (thus the amount of RAM a device can support) is not related to the number of CPU bits. ARM processors have anywhere between 26- and 40-bit addresses and can be changed independently of the rest of the system.

Data bus size. The amount of data received from RAM or buffer memory is similarly independent of this factor. Individual processor instructions may request different amounts of data, but they are either sent in chunks or received more than needed from memory. It depends on the size of the data quantum. The iPhone 5 already receives data from the memory in 64-bit quanta (and has a 32-bit processor), and we can encounter sizes up to 192 bits.
Anything related to floating point. The size of such registers (FPU) are again independent of the internal workings of the processor. ARM has been using 64-bit FPU since before ARM64 (64-bit ARM processor).

General advantages and disadvantages

If we compare otherwise identical 32bit and 64bit architectures, they are generally not that different. This is one of the reasons for the general confusion of the public looking for a reason why Apple is moving to 64bit in mobile devices as well. However, it all comes from the specific parameters of the A7 (ARM64) processor and how Apple uses it, not just from the fact that the processor has a 64-bit architecture.

However, if we still look at the differences between these two architectures, we will find several differences. The obvious one is that 64-bit integer registers can handle 64-bit integers more efficiently. Even before, it was possible to work with them on 32-bit processors, but this usually meant dividing them into 32-bit long pieces, which caused slower calculations. So a 64-bit processor can generally compute with 64-bit types just as fast as with 32-bit ones. This means that applications that generally use 64-bit types can run much faster on a 64-bit processor.

Although 64bit does not affect the total amount of RAM that the processor can use, it can make it easier to work with large chunks of RAM in one program. Any single program running on a 32-bit processor only has about 4 GB of address space. Taking into account that the operating system and standard libraries take up something, this leaves the program with somewhere between 1-3 GB for application use. However, if a 32-bit system has more than 4 GB of RAM, using that memory is a bit more complicated. We have to resort to forcing the operating system to map these larger chunks of memory for our program (memory virtualization), or we can split the program into multiple processes (where each process again theoretically has 4GB of memory available for direct addressing).

However, these "hacks" are so difficult and slow that a minimum of applications use them. In practice, on a 32-bit processor, each program will only use its 1-3 GB of memory, and more available RAM can be used to run multiple programs at the same time or use this memory as a buffer (caching). These uses are practical, but we'd like any program to be able to easily use chunks of memory larger than 4GB.

Now we come to the frequent (actually incorrect) claim that without more than 4GB of memory, a 64-bit architecture is useless. A larger address space is useful even on a system with less memory. Memory-mapped files are a handy tool where part of the file's contents are logically linked to the process's memory without the entire file having to be loaded into memory. Thus, the system can, for example, gradually process large files many times larger than the RAM capacity. On a 32-bit system, such large files cannot be reliably memory-mapped, whereas on a 64-bit system, it is a piece of cake, thanks to the much larger address space.

However, the larger size of pointers also brings one big disadvantage: otherwise identical programs need more memory on a 64-bit processor (these larger pointers have to be stored somewhere). Since pointers are a frequent part of programs, this difference can burden the cache, which in turn causes the entire system to run slower. So in perspective, we can see that if we just changed the processor architecture to 64-bit, it would actually slow down the whole system. So this factor has to be balanced by more optimizations in other places.

ARM64

The A7, the 64-bit processor powering the new iPhone 5s, isn't just a regular ARM processor with wider registers. ARM64 contains major improvements over the older, 32-bit version.

Apple A7 processor.

registry

ARM64 holds twice as many integer registers as 32-bit ARM (be careful not to confuse the number and width of registers - we talked about width in the "64-bit" section. So ARM64 has both twice as wide registers and twice as many registers). The 32-bit ARM has 16 integer registers: one program counter (PC - contains the number of the current instruction), a stack pointer (a pointer to a function in progress), a link register (a pointer to the return after the end of the function), and the remaining 13 are for application use. However, the ARM64 has 32 integer registers, including one zero register, a link register, a frame pointer (similar to a stack pointer), and one reserved for the future. This leaves us with 28 registers for application use, more than double the 32-bit ARM. At the same time, the ARM64 doubled the number of floating-point number (FPU) registers from 16 to 32 128-bit registers.

But why is the number of registers so important? Memory is generally slower than CPU calculations and reading/writing can take a very long time. This would make the fast processor have to keep waiting for memory and we would hit the natural speed limit of the system. Processors try to hide this handicap with layers of buffers, but even the fastest one (L1) is still slower than the processor's calculation. However, registers are memory cells directly in the processor and their reading/writing is fast enough to not slow down the processor. The number of registers practically means the amount of the fastest memory for processor calculations, which greatly affects the speed of the entire system.

At the same time, this speed needs good optimization support from the compiler so that the language can use these registers and does not have to store everything in the general application (the slow) memory.

Instruction set

ARM64 also brings major changes to the instruction set. An instruction set is a set of atomic operations that a processor can perform (eg 'ADD register1 register2' adds the numbers in two registers). The functions available to individual languages are composed of these instructions. More complex functions must execute more instructions, so they can be slower.

New in ARM64 are instructions for AES encryption, SHA-1 and SHA-256 hash functions. So instead of a complex implementation, only the language will call this instruction - which will bring a huge speedup to the computation of such functions and hopefully added security in applications. E.g. the new Touch ID also uses these instructions in encryption, allowing for real speed and security (in theory, an attacker would have to modify the processor itself to access the data - which is impractical to say the least given its miniature size).

Compatibility with 32bit

It is important to mention that the A7 can run fully in 32-bit mode without the need for emulation. It means that the new iPhone 5s can run applications compiled on 32-bit ARM without any slowdown. However, then it cannot use the new ARM64 functions, so it is always worthwhile to make a special build just for the A7, which should run much faster.

Runtime changes

Runtime is the code that adds functions to the programming language, which it is able to use while the application is running, until after translation. Since Apple doesn't need to maintain application compatibility (that a 64-bit binary runs on 32-bit), they could afford to make a few more improvements to the Objective-C language.

One of them is the so-called tagged pointer (marked pointer). Normally, objects and pointers to those objects are stored in separate parts of memory. However, new pointer types allow classes with little data to store objects directly in the pointer. This step eliminates the need to allocate memory directly for the object, just create a pointer and the object inside it. Tagged pointers are only supported in 64-bit architecture also due to the fact that there is no longer enough space in a 32-bit pointer to store enough useful data. Therefore, iOS, unlike OS X, did not yet support this feature. However, with the arrival of ARM64, this is changing, and iOS has caught up with OS X in this regard as well.

Although pointers are 64 bits long, on the ARM64 only 33 bits are used for the pointer's own address. And if we are able to reliably unmask the rest of the pointer bits, we can use this space to store additional data – as in the case of the mentioned tagged pointers. Conceptually, this is one of the biggest changes in the history of Objective-C, although it is not a marketable feature - so most users will not know how Apple is moving Objective-C forward.

As for the useful data that can be stored in the remaining space of such a tagged pointer, Objective-C, for example, is now using it to store the so-called reference count (number of references). Previously, the reference count was stored in a different place in memory, in a hash table prepared for it, but this could slow down the whole system in the case of a large number of alloc/dealloc/retain/release calls. The table had to be locked due to thread safety, so the reference count of two objects in two threads could not be changed at the same time. However, this value is newly inserted into the rest of the so-called isa indicators. This is another inconspicuous, but huge advantage and acceleration in the future. However, this could never be achieved in a 32-bit architecture.

Information about associated objects, whether the object is weakly referenced, whether it is necessary to generate a destructor for the object, etc., is also newly inserted into the remaining place of pointers to the objects. Thanks to this information, the Objective-C runtime is able to fundamentally speed up the runtime, which is reflected in the speed of each application. From testing, this means about 40-50% speedup of all memory management calls. Just by switching to 64-bit pointers and using this new space.

záver

Although competitors will try to spread the idea that moving to a 64-bit architecture is unnecessary, you will already know that this is just a very uninformed opinion. It's true that switching to 64-bit without adapting your language or applications doesn't really mean anything - it even slows down the entire system. But the new A7 uses a modern ARM64 with a new instruction set, and Apple has taken the trouble to modernize the entire Objective-C language and take advantage of the new capabilities - hence the promised speedup.

Here we have mentioned a large number of reasons why a 64-bit architecture is the right step forward. It is another revolution "under the hood", thanks to which Apple will try to stay at the forefront not only with design, user interface and rich ecosystem, but mainly with the most modern technologies on the market.

Source: mikeash.com

Topics: cache, operation, iPhone 5S, iPhone 5, RAM, touch ID, memory, touch, program, design

Discussion of the article

Mirek

7. 10. 2013 12:02

A lot of uninformed Android/Samsung people should read this article and then hide in the corner.

Czechboy0

7. 10. 2013 12:17

Well, we have to feel sorry for them. For years they excused the tragic UX and UI of Android by saying that they have the most technologically advanced OS with features and now they found out that they are years behind again :)

N2by

7. 10. 2013 13:56

If a person is not a sheep and listens to advertisements (and he is good at it), then after personal experience he can form his own opinion :-).
I try almost all the competition and form my own opinion.
For me, I need a new super high-performance mobile phone, because I don't spend much on it. That is I need less performance for less price ;-). Maybe I would prefer a slower one with a bigger battery.
On the other hand, the new procak would be useful for the iPad where there are a lot of games :-).

Fuhrer

10. 10. 2013 0:44

I'm Android/HTC :) because IT is quite fun for me and rooting and converting high-quality HW into a fast fighter is my hobby. And iOS won't let me do that. (It's not even necessary. More or less, iOS is designed so that everything works as it should and you don't have to do anything there. When I stop enjoying playing, I'll buy an apple and enjoy it). But I don't know why you keep attacking each other like kids. Apple is completely like Android. It's like comparing Democracy with Dictatorship and the like... I watched the conference when the iPhone 5S was introduced and despite the fact that I don't own anything from Apple, I liked the 64bit and other improvements that came. But not because I'm a complex honimír trtko who sits behind a PC and chases Android or Apple, but because I see the PROGRESS that won't keep me waiting for long. People should start working really hard so they don't have time to deal with bullshit, to put it politely.

sarge

10. 10. 2013 23:59

constructive contribution from the other side :) kiez it would open the eyes of the remaining 99% android positive

haw

12. 11. 2013 14:59

maybe 99% of apple fanatics should be discussed first, then we can have a constructive conversation

jakub

7. 10. 2013 12:15

very complex things explained simply... thanks

Jarda

7. 10. 2013 12:29

Great article! Yes, I agree that Android/WP users should read this article as a must. Instead of trolling and talking smart about "how 64b is useless in mobiles"…

haw

12. 11. 2013 14:58

you probably never had a wp in your hand, otherwise you wouldn't have this

Jacob

7. 10. 2013 12:46

Since its first successes in the mobile market, Samsung has done nothing but smear the competition, but in essence, it has been following in its footsteps all this time. Apple has always been a role model for tech companies, and if they focus only on mocking and constantly misinforming customers, they will soon stumble. Apple has always gone its own way and it has always been a matter of very good timing, which many competing companies in the industry lack.

Jarda

7. 10. 2013 13:02

One could say that Samsung is riding the wave and taking advantage of its possibilities. He bet on Android, he has great HW, he makes a lot of things himself, he has decent support. And like any predatory Asian company, it uses all the possibilities of advertising. And of course he steals and copies. What "slant-eyed" is good at is copying. They have calculated very well that it is much cheaper than going their own way, step by step. And as a strong company, it can simply afford this. Yet…

Peter

7. 10. 2013 13:16

I just don't understand why the speed of the phone is constantly increasing, give me some examples of what you use it for, it slowly makes no sense to me to increase the performance of the mobile phone, but I will remove the word marketing.

rikiless

7. 10. 2013 15:18

Games, poorly optimized games. Also, Transport Tycoon on iPad 3 does not run as smoothly and in the same resolution as it does on the desktop. Example.

Peter

7. 10. 2013 13:17

I just don't understand why the speed of the phone keeps increasing, give me some examples of what you use it for, it slowly makes no sense to me to increase the performance of the mobile phone, if I remove the word marketing from it.

dfx

7. 10. 2013 18:49

For video, audio and image processing. And on to the games.

Anyone who uses an iPhone only for calling, texting, and occasionally reading or sending emails and occasionally surfing the Internet will need an iPhone 4. I believe that there are many such users. Not everyone needs the best phone in the world :-)

haw

12. 11. 2013 14:56

sheep

Czechboy0

7. 10. 2013 23:14

Doesn't the physical trade-off between hardware and software mean anything to you? This reminds me a bit of the end of the 19th century, when the physicists of that time said that everything in physics had already been discovered and there was no need to continue (a decade before the theory of relativity and three before the quantum theory).

The pursuit of the best never ends. Sometimes the software leads and sometimes the hardware. But if one gets stuck, it won't let the other one go. We will not be so selfish to our descendants :) So to your comment - a faster phone will enable more powerful applications, which will be able to do a lot more than drives. And once things that even today's computers are not enough for. The future is exciting.

Mirek

7. 10. 2013 23:41

Exactly :)

Robin Martinez

7. 10. 2013 13:45

Nice article, but I don't understand why Apple didn't put 7GB of RAM in the A2. Yes, iOS multitasking is not such that 2GB is necessarily needed, but given the twice the length of the memory pointer, it would be much more suitable.

But otherwise, I agree that a 64-bit processor is "unnecessary" for a mobile phone, just as a retina display or an optical mouse instead of a ball was unnecessary - all these inventions were labeled as "unnecessary", but in my opinion the correct word is "timeless", because once must come and Apple is not afraid to come up with something new.

Czechboy0

7. 10. 2013 16:05

I second that. Unfortunately, even "useless" is not an accurate expression. Unnecessary means something whose priority a person does not know. That is definitely not true. Speed may not need such speed, but it will definitely recognize it. And when the software catches up with the hardware, there will be room for improvement again.

Robin Martinez

8. 10. 2013 8:03

Sure, I'm in favor, I mean the iP5 is really a pretty fast smartphone, so the 5S wouldn't have to be 64bit at all. But one day someone had to deal with it again and it was Apple and it was now. For as long as I can remember, experts have also talked about how 64-bit processors will be useless even in computers.

Dominik Materna

7. 10. 2013 16:16

For me, as an IT layman who almost failed matric, the conclusion is important. The whole article (supported by the comments) seems quite insightful to me, and although I won't be able to explain it, the A7 with 64-bit architecture is a step forward. Thanks for the info.

Daniel Hrynusiw

7. 10. 2013 16:17

I would edit the title of the article, as it is a marketing move. Every innovation is essentially a marketing move. :-)

Mirek

7. 10. 2013 23:43

I do not think. For example, Samsung uses marketing moves. They show up with RAM, which the iPhone doesn't need at all. Their getting away with features that aren't usable at all. Their purposely increasing processor performance for tests. Etc. That's marketing, although yes, it's misleading, which they shouldn't just get away with ;)

Your name

Your comment

By filling in the above data, I acknowledge that the company TEXT FACTORY s.r.o., registered office in Brno, Durďákova 336/29, Černá Pole, ZIP code: 613 00, ID number: 06157831, registered at the Regional Court in Brno, section C, insert 100399, will process my personal data provided in the registration form filled out by me on the basis of the legitimate interests of TEXT FACTORY s.r.o. according to Article 6 paragraph 1 letter f) GDPR and to fulfill legal obligations (Article 6, paragraph 1, letter c) GDPR), for the following purposes: the necessity to ensure the authorization of visitors to websites operated by TEXT FACTORY s.r.o. to actively contribute to published articles or within discussion forums and exercising the rights of TEXT FACTORY s.r.o. as the administrator of these discussion forums. More information about the processing of personal data and rights can be found in Lessons on personal data protection. the entire text

"64 bit"

General advantages and disadvantages

ARM64

registry

Instruction set

Compatibility with 32bit

Runtime changes

záver

Source: mikeash.com

Related