L1, L2 and L3 Cache's on CPU's

morph · Aug 21, 2007

Oh man! Ad-blocking software has been detected! :'(

This website is run by the community, for the community... and it needs advertisements in order to keep running. Blocking our ads means your killing our stats!
Please disable your ad-block, or become a premium member to hide all advertisements and this notice.

Just somthing i've been wondering, ovbiously the early cpu's didnt have L2 or L3 caches, whilst reading up about all this the more modern cpu's have an L3 cache, for example on the Intel Itanium 2 its got an L3 cache of 1.5meg-3meg whereas its L1 cache is 32kb - why isnt the L1 cache the biggest so the cpu goes there first? Or am i missing somthing ovbious (which wouldnt surpise me )

Fergal1982 · Aug 21, 2007

Oh man! Ad-blocking software has been detected! :'(

This website is run by the community, for the community... and it needs advertisements in order to keep running. Blocking our ads means your killing our stats!
Please disable your ad-block, or become a premium member to hide all advertisements and this notice.

http://en.wikipedia.org/wiki/CPU_cache#Multi-level_caches said:

Larger caches have better hit rates but longer latency. To ameliorate this tradeoff, many computers use multiple levels of cache, with small fast caches backed up by larger slower caches.
Click to expand...

So there you have it. Check out the rest of the article for more detailed information on the caches.

greenbrucelee · Aug 21, 2007

morph said: ↑

Just somthing i've been wondering, ovbiously the early cpu's didnt have L2 or L3 caches, whilst reading up about all this the more modern cpu's have an L3 cache, for example on the Intel Itanium 2 its got an L3 cache of 1.5meg-3meg whereas its L1 cache is 32kb - why isnt the L1 cache the biggest so the cpu goes there first? Or am i missing somthing ovbious (which wouldnt surpise me )
Click to expand...

When the cpu is using lots of bits of data the cpu needs to access the ram, but ram isnt fast enough so the CPU uses cache the Level 1 cache is the first one used then the 2nd then the 3rd.

The reason why the L1 cache is smaller than the L2 is because a smaller level 1 cache and a bigger level 2 cache make the cpu (pipelining etc) much more efficient

morph · Aug 21, 2007

ah ok cool ta

dmarsh · Aug 21, 2007

Firstly thats a very good question !

The not fast enough bit is to do with latency, however it could be possible to do away with some of the latency but that would up the cost.
The latency is related to how physically fast the memory and bus systems are, this is also affected by physical distance.
Faster memory and better bus connections cost more, thats why the caches get bigger and slower, its the best way to get the most bang for buck.
Much of the design of things in modern computing is to do with issues relating to latency, if the processor, memory and disk subsystems were closer matched then we
wouldn't bother with extra design features like level 3 caches.

The presence of the cache is also to do with the architecture, see Von Neumann bottleneck.

The cache levels also refer to how close they are to the processor. Level 1 cache is 'on-chip' cache, as such it uses up valuable real estate on the silicon. There are only so many transistors that can fit within a set area, transistor count is based on the size of the die and the size of the gates or density. The bigger the die the more waste as impurities will cause more faulty units and a lower yield. More transistors allow for more complex and powerful processors, so making the level 1 cache bigger could be detrimental to the overall design, as it would use transistors that could be used for other logic or lower the yield by increasing the die size.

Moores law covers alot of this, many people think they understand moores law as they have the media's attention deficit disorder definition, they generally don't.

Moores Law :-

http://arstechnica.com/articles/paedia/cpu/moore.ars/3

Caches in general :-

http://en.wikipedia.org/wiki/CPU_cache

Design is the careful balancing of multiple forces or variables.

So on one level you are right, its just that your processor design would probably cost you £10,000, and it might not scale as well as 10 x £1000 processors !

Of course you can also pay for the extra complexity, it works fine on a SISD architecture, as soon as you bring in multiprocessor architectures you have cache snooping and cache coherency to deal with.

Log in or Sign up

L1, L2 and L3 Cache's on CPU's

morph Byte Poster

Fergal1982 Petabyte Poster

greenbrucelee Zettabyte Poster

morph Byte Poster

dmarsh Petabyte Poster

Share This Page

Navigation

Popular Forums

Useful Links

Log in or Sign up

L1, L2 and L3 Cache's on CPU's

morph Byte Poster

Fergal1982 Petabyte Poster

greenbrucelee Zettabyte Poster

morph Byte Poster

dmarsh Petabyte Poster

Share This Page

Useful Searches