There are myriad potential benefits of large
processor architectures. The first is that, relatively speaking,
infinite quantities of RAM and storage space can be directly
addressed.
2128
2
128
words of memory, presuming each word is 128 bits
long, is in the range of
4 *
1040
4 *
10
40
bytes of memory. A 256-bit CPU
could theoretically address
2 *
1079
2 *
10
79
bytes of memory!
Realistically, this much storage will never be available to us.
And, if the memory addresses are actually encoded in a word-length
instruction along with the register to use and the instruction
itself, there would only be
6 *
1035
6 *
10
35
bytes addressable. This is
still well beyond the capabilities of any machine to enclose, and
should, therefore, provide for nearly infinite use of this
architecture into the distant future.
Other benefits from using such a large
architecture come in the scientific processing world. Computer
scientists constantly need to think about the level of precision
their machines can handle. Even top-end supercomputers cannot
handle the floating-point operations I described above. Large-scale
simulations, such as those performed by the ‘Earth
Simulator’ in Japan, could have their accuracy greatly
enhanced by using larger architectures. Floating-point round-off
error in 256 or 512 bits is not worrisome to the vast majority of
the scientific community.
Other possible uses for this processor include
engineering workstations, data-mining servers, dedicated
cryptography machines, and the high-end graphics market. A wide
data path, such as the one described in this article, makes
implementing and executing most encryption algorithms simple. Most
encryption algorithms operate on large chunks at a time, such as
the Advance Encryption Standard, which uses either 128- or 256-bit
blocks in the rounds of its algorithm.
Performing advanced data analysis and mining
using a machine that could directly address all of the storage in
use on the planet would be a statistician’s dream. Engineers
and graphics professionals alike would see their productivity
enhanced by having more of their calculations performed more
directly rather than in extremely segmented chunks as it is on
current, common-off-the-shelf hardware.