
Features:
SUN'S GUSTAFSON ON ENVISIONING HPC ROADMAPS FOR THE FUTURE
by Tim Curns, Editor
John Gustafson, a senior scientist and a principal investigator for Sun's HPCS
Program, took a brief departure from his work at Sun last year, refreshing his
"familiarity with the world of cluster computing and the needs of the HPC
community." He is now back at Sun, happily working with the 2009 timeframe of
the DARPA initiative. HPCwire tracked John down to examine some of his
thoughts on the hot topics in HPC today.
HPCwire: What do you find to be the most exciting technology right now?
John Gustafson: That's easy: Proximity Communication. Breakthroughs like this
don't come along very often. Here we thought we'd be doomed to worsening the
balance between compute speed and communication speed, and have to continue to
burden programmers with awareness of data locality. Thanks to Drost's and
Sutherland's innovation, we can realize almost three orders of magnitude more
bandwidth in and out of chips. Who would have guessed that we'd ever be able
to build a tightly-coupled supercomputer again? And it looks like it might not
just be a reprieve, but a real solution, since now we can expect that the
bandwidth improvements will track Moore's law just like device densities do.
HPCwire: What important technologies do you see as emerging in the next few
years?
JG: Well, let's start with hardware advances. Optical interconnect will get
much more practical for short-distance communications. Chips will apply
asynchronous design principles, both to let each part of the chip run as fast
as it can and to save power. I foresee a return to liquid cooling in the next
few years, because air cooling is hitting its limits at many large HPC sites.
Of course we'll be putting multiple processor cores on one chip, and multiple
threads per core, which is going to mess with the minds of people who insist
on knowing how many "processors" there are in a system.
By the way, I predict you'll see some dramatic improvements to the way we do
floating-point math in the next few years. We won't just be throwing 64-bit
IEEE 754 at every problem and hoping for the best. We'll use sometimes more
precision (because we'll discover we need it) and sometimes less (because too
much precision wastes bandwidth and memory and electrical power). We'll have
better tools for assuring numerical validity. You may know that Sun already
offers Interval Arithmetic in its compilers, and it continues to pick up
momentum in the HPC community. That's one promising approach, but certainly
not the only one. We need better math.
For software advances, we can thank DARPA for a renewed emphasis on ease-of-
programming. In the last decade, we lost ground in programmer productivity,
but soon we should get it back and then some. HPC programmers have long been
presented an ugly choice between ease-of-use and performance. Now it looks
like we might be able to have it both ways: high-level expressions like those
of Matlab or Mathematica, but higher performance than if you wrote the
operations out in low-level C or Fortran and used an optimizing compiler. Only
when systems run _faster_ when programmed with high-level abstractions will
HPC programmers use those abstractions. I know, because I'm one of those
programmers!
There's an advance coming that may be just as important to the future of HPC
as anything I just mentioned: meaningful benchmarks. There was a time when
LINPACK was a good way to motivate vendors to work on their floating-point
performance, but now it's doing a disservice because it can motivate vendors
to ignore memory system requirements. The HPC Challenge Benchmarks and the
Purpose-Based Benchmarks show a lot of hope for establishing rankings that
correlate with real workloads. But it will take years to wean some people off
their Top 500 addiction and get them asking the right questions about system
performance.
HPCwire: How will Sun be involved in leveraging these technologies?
JG: Every single thing I just mentioned is being used in our HPCS approach.
For some of the technologies, like Proximity Communication, we're developing
the intellectual property entirely in-house and protecting that IP carefully.
The Purpose-Based Benchmarks are developed at Sun but shared freely with
vendors and customers. And for things like optical communication, we work
closely with third parties that are leaders in their areas to make sure
they're targeting HPC-specific needs.
HPCwire: What about parallel processing? Does that fit under the umbrella of
Grid computing? What does Grid computing mean to you, as definitions of it
defy general consensus?
JG: There are indeed some areas where parallelism is so easy that there's no
problem farming it out over a geographically distributed set of machines.
Factoring large integers, or processing SETI data, for example. But for
physical simulations, dispersing processors geographically for simultaneous
execution makes no sense to me. Supercomputing is currently limited by heat,
size, and cost. If you make the distance greater, you'll use more total
electricity (though the waste heat will at least be less concentrated). The
longer communication wires will cost more than if things were closer together.
So the way to use the Grid to supercomputing is simply to use it to tap into a
closely-knit cluster or SMP that offers the lowest cost for the compute
cycles.
You wouldn't believe the effort our engineering team puts into removing every
inch from the interprocessor communications path. Adding miles is
inconceivable in the realm of HPC applications we're addressing.
I hear very different uses of the term "Grid computing." It usually refers to
the ability to get to computing on a network, like an electric power consumer
uses a power grid. I think you can probably imagine our CEO, Scott McNealy,
cracking a lopsided grin and saying, "Gosh, what a novel idea!" I first heard
the idea of utility computing expressed by a guy named Ross Perot in the early
1980s. Sun has pitched this as the way to go since the company was founded in
1982, so the idea goes back at least that far.
The big advantages of the Grid are that it gives you the competitive pricing
of many choices, and it balances the load so you make better use of all the
systems that are out there. I hope people start using quality-of-service
measures in their Grid pricing, particularly as it applies to floating-point
arithmetic. For instance, if you do Interval Arithmetic on different
computers, your quality is how narrow the interval is that includes the
answer. Right now, if you do HPC-type calculations on different computers on
the Grid and get different floating-point answers, you have no way of knowing
which one to trust.
That's really a big hurdle that we should work to overcome. My vote would be
to expand the set of applications that we can handle with Interval Arithmetic,
and that solves the problem of porting approximate answers over the Grid. But
if people have other suggestions, I'm listening!
HPCwire: While we await responses to that, let's look at some of your history.
Could you explain your work with the Atanasoff-Berry Computer (ABC) and its
restoration? Why is this important?
JG: We are going from petascale to, uh, milli-scale! OK, back to computing in
the 1930s.
One reason I think historical computers are so important is because they often
made brilliant use of the parts and switching technology they had at the time.
There's a general sense that because computing technology changes so quickly,
we have nothing to learn from work that's more than ten years old. That's
absolutely not true. Engineers keep reinventing concepts and giving them new
names, unaware that a little knowledge of prior art could have saved them the
trouble.
The ABC was the first electronic computer to use base-2 math, and to make the
giant leap of using Boolean logic to perform arithmetic electronically. But
it's also the first machine to use dynamic RAM... capacitors, periodically
refreshed to maintain their voltage indicating a zero or one value. That was a
clever way to get the cost down, since capacitors cost pennies and vacuum
tubes cost dollars back then. And the ABC had a system clock: 60 Hz from the
line voltage! Moore's law works quite well if you go backward from Moore's
1970 paper, incidentally. In 1939, the ABC had 0.3 kilobytes of RAM in about
two cubic feet of space.
The original ABC was dismantled, and the only original parts remaining are a
memory drum and one of the binary add-subtract units. People get very
emotional about claims of who was "first" to do something in early computing,
and the ENIAC defenders are exceptionally unwilling to admit that their
"first" was based on any prior art. That was one reason I got involved.
The ABC was designed to solve dense linear equations. See, even back in the
1930s people wanted a LINPACK solver! Solving linear systems was the killer
app, because they could limp along through many things with pencil-and-paper
and Marchand or Monroe mechanical calculators. I suggested to Jack Dongarra
that maybe we could get the ABC on his list, with a speed of 0.06 flops per
second. Not megaflops. Flops. It took about 15 seconds to do a multiply-add!
Maybe we could start a "Bottom 500" list, with the ABC as the all-time
champion. Kidding. But if you like plotting graphs of LINPACK speed through
the years, you can now do it going back 65 years! Quite a baseline.
HPCwire: It sure is. How did you get involved in such a project?
JG: I did my Ph.D. work at Iowa State, and used to walk every day through the
physics building where the ABC was invented. There was a poster on the wall
bragging about it, and I always thought, "Yeah, right. I'll bet every
university in the country thinks they invented the electronic computer." I was
a typical cynical grad student.
You may recall that Sun acquired the part of Cray that used to be Floating
Point Systems. When I was at FPS in the 1980s, I managed the development of a
machine called the FPS-164/MAX, where MAX stood for Matrix Algebra
Accelerator. It was a general scientific computer with special-purpose
hardware optimized for matrix multiplication (hence, dense matrix factoring as
well). One of our field analysts, a well-read guy named Ed Borasky, pointed
out to me that our architecture had precedent in this machine developed a long
time ago in Ames, Iowa. He showed me a collection of original papers reprinted
by Brian Randell, and when I read Atanasoff's monograph I just about fell off
my chair. It was a SIMD architecture, with 30 multiply-add units operating in
parallel. The FPS-164/MAX used 31 multiply-add units, made with Weitek parts
that were about a billion times faster than vacuum tubes, but the
architectural similarity was uncanny. It gave me a new respect for historical
computers, and Atanasoff's work in particular. And I realized I shouldn't have
been such a cynic about the historical display at Iowa State.
I also learned about the controversy regarding the ENIAC, how Mauchly had
appropriated Atanasoff's ideas as his own, in what may be the greatest act of
intellectual piracy in the 20th century, and that hit a nerve with me.
The ENIAC clan claimed that the ABC never worked, or was never finished to the
point where it could do anything useful. So when I heard that George Strawn
and Del Bluhm were planning to get funds to reconstruct the ABC from the plans
and other data, I jumped in with both feet and ultimately led the project to
completion. And guess what? The ABC works very well, thank you! It solves
systems of linear equations. Very, very slowly. Like, eight hours for 29
equations in 29 unknowns. And we discovered that the original machine had been
used for solving least squares problems in the Iowa State Statistics
Department, so it wasn't as if it never became a "production" computer.
HPCwire: What about your immortal code? Describe your work here as well,
please. Does it relate to your current work?
JG: "Immortal code" was a phrase I coined for an effort to write computer
programs that could survive at least 50 years of changes in technology.
Mathematics doesn't go out of date; why should algorithms? The problem has
been that even the most mathematically-expressed algorithms still make all
kinds of assumptions about the machinery they'll run on. The classic programs
assumed sequential execution, and that arithmetic was much harder to do than
moving data around. We all know how those assumptions have been invalidated by
technology, so now we have to rewrite tons of code that had seemed very solid.
How do we prevent this from happening again?
We discovered that a lot of the "software rot" out there resulted from people
stepping out of their proper role, and doing someone else's job badly. Like a
chemist who unrolls a loop in the belief that it helps the compiler, only to
later discover the loop runs faster in its original and easier-to-read form.
Or the system software designer who assumes physicists want an alarm to sound
and the program to halt when e to a negative big number causes an underflow.
The solution was for each player -- hardware designer, system software
designer, application programmer, user -- to stay strictly in role and
communicate explicitly what happens at their interfaces. Think of it as the
communication of both requirements and concessions, with an emphasis on
concessions. If you can tell a library designer, for example, that you don't
care if intermediate results in complex arithmetic cause overflow, then the
library can promise much higher speed in return. About the only way we have
now of communicating these things is with clunky compiler flags that are
arcane and vendor- specific.
It's funny that you should ask about this, because a lot of that work turned
out to be exactly what DARPA needed in its HPCS program. HPCS requires
robustness and portability, and the Immortal Code work gave me a head start in
working out how we might be able to write software in such a way that it
doesn't have to be reverse-engineered and modified at huge expense every few
years, like we do now.
HPCwire: Ok, that brings us back to HPCS. Could you describe Sun's HPCS
research programs that are related to the DARPA grants? What kind of
advancements are being made here? What phase are you in now? How are you
involved?
JG: There are, at last count, 24 teams working on the HPCS program at Sun,
covering every aspect of the system. Administration, I/O, Proximity
Communication, Purpose-Based Benchmarks... and Product Concept Definition.
Like IBM and Cray, Sun is in the middle of "Phase II" right now, which is the
minimization of the risks in the radical ideas we proposed in Phase I. The
idea is that by the end of Phase II, we'll have a plausible case for actually
building a system using what we've learned, and can propose a real product to
DARPA that would have broad commercial and government use.
At SC2004, we really let people have a good look at how Proximity
Communication allows us to build the equivalent of Wafer-Scale Integration,
but with high yield and repairability. We'll be dribbling out other milestones
over the next few months, so watch for our announcements. There's a lot more
where this came from.
As for my role, I got Sun to Phase II and was PI for the program initially.
I'm now concentrating on the applications of the Hero system. I'm one of the
dyed-in-the-wool HPC developers at Sun and as such I can infuse real-world
experience into the design process. I provide in-house guidance about what
does and what doesn't matter to the HPC community.
HPCwire: What DOES matter to the HPC community? What's next in this area,
particularly for Sun?
JG: We're getting incredible response to our "utility computing" program...
and virtually all the early adopters are in HPC. Manufacturing. The energy
industry. Life sciences. A lot of Sun's revenue comes from technical and
scientific sales. But you have to realize that you're talking to Sun's biggest
HPC bigot.
HPCwire: Of course. Will Sun be involved in further government initiatives?
Why Sun?
JG: The HPCS program is certainly not the first collaborative development
project Sun has done jointly with the government... and we have several
interesting projects underway currently. I can tell you that we're doing what
it takes to make a case for Sun being in Phase III of HPCS, and that we have
excellent support from Sun executives.
Why Sun? I hear that question at conferences a lot. People had this impression
that Sun only builds workstations, then they had the impression that we only
sold dot-com hardware, so why are we involved in HPC? That's a crazy
pigeonhole to put us in, like saying that IBM only builds mainframes. The
thing Sun can now bring to HPC is the full range of memory models, from cheap
commodity clusters to giant flat-address-space SMPs, and offer the
architecture that actually matches the application. If a customer has an
application that works with message passing, then Sun has Opteron-based
clusters that have outstanding price/performance. But if they don't have
locality in their application, or don't want to invest in the programming
effort to fit the MPI model, then fine. We've got an SMP roadmap that goes to
2010 and beyond and scales to huge numbers of processor threads.
HPCwire: How has HPC become even more important in the last few years? Where
do you see HPC heading in the next 5 years?
JG: HPC is always important, even if some other markets temporarily steal all
the limelight now and then. You need "world's fastest computer" efforts to
make ALL of computing make advances... that's what's been happening for many
decades. The breakthroughs come from the bleeding edge stuff, and then
percolate down to more conventional computing.
In the last few years, there's been increasing realization that clusters are
very cost-effective, accompanied by a realization that they don't cover all of
the applications we used to do with supercomputers. So the main trend I see
for HPC is a huge effort to fill in that gap. One way is to incrementally
improve cluster software and put in a lot of work to make them turnkey
application servers. Another way is to architect monolithic, closely-coupled
systems that are so much easier to use that the saving in programmer and
administrator cost is more than enough to offset the higher hardware cost.
You'll see a lot of both approaches in the next few years.
HPCwire: Thanks so much for taking the time to answer some questions about
your work and HPC in general. I'm sure the community will be watching your
work with Sun very closely.
John Gustafson has been published many times and has served as a computational
scientist, SCL associate scientist, adjunct associate professor of Computer
Science and Applied Mathematics at Iowa State University, subject area editor
for Journal of Parallel and Distributed Computing, and on the editorial board
for Parallel and Distributed Computing Practices.
|