HPCwire
 The global publication of record for High Performance Computing / September 24, 2004: Vol. 13, No. 38

  |  Table of Contents  |  

Cluster Computing:

CONFERENCE CHAIRS DISCUSS THE IMPORTANCE OF CLUSTERING
by Tim Curns, Editor

HPCwire caught up with the general co-chairs of IEEE's Cluster 2004 Conference (http://grail.sdsc.edu/cluster2004/) Daniel Katz, of the Jet Propulsion Laboratory (JPL), and Henri Casanova, Assistant Research Scientist at SDSC, to ask some questions about the show and cluster computing in general.


HPCwire: To begin with, please give a brief overview of what you've seen or heard at Cluster 2004.....what's happened here in San Diego? What has impressed you? And how do these things compare to what you expected?

Henri Casanova: Being involved with the conference organization I did not have the chance to spend too much time in the technical sessions or even at the exhibit, so my view of the conference as an event is probably not representative of what a regular attendee would experience. I was personally impressed by the number of exhibitors that stepped in to sponsor the conference and showcase their technology. This of course helped the conference financially, but also contributed greatly to a more diverse and enjoyable experience. This has been above my expectations. This year we decided to have only 2 parallel technical sessions, which I believe worked much better than 3. It made the conference more focused, and the technical sessions livelier.

HPCwire: So Dan, what has impressed and or suprised you?

Dan Katz: The effort made by all the people to attend/present/exhibit/etc. It's all been more work than I was expecting.

HPCwire: Is this the first time for both of you chairing an event like this? What other experiences have helped you serve as General co-Chairs?

HC: It was my first time chairing an even like this!

DK: I've had a variety of experiences in the Cluster series, including being 1 of 3 Program Vice-Chairs, Deputy Program Chair, General Vice-Chair, Co- Program Chair, and now Co-General Chair. Cluster2001 was most helpful to me, because that meeting was organized by JPL, and it seems that the local people really see all of what's happening, where the remote people often only see their part of the conference.

HPCwire: Let's get into some cluster talk. Where do you see the line between cluster computing and Grid computing being drawn?

HC: The line can of course be defined from the traditional technical standpoints (in a machine room, with a single file system, over a switch, etc.). Today, the most common components of large-scale grids are clusters, and, rather than trying to partition the domains, lot of work is underway trying to understand what work can be done both on the cluster side and on the Grid side so that such platforms can be easily constructed, maintained, and used. So, there is a clear interface and overlap between these two worlds. Furthermore projects such as the Optiputer push the limit of the classical definition of a cluster as fast optical links allow for building clusters whose nodes are separated by long distances. This is where the line between Grid and cluster computing becomes very blurred, and it will be interesting to see what develops over the next couple of years.

DK: Clusters are not grids, though they can be elements in grids. On the other hand, clusters can be used in a manner similar to grids, though doing so seems to be a waste of resources. The difference is the networking - clusters are much more tightly bound together than grids. Also, clusters are intended to be a single system, while grids are intended to be used separately, but of course can be used together.

HPCwire: So how do you see SDSC and UCSD fitting into these areas?

HC: SDSC and UCSD are leaders in both areas and involved in projects such as TeraGrid and Optiputer, that both involve deep cluster computing and Grid computing issues, spanning the whole spectrum between the two, and contributing to defining where the field is going.

HPCwire: What exactly makes PC clustering easier to utilize for classical supercomputer users, as well as experimental data processing groups and bioinformatics users?

HC: We come from a world where users/programmers used to gain immense experience developing applications on particular supercomputers, and used to go through painful transitions each time a new generation of machines would be available, possibly from a different vendor, with a different architecture, with a different operating system, and different compilers. By contrast, there is a large effort today to standardize cluster computing (as exemplified in the Beowulf idea), a lot of community code is available that can run on such clusters, entire frameworks are available for cluster management (e.g., the ROCKS project), and Linux (or flavors of it) has emerged as a de-facto O/S. This provides a much more uniform development environment, which is affordable to a much larger community of users, and which comes with a large and common software and knowledge base. In fact, Dr. Ryutaro Himeno, form the Riken supercomputer center in Japan, who was invited as a keynote speaker at Cluster'04 in San Diego, presented a very compelling case of why they made the decision to move from vector supercomputers to clusters for their scientific users, how it made sense in terms of cost, of performance, and of ease-of-use for users. His presentation at the conference is only one of many examples of how clusters are a viable alternative to supercomputers for many classes of scientific users, even in settings in which entire generations of users have been trained on traditional supercomputers.

HPCwire: In addition to moves like Riken's, how do you think parallel processing and clustering has changed over the last decade or so? What issues are more prominent this year at Cluster 2004, than last year for instance?

HC: Cluster computing has become mainstream over the last decade, initiated by the Beowulf project, and we can see this today during the conference (not being able to attend Cluster'03 and Cluster'02, I cannot comment about what issues are more prominent this year). Of course, in terms of parallel processing, the main evolution has been Grid computing, and clusters are the most common components of a Grid platform.

DK: Obviously clusters are a new feature in parallel processing over the last 10 or so years, as they have now basically driven off many more powerful systems. The consequence of this is that only those applications that run well on clusters are being pursued, and much of work in new applications is stagnating.

The RAIT workshop (RDMA Applications, Implementations, and Technologies) was new to Cluster this year, and had about 40 attendees. This seems to be a new area of growth for the conference and for the world of clustering in general. Otherwise, the technical area of clusters seems fairly similar to last year.

I would like to see some discussion of how potential new languages and programming schemes might impact the world of clusters, but this didn't really happen in the technical papers this year. Perhaps next year in Boston...

HPCwire: How important is benchmarking, such as LINPACK for example, in the world of high-performance clustering? Do you think we should continue to focus on these litmus tests? Why or why not?

HC: Of course a benchmark such as LINPACK can only give a single and narrow view of the performance delivered by a machine (for instance, it does no I/O). However, it has value because it fosters competition and provides us with a view of the trends and evolutions of parallel computing platforms over the years. In fact, mining the data contained in the Top-500 lists over the last decade reveals a lot of interesting facts that confirm or disprove our assumptions about how technology evolves. So, in this sense, having something like the LINPACK benchmark and the Top-500 lists has a great value for the community. Now, I don't believe we, as a community, actually do "focus" on these tests. It is well understood what these tests are and what their limitations are. In fact, there are entire suites of benchmarks that, while not as visible as the LINPACK benchmark, provide much more complete views of a platform's performance and potential for real applications. Also, although performance modeling is always a very difficult question, there is a lot of expertise and good rules of thumbs in the community to assess how good a machine is for a given application, without taking things like the LINPACK benchmark into account necessarily. So, yes, we should keep such tests, and no, we shouldn't focus on them, which I don't believe we are doing as a community.

DK: Benchmarks are clearly important, but they are also clearly misused. The best benchmark is customer-specific, but it is unreasonable to expect vendors to run customer-specific benchmarks, except for very large procurements. I think we will continue to have benchmarks play an important role, as they serve an important function, but I hope to see the reliance on LINPACK reduced in favor of newer benchmarks such as the HPC Challenge benchmarks. These tend to represent a wider variety of potential applications, and the results of this suite should help customers determine what systems meet their needs.

HPCwire: Henri, before your work in the US, you worked at France's Ministry of Defense and as a graduate research assistant at the Institut de Recherche en Informatique de Toulouse (IRIT) in Toulouse, France. How has your work here differed? How prepared is the U.S. to handle the emerging importance of clustering and Grid computing?

HC: One of the differences I could see when I first started working in the US, and which is definitely relevant to the cluster computing community, is the fact that in the US there was at the time a greater synergy between academia and industry. In fact, I believe that the US is in an great position to handle the future of cluster and Grid computing precisely because of the tight connections and collaborations between academia, supercomputer centers, national labs, and industry. This is exemplified by projects like the TeraGrid for instance.

HPCwire: How about France's position?

HC: I cannot place a judgment on how France is positioned as I have not been closely involved with research programs there, although I collaborate with a few individual researchers.

HPCwire: What kept you guys up at night while organizing this event? What may continue to worry you in general regarding this field?

HC: Obviously, as a general chair, there are a lot of issues regarding logistics and budget, which are not specific to the IEEE Cluster conference but apply to all conferences. In terms of the Cluster conference specifically, I think that one important concern was that, in the face of the multitude of conferences in this and related field, Cluster would retain its identity as a conference truly focused on cluster computing, that would attract practitioners both from industry and academia, rather than aiming for a broader range of topics that would then make it undistinguishable from other conferences in this and related fields. In this context, one key issue was to attract a good number of industrial exhibitors who would demonstrate latest cluster computing technology. Luckily Cluster'04 has been particularly successful in attracting industrial sponsors and exhibitors, and is definitely the venue for Cluster Computing research and practice.

DK: Our financial situation was the most stressing, more than anything else, though this was based on not knowing the numbers of attendees before the event started, because the percentage of attendees who register on-site is so high. In general, Clustering will do fairly well in the future. My concern is for the applications that are not well served by Clustering.

HPCwire: Understandably. Is there anything else that either of you would like to add?

DK: If possible, I would like to publicly thank the Cluster2004 team, especially Greg Bruno (SDSC), our local arrangements chair, and Jim Ang (SNL) and Kurt Keville (MIT), our exhibits/sponsors co-chairs. The conference would not have been successful without their hard work.


Clearly, clustering is still as important as ever. And, with the emerging technology involved with Grid computing, we are sure to see even more advances in the area over the next several years...


Top of Page

  |  Table of Contents  |