Second International Workshop

on Operating Systems, Programming Environments and Management Tools

for High-Performance Computing on Clusters

(COSET-2)

http://coset.irisa.fr

Cambridge, Massachusetts (USA), June 19th, 2005

in conjunction with

ACM International Conference on Supercomputing (ICS05)

http://ics05.csail.mit.edu

Final Program

Proceedings are available here

 

9:00 Opening remarks

9:15 Invited talk introduction

9:30 Invited Talk: Cluster of clusters with OSCAR
Eric Focht, NEC Europe.


The OSCAR clustering infrastructure is designed for homogeneous clusters of similar nodes. The talk shows one possible approach for extending OSCAR to enable it to easilly deal with heterogeneous setups like for example a centrally managed cluster of subclusters with nodes of different architectures. The setup, installation and administration methods for such a cluster are explained. Also the additional toolsare described: a subcluster class which uses the C3 toolkit and its scalable features, administration commands which use "pull" methods and journals, simple Ganglia based (sub)cluster-membership functions. The extensions are integrated in HPCL, an OSCAR based clustering stack provided by NEC HPC Europe.

slides.pdf

10:30 Coffee break

11:00 Session 1: Data Management

A New Approach to Cost-Aware Caching in Heterogeneous Storage Systems

Liton Chakraborty, Ajit Singh (University of Waterloo)

slides.pdf

Design and Implementation of HTSFS: High-throughput & Scalable File System

Wenguo Wei (Guangdong Polytechnic Normal University), Shoubin Dong, Lin Zhang (Guangdong Key Laboratory of Computer Network, South China University of Technology)

12:00 Lunch (on own)

1:30 Session 2: High availability

Scaling Out OpenSSI

Bruce Walker (HP), Jaideep Dharap (HP & UCLA)

slides.pdf

Asymmetric Active-Active High Availability for High-end Computing

Chokchai (Box) Leangsuksun (Louisana Tech University), Venkata Kiriti (Kit) Munganuru (Louisana Tech University), Tong Liu (Dell Inc.), Stephen L. Scott (ORNL), Christian Engelmann (ORNL, The University of Reading)

slides.pdf

High Availability for Ultra-Scale High-End Scientific Computing

Christian Engelmann (ORNL, The University of Reading), Stephen L. Scott (ORNL)

slides.pdf

3:00 Coffee Break

3:30 Session 3: Performance Optimization

A Flexible Thread Scheduler for Hierarchical Multiprocessor Machines

Samuel Thibault (LABRI)

slides.pdf

Remote-Write Communication Protocol for Clusters and Grids

Ouissem Ben Fredj, Eric Renault (GET/INT)

slides.pdf

4:30 Session 4: Tools

Efficient Parallel Shell

Georges-André Silber (Centre de recherche en informatique, Ecole des Mines de Paris)

slides.pdf

5:00 - Closing remarks

 

 


      

 

CALL FOR PAPERS

SCOPE

Clusters are not only the most widely used general high-performance computing platform for scientific computing but also according to recent results on the top500.org site, they have become the most dominant platform for high-performance computing today. While the cluster architecture is attractive with respect to price/performance there still exists a great potential for efficiency improvements at the software level. System software requires improvements to better exploit the cluster hardware resources. Programming environments need to be developed with both the cluster and human programmer efficiency in mind. Administrative processes need refinement both for efficiency and effectiveness when dealing with numerous cluster nodes. The goal of this one-day workshop is to bring together a diverse community of researchers and developers from industry and academia to facilitate the exchange of ideas and to discuss the difficulties and successes in this area. Furthermore, to discuss recent innovative results in the development of cluster based operating systems and programming environments as well as management tools for the administration of high-performance computing clusters. The workshop organizers solicit papers presenting new software systems and concepts for clusters ranging from small clusters - through large-scale - and federated clusters.

 

Topics of interest for this workshop include but are not limited to:


* Cluster management and configuration tools

* Cluster distribution technology and experience (ie. OSCAR, Scyld, Rocks, others...)
* Single system image systems
* Global scheduling
* Process migration
* High performance communication systems
* Distributed Shared Memory
* Distributed and parallel file systems for clusters
* High performance I/O
* Multi-threading environments
* OpenMP support on clusters
* Message-passing programming environments (ie. PVM, MPI, others...)
* Security
* Fault-tolerance
* Checkpointing
* High availability
* Efficient and innovative communication methodologies
* High-performance networking technologies (ie. Myrinet, Infiniband, others...)
* Remote paging
* Cooperative caching
* Cluster operating systems
* Performance Evaluation
* New commercial or experimental software for high performance cluster computing

The workshop format will include one keynote speaker, presentations from authors of reviewed papers, and one panel session discussion on a relevant topic. The program committee will review all papers. The submissions to be presented at the workshop will be selected, based on their originality, technical merit, and topical relevance of the contents. Accepted papers will appear in the workshop proceedings provided to all workshop attendees and posted on the workshop website. An effort will be made to capture all presentation material to post on the workshop website following the meeting.
Authors of accepted papers are expected to register and present the paper at the workshop.

 

PAPER SUBMISSION

Submitted papers should not exceed 10 single-spaced pages (8.5x11 paper and using at least 11pt font) including all figures, tables, graphs and bibliography).

The cover page must contain:
* abstract of approximately 150 words
* 3-5 key words
* name and affiliation of author(s)

* clear indication of the corresponding author's:
* email
* telephone number
* fax number
* postal address

Submission will only be accepted electronically via email, in either postscript or pdf formats.

Send submissions by April 15th, 2005 to both workshop co-chairs:
Christine Morin, INRIA (France) (christine.morin@inria.fr)
Stephen L. Scott, ORNL (USA) (scottsl@ornl.gov)

 

IMPORTANT DATES

Submission deadline: April 15th, 2005 (extended deadline, no further extension)
Author notification:   April 29th, 2005
Camera-ready due:     May 20th, 2005

Workshop: Sunday June, 19th, 2005

 

WORKSHOP CO-CHAIRS

Stephen L. Scott
Oak Ridge National Laboratory
P. O. Box 2008, Bldg. 5600, MS-6016
Oak Ridge, TN 37831-6016
email: scottsl@ornl.gov
http://www.csm.ornl.gov/~sscott/
voice: 865-574-3144
fax: 865-576-5491

Christine A. Morin
IRISA/INRIA
Campus universitaire de Beaulieu
35042 Rennes cedex, France
email: christine.morin@inria.fr
http://www.irisa.fr/paris
voice: +33 2 99 84 72 90
fax: +33 2 99 84 71 71

 

PROGRAM COMMITTEE

Ramamurthy Badrinath, HP, India
Amnon Barak, Hebrew University, Israël
Jean-Yves Berthou, EDF R&D, France
Brett Bode, Ames Lab, USA
Ron Brightwell, SNL, USA
Toni Cortès, UPC, Spain
Narayan Desai, ANL, USA
Christian Engleman, ORNL, USA
Graham Fagg, University of Tennessee, USA
Paul Farrell, Kent State University, USA
Andrzej Goscinski, Deakin University, Australia
Liviu Iftode, Rutgers University, USA
Chokchai Leangsuksun, Louisiana Tech University, USA
Laurent Lefèvre, INRIA, France
Renaud Lottiaux, INRIA, France
John Mugler, ORNL, USA
Raymond Namyst, Université de Bordeaux 1, France
Thomas Naughton, ORNL, USA
Rolf Riesen, SNL, USA
Michael Schoettner, University of Ulm, Germany
Assaf Schuster, Technion, Israël
Gil Utard, Université de Picardie, France
Geoffroy Vallée, INRIA, France

 

ARCHIVES

The first COSET workshop (COSET-1 was held on June 26th, 2004 in conjunction with ICS '04 in Saint-Malo (France).