Hank Dietz
Assoc. Prof. of Electrical and Computer Engineering
Purdue University
West Lafayette, IN 47907-1285
Parallel processing offers the potential to achieve faster program execution by having multiple processors work together. Traditionally, these processors were contained within a dedicated parallel supercomputer; now, it may be more cost-effective to use the processors of a group, or cluster, of PCs or workstations. The problem with clusters as parallel machines is that conventional networks were not designed to coordinate processors working on a single program, but to transfer large blocks of data between essentially independent systems.
Instead of using conventional network hardware and software, we suggest that it may be more appropriate to use a simple custom synchronization unit to coordinate the actions of a cluster of machines. In particular, we discuss how PAPERS, Purdue's Adapter for Parallel Execution and Rapid Synchronization, can make a cluster behave as a single tightly-coupled parallel supercomputer.
[Figures: TTL_PAPERS; TTL_PAPERS 951201]
bcastPutType(d), bcastGetType()
putgetType(d, s)
reduceOpType(d)
scanOpType(d)
any(f), all(f)
waitvec(f)
vote(s), matchType(d)
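The operation names above describe aggregate functions in which every processor contributes a value and receives a combined result. As a rough illustration only, the following single-process C sketch models each processor as one array slot. Every `sim_*` name is hypothetical and not part of the PAPERS library. The exact semantics are assumptions: `scan` is modeled as an inclusive prefix sum, `waitvec` as a bit mask with bit i set when processor i's flag is nonzero, and `putget` as each processor reading the value put by the processor it names.

```c
#include <assert.h>
#include <stddef.h>

#define MAX_PROC 32  /* hypothetical cap so the waitvec mask fits in 32 bits */

/* any(f): nonzero if at least one simulated processor has a nonzero flag. */
int sim_any(const int *f, size_t nproc) {
    for (size_t i = 0; i < nproc; i++)
        if (f[i]) return 1;
    return 0;
}

/* all(f): nonzero only if every simulated processor has a nonzero flag. */
int sim_all(const int *f, size_t nproc) {
    for (size_t i = 0; i < nproc; i++)
        if (!f[i]) return 0;
    return 1;
}

/* waitvec(f), as assumed here: bit i of the result is processor i's flag. */
unsigned long sim_waitvec(const int *f, size_t nproc) {
    assert(nproc <= MAX_PROC);
    unsigned long mask = 0;
    for (size_t i = 0; i < nproc; i++)
        if (f[i]) mask |= 1UL << i;
    return mask;
}

/* reduceAdd(d): every processor would receive the sum of all contributions. */
int sim_reduce_add(const int *d, size_t nproc) {
    int sum = 0;
    for (size_t i = 0; i < nproc; i++)
        sum += d[i];
    return sum;
}

/* scanAdd(d), assumed inclusive: out[i] = d[0] + ... + d[i]. */
void sim_scan_add(const int *d, int *out, size_t nproc) {
    int running = 0;
    for (size_t i = 0; i < nproc; i++) {
        running += d[i];
        out[i] = running;
    }
}

/* putget(d, s): processor i puts d[i] and gets the value put by src[i].
   The out array must not alias d. */
void sim_putget(const int *d, const size_t *src, int *out, size_t nproc) {
    for (size_t i = 0; i < nproc; i++) {
        assert(src[i] < nproc);
        out[i] = d[src[i]];
    }
}
```

In a real cluster, each processor would call the operation once with its own value and the hardware would combine the contributions; the arrays here only stand in for that exchange.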
| Machine | Barrier Sync. | 32-bit Putget (permutation) | 64-bit Broadcast |
|---|---|---|---|
| MasPar MP-1 | 0.1 | 44.0 | 31.0 |
| 486 Linux (PAPERS1) | 3.1 | 27.0 | 81.0 |
| Cray T3D (PVM) | 21.0 | 82.0 | |
| 486 Linux (TTL_PAPERS) | 2.5 | 216.0 | 137.0 |
| Intel Paragon XP/S | 530.0 | 700.0 | 210.0 |
| 486 Linux (Ethernet PVM3) | 49,000.0 | 100,000.0 | 40,000.0 |
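As a quick reading aid, the two 486 Linux rows for TTL_PAPERS and Ethernet PVM3 can be compared directly. The ratios below are computed only from the table entries and are unit-free; they are a worked illustration rather than a result reported by the author.

$$
\frac{49{,}000.0}{2.5} = 19{,}600
\qquad
\frac{100{,}000.0}{216.0} \approx 463
\qquad
\frac{40{,}000.0}{137.0} \approx 292
$$

Read left to right, these are the barrier synchronization, 32-bit putget, and 64-bit broadcast ratios of Ethernet PVM3 time to TTL_PAPERS time.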
s_write(a, s);
s_poll();
[Figures: TTL_PAPERS 951201, 386; TTL_PAPERS, 486; TTL_PAPERS, Pentium]