Cluster SoftWare Notes
For cluster software installation/management:
- Rocks
(own Linux distribution, RedHat Enterprise Linux 4 - RHEL4 - based, adds
capabilities via "Rolls" - prepackaged groups of software).
- Provisioning
- Monitoring
- CPU load
- free memory
- disk usage
- network I/O
- operating system version
- dead nodes
- Interprocess: MPI (PVM probably via SGE)
- Scheduler: SGE,Condor
- Oscar
(Toolkit on top of other Linux distributions)
- Provisioning
- Monitoring
- CPU load
- free memory
- disk usage
- network I/O
- operating system version
- dead nodes
- Interprocess: PVM,MPI
- Scheduler: SGE,Maui,Torque
- SCore
(Installs on top of CentOS 4.3 (RHEL4 clone).
- Provisioning
- Monitoring
- Interprocess: MPI,PVM(w/limitations)
- Scheduler: PBS,SCore-D
- OpenSCE
(Toolkit on top of other distributions, works nicely with Rocks)
- Provisioning
- Monitoring
- CPU Utilization and information (brand, clock rate, details)
- Memory information and usage
- Disk information and I/O rate
- Network interface information and usage rate
- Page/Swap/Context Switching rate
- Interrupt information
- System temperature and fan speed via ACPI or LM Sensor
- File system usage information (per partition basis)
- Interprocess: MPI
- Scheduler: SQMS
- Perceus
(More geared towards diskless clusters, should still work with local
storage instead, costumizable)
Warewulf
(More geared towards diskless clusters, should still work with local
storage instead, costumizable)
- Monitoring
- Interprocess: MPI (PVM probably via SGE)
- Scheduler: SGE,Torque
- SCali
Commercial toolkit above other Linux distributions
- Provisioning
- Monitoring
- Looks to have Reboot/Shutdown capabilities via PBS?
- Interprocess: MPI
- Scheduler: PBS
- Verari Command Center
Commercial toolkit for Verari BladeRack2 clusters
- Standard
- Monitoring
- Blade Power management (power up/down indiv or groups of blades)
- Blade insertion/removal
- CPU fan speed
- memory status
- rack temperature
- rack fan speed
- power status
- LED status/control
- Advanced
- Provisioning
- Monitoring: Standard +
- CPU utilization
- memory usage
- disk usage
- network traffic
- Enterprise
- Provisioning
- Monitoring: Advanced +
- E-mail alerts for specific events
MPI/Pro®
Offered through Verari.
ClusterController®
Verari scheduler for Microsoft® Windows.
- BigBrother
- Nagios (Open Source)
- Monitoring
- Monitoring of network services (SMTP, POP3, HTTP, NNTP, PING, etc.)
- Monitoring of host resources (processor load, disk and memory usage,
running processes, log files, etc.)
- Monitoring of environmental factors such as temperature
(?requires hardware sensors?)
- Contact notifications when service or host problems occur and get
resolved (via email, pager, or other user-defined method)
- Halcyon
"PrimeAlert Adapter for Netcool provides the capability to integrate Sun Management Center (Sun MC) alarms into Netcool." Does this restrict
Halcyon to Sun systems?
Rolling Back-ups
- Easy
Automated Snapshot-Style Backups with Linux
Uses the rsync facility of Linux. Includes links to many other
variants.
- Bacula Tape backup freeware
- Amanda Tape backup freeware
Fast communications over Gigabit Ethernet
- GAMMA
and
MPI/GAMMA
Would have required a second LAN for non-GAMMA IP traffic (e.g. NFS)
- Parastation
The Opteron version seems to be a beta version.
- Interprocess: MPI
- Scheduler: LSF,PBS-Pro,OpenPBS, (possibly SGE)
- SCore (see above)
- SCali MPI (see above, commercial)
Parallel Filesystems
- PVFS2
Open source parallel filesystem, does not require kernel modifications
-
GPFS IBM product. Appears to be for IBM servers only.
- Lustre
Very high performance parallel filesystem, requires extensive kernel
modifications
- SFS: HP's commercial version of Lustre
Fortran compilers for Linux
- A set of different comparisons from
polyhedron
- Opteron Benchmark Notable results
- Pathscale EKO Fortran Compiler 2.4
- Absoft Pro Fortran 10.0.3 GA
- The Portland Group Compiler 6.2-4 (PGI)
- Intel Fortran Compiler
- A couple of very bad results: "Capacita" & "Fatigue"
- One
warning against using Intel Fortran Compiler on non Intel chips
- Intel Benchmark Notable results
- Intel Fortran Compiler
- Pathscale EKO Fortran Compiler 2.4
- Absoft Pro Fortran 10.0.3 GA
- The Portland Group Compiler 6.2-4 (PGI)
- Supported language extensions
- Diagnostic Capabilities
- Himeno
Benchmark
- Portland & Fujitsu compilers show more benefits from tuning compiler
flags than does Intel compiler.
- Opteron
Benchmark compiled by
DisCO
- A list of FORTRAN compilers for linux, including some more benchmarks.
Software Development Kits
-
Absoft High Performance Computing Software Development Kits (HPC SDK)
- Includes Absoft, PathScale, Intel or IBM compilers (F77/90/95, C/C++)
- Fx2 debugger
- math libraries
- MPI distributions
- tracing tools
- Allinea DDT
The Distributed Debugging Tool
- MultiCore Plus SDK
No FORTRAN support. May be limited to Mercury systems.
- Intel Software
Development Products
Not a single package but a page listing many Intel products.
See also Original Intel page.
- Intel compilers (F77/90/95, C/C++)
- Intel Vtune analyzers
- Intel performance libraries
- Intel threading analysis tools
- Intel cluster tools
- gdb
Cluster Set-up
-
ROCKS+support Support Rolls + 3rd party rolls. Stumbled onto
this through one of the Older (4.2.1) Rocks Rolls.
Matlab
-
The Mathworks Announces Breakthrough Parallel Algorithm Development
Security
- firewall?
Cluster Hardware Notes
DDR2 memory
-
DDR vs. DDR2 - What it means to you.
-
Introduction to DDR-2: The DDR Memory Replacement
-
DDR2: a Soon-to-be DDR Replacement. Theoretical Basis and First Low-level Test Results
ProCurve Switch 2800 Series. specifications