------------------DOCUMENTS------------------ Hardware: - pfjl0_details_specs_racks.mail - VS_BladeRack_Manual_RevI - ProCurve_Switch_2800_Series.pdf - ARCvault 24 User Guide SOFTWARE - Web-page: http://oceans.deas.harvard.edu/haley/HideyHole/cluster_software.html - rocks-usersguide-4.2.1 - pjh3_notes_conf_call_final.mail - We have Rocks (contains RHEL4). Contains: CentOS 4 update 5 (replace the OS Roll with the complete media set from any Red Hat Enterprise Linux 4 compatible distribution, including the commercial version). Schedulers: CONDOR (High throughput computing tools), grid Globus 4.0.4 (GT4) SGE Sun Grid Engine job queueing system area51 System security related services and utilites bio Bioinformatics utilities ganglia Cluster monitoring system from UCB java Sun Java SDK and JVM pvfs2 PVFS2 File System MPI installed ------ CLUSTER SET-UP TODO: SOFTWARE TO INSTALL ------- -- Parastation, PBS, Score: to be installed Need to install parastation on top of Rocks (also Score) - PVM: Need to check to see if in Rocks on not, and if it works with new condor (parallel universe) - Seems like Rocks can deploy software, hence no need for XCAT to depoy OS/software.etc see http://www.xcat.org/ and http://www.alphaworks.ibm.com/tech/xCAT/ - parallel file systems: at least on main storage array. - Cross mounting of disk: set up NIS on master server with NFS such that linux/windows desktops can see cluster disks. - backup data: Oleg to contact MIT IS&T Need to set-up rolling back-ups (search Pat's page http://oceans.deas.harvard.edu/haley/HideyHole/cluster_software.html - Compilers (see Pat's page): they need to be discussed and quotes are needed, compare also with MIT offering. - Software development kits: same thing. - Matlab: PFJL has quotes. - Authentication - Firewalls - software download from MIT for install on large cluster: Need to check with IS&T or simply download from Web, install on front-end and use rocks to copy. Someone needs to do a list of free/4pay software needed from MIT list - ask someone how to continue to copy files across tapes (Al Conte)? - Surge protector: We have quotes and web info from Verari and Andrew Hamilton from Anixter (APC local) on surge protector and refurbished UPS/Power supply. Pat needs to follow-up with Boyer/Ron from MIT. - internal connect and configuration of switches for more than 1G connect (see Conway's emails) - partitioning of disk in 3U so that one/some of them is/are used for backup. - OS can be changed mid-stream (user authentication) ---OS recommended by 2.006 student, which we should check http://en.wikipedia.org/wiki/Debian_GNU/Linux http://www.debian.org/ http://en.wikipedia.org/wiki/Ubuntu_Linux http://www.ubuntu.com/