This shows you the differences between two versions of the page.
Next revision | Previous revision | ||
user:jr018429:portfolio:eight_machine_hpc_cluster [2011/05/10 19:04] – created jr018429 | user:jr018429:portfolio:eight_machine_hpc_cluster [2011/05/16 18:51] (current) – [Materials/Reading/Prerequisites] jr018429 | ||
---|---|---|---|
Line 1: | Line 1: | ||
+ | <WRAP centeralign round graybg box 96%> | ||
+ | <WRAP muchbigger> | ||
+ | <WRAP bigger> | ||
+ | </ | ||
+ | ======Objective====== | ||
+ | The object of this exercise is to create an 8-machine HPC cluster which is capable of performing HPC tasks.\\ | ||
+ | ======Materials/ | ||
+ | To do this project, you will need to have successfully completed on 8 computers: | ||
+ | * Linux/*BSD Desktop Install | ||
+ | To accomplish this project you will need: | ||
+ | ^Item^Description^Qty^ | ||
+ | |computer|HPC cluster|8| | ||
+ | |student shelf KVM setup|The location for this installation is largely established and fixed|1| | ||
+ | |||
+ | ======Background====== | ||
+ | Recently, HPC systems have shifted from supercomputing to computing clusters and grids. | ||
+ | Open MPI is one of a number of HPC technologies used to implement and create high performance computing systems. | ||
+ | ======Procedure====== | ||
+ | These instructions assume that the boot sequence of all of the computers in the cluster are configured to boot from the hard drive before booting from the network. If a computer' | ||
+ | - Set the master KVM bank switch to bank one. | ||
+ | - Set the master KVM computer switches to the computer on which the operating system is to be installed. | ||
+ | - Turn on the computer. | ||
+ | - As a computer boots, press F2 on the keyboard to bypass the " | ||
+ | - In the cmos setup, use the right or left arrow keys to select " | ||
+ | - Verify that "PXE boot to LAN" is enabled. If it isn't enabled, enable it. | ||
+ | - In the cmos setup, use the right and left arrow keys to select " | ||
+ | - Arrow down to " | ||
+ | - Verify that " | ||
+ | - If cmos settings were changed save and exit the cmos setup utility. If no changes were made to the cmos, select the exit without saving option. | ||
+ | - The Computer reboots.\\ | ||
+ | - Press F10 to access the boot menu.\\ | ||
+ | - On the boot menu, arrow down to "IBA GE Slot 0208 V1210" and select it.\\ | ||
+ | - Next, the Lair Network Boot Menu loads.\\ | ||
+ | - Select " | ||
+ | - Arrow to " | ||
+ | - At this point, the installation begins (defaults were selected throughout MOST but not all of the installation). | ||
+ | - The " | ||
+ | - The " | ||
+ | - The " | ||
+ | - The " | ||
+ | - The " | ||
+ | - The " | ||
+ | - The " | ||
+ | - The " | ||
+ | - The " | ||
+ | - In the first installation od Debian Squeeze, the installation continued from here, however during the second installation, | ||
+ | - Next, the "Set up users and passwords" | ||
+ | - The "Set up users and passwords" | ||
+ | - The "Set up users and passwords" | ||
+ | - The "Set up users and passwords" | ||
+ | - The "Set up users and passwords" | ||
+ | - Next, the " | ||
+ | - The " | ||
+ | - The " | ||
+ | - The " | ||
+ | - The " | ||
+ | - Partitioning message-" | ||
+ | - Popularity contest message " | ||
+ | - The " | ||
+ | - Configuring grub message: " | ||
+ | - Installation complete < | ||
+ | - Finishing the installation..\\ | ||
+ | - Computer reboots automatically.\\ | ||
+ | - Repeat steps 2 through 44 for all computers in the cluster.\\ | ||
+ | - Change the KVM computer switch to the first position, node 00.\\ | ||
+ | - Change the working directory to /etc/apt. At the command prompt enter "cd / | ||
+ | - Rename the existing sources.list to sources.bak. At the command prompt enter "mv sources.list sources.bak" | ||
+ | - Now download sources.list. At the command prompt, enter "wget http:// | ||
+ | - Next, open sources.list with a text editor and replace all references to " | ||
+ | - On the command line, enter and execute: " | ||
+ | - change the directory to " | ||
+ | - On the command line, enter and execute the following command: "cat id_rsa.pub >> authorized_keys" | ||
+ | - On the command line enter and execute the following commands: for i in 1 2 3 4 5 6 7; do ssh node0$i "mkdir -p .ssh; chmod 700 .ssh" sca id_rsa.pub node0$i: | ||
+ | - On node00, perform an aptitude install openmpi-bin openmpi-libs0 openmpi-dev openmpi-dev build-essential | ||
+ | - On the command line, enter and execute the following command: "for i in 1 2 3 4 5 6 7; do ssh node0$i aptitude install openmpi-bin openmpi-libs0 openmpi-doc done". | ||
+ | - On node00, change directory to / | ||
+ | < | ||
+ | node00 slots=1 max_slots=1 | ||
+ | node01 slots=1 max_slots=1 | ||
+ | node02 slots=1 max_slots=1 | ||
+ | node03 slots=1 max_slots=1 | ||
+ | node04 slots=1 max_slots=1 | ||
+ | node05 slots=1 max_slots=1 | ||
+ | node06 slots=1 max_slots=1 | ||
+ | node07 slots=1 max_slots=1 | ||
+ | </ | ||
+ | then close vi and save the file :wq.\\ | ||
+ | \\ | ||
+ | As part of the HPC cluster, we used a network file system to share resources across the cluster.\\ | ||
+ | We installed the nfs server on VM03,a virtual machine, and on all of the machines in the cluster.\\ | ||
+ | On the nfs server(vm03 in this case) perform the following steps:\\ | ||
+ | < | ||
+ | - Log into the virtual machine as root | ||
+ | - aptitude intstall nfs-kernel-server | ||
+ | - mkdir / | ||
+ | - reboot the vm using the reboot command | ||
+ | - Log into the vm | ||
+ | - edit the file / | ||
+ | - exportfs -rva | ||
+ | </ | ||
+ | On each of the machines in the cluster perform the following steps:\\ | ||
+ | < | ||
+ | - Log into a machine as root | ||
+ | - aptitude install nfs-common | ||
+ | - reboot the machine using the reboot command | ||
+ | - Log into the machine | ||
+ | - mount -t nfs vm02:/ | ||
+ | </ | ||
+ | ======References====== | ||
+ | * HPC: http:// | ||
+ | * Open MPI tutorial: http:// | ||
+ | * Open MPI: http:// | ||
+ | |||
+ | |||