-

This site is deprecated and will be decommissioned shortly. For current information regarding HPC visit our new site: hpc.njit.edu

Difference between pages "HPCEnvironment" and "HPCLocMtx"

From NJIT-ARCS HPC Wiki
(Difference between pages)
Jump to: navigation, search
(Importing text file)
 
(Importing text file)
 
Line 1: Line 1:
== Research computing environment overview ==
+
== HPC Location Matrix ==
 +
 
 +
This matrix lists the pros and cons of types of HPC hardware ownership and hardware locations.
 +
 
  
 
<table width="100%" style="border:1px solid black; border-collapse:collapse;">
 
<table width="100%" style="border:1px solid black; border-collapse:collapse;">
  
<caption>HPC Environment Overview</caption>
+
<caption>HPC Location Matrix</caption>
 
<tr bgcolor="#dddddd">
 
<tr bgcolor="#dddddd">
 +
  <th style="border:1px solid black;">Item</th>
 
  <th style="border:1px solid black;">Category</th>
 
  <th style="border:1px solid black;">Category</th>
  <th style="border:1px solid black;">Sub-category</th>
+
  <th style="border:1px solid black;">Subcategory A</th>
  <th style="border:1px solid black;">General<br /> Access</th>
+
  <th style="border:1px solid black;">Subcategory B</th>
  <th style="border:1px solid black;">Node Age [1]<br />(years: %)</th>
+
  <th style="border:1px solid black;">Pros</th>
  <th style="border:1px solid black;">DMS-only<br />Access [2]</th>
+
  <th style="border:1px solid black;">Cons</th>
  <th style="border:1px solid black;">Node Age [1]<br />(years: %)</th>
+
  <th style="border:1px solid black;">Private<br />Access [3]</th>
+
  <th style="border:1px solid black;">Node Age [1]<br />(years: %)</th>
+
 
  <th style="border:1px solid black;">Notes</th>
 
  <th style="border:1px solid black;">Notes</th>
 
</tr>
 
</tr>
 
<tr>
 
<tr>
<td style="border:1px solid black;">CPU </td>
+
<td style="border:1px solid black;">1</td>
 +
<td style="border:1px solid black;">Hardware owned<br />by institution </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
Line 22: Line 24:
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"></td>
 +
</tr>
 +
<tr>
 +
<td style="border:1px solid black;">2</td>
 +
<td style="border:1px solid black;"></td>
 +
<td style="border:1px solid black;"> On-campus </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"></td>
 
</tr>
 
</tr>
 
<tr>
 
<tr>
 +
<td style="border:1px solid black;">3</td>
 +
<td style="border:1px solid black;"></td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> Strongly preferred by NJIT researchers using HPC </td>
 +
<td style="border:1px solid black;"> Hardware will not stay current without funded timely re-fresh </td>
 +
<td style="border:1px solid black;">  </td>
 
<td style="border:1px solid black;"></td>
 
<td style="border:1px solid black;"></td>
<td style="border:1px solid black;"> Nodes </td>
 
<td style="border:1px solid black;"> 240 </td>
 
<td style="border:1px solid black;"> >10: <<font color="red">7</font><br />3-10: <<font color="red">90</font><br />0-3: <font color="green">3</font> </td>
 
<td style="border:1px solid black;"> 31 </td>
 
<td style="border:1px solid black;"> >10: <font color="red">0</font><br />3-10: <font color="red">100</font><br />0-3: <font color="red">0</font> </td>
 
<td style="border:1px solid black;"> 21 </td>
 
<td style="border:1px solid black;"> >10: <font color="red">0</font><br />3-10: <font color="red">34</font><br />0-3: <font color="green">76</font> </td>
 
<td style="border:1px solid black;"> About 1000 Kong cores are permanently out of service due to hardware failure </td>
 
 
</tr>
 
</tr>
 
<tr>
 
<tr>
 +
<td style="border:1px solid black;">4</td>
 
<td style="border:1px solid black;"></td>
 
<td style="border:1px solid black;"></td>
<td style="border:1px solid black;"> Cores </td>
 
<td style="border:1px solid black;"> 1,896 </td>
 
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
<td style="border:1px solid black;"> 380 </td>
 
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
<td style="border:1px solid black;"> 168 </td>
+
<td style="border:1px solid black;"> Provides maximum control for researchers and administrators </td>
 +
<td style="border:1px solid black;"> Expansion is limited by physical space, and power and HVAC requirements </td>
 +
<td style="border:1px solid black;">  </td>
 +
<td style="border:1px solid black;"></td>
 +
</tr>
 +
<tr>
 +
<td style="border:1px solid black;">5</td>
 +
<td style="border:1px solid black;"></td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> [[ HPCBaseline ]] provides guaranteed capability for researchers without and with their own funding </td>
 +
<td style="border:1px solid black;"> Expansion of resources takes months, compared to typically less than 1 hour for cloud </td>
 +
<td style="border:1px solid black;">  </td>
 +
<td style="border:1px solid black;"></td>
 
</tr>
 
</tr>
 
<tr>
 
<tr>
 +
<td style="border:1px solid black;">6</td>
 
<td style="border:1px solid black;"></td>
 
<td style="border:1px solid black;"></td>
<td style="border:1px solid black;"> RAM, TB </td>
 
<td style="border:1px solid black;"> 10.5 </td>
 
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
<td style="border:1px solid black;"> 3.6 </td>
 
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
<td style="border:1px solid black;"> 4.7 </td>
+
<td style="border:1px solid black;"> Researchers who own hardware have continued use of that hardware when their funding runs out </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;">  </td>
 +
<td style="border:1px solid black;"></td>
 +
</tr>
 +
<tr>
 +
<td style="border:1px solid black;">7</td>
 +
<td style="border:1px solid black;"></td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> For HPC use patterns, less expensive than commercial cloud </td>
 +
<td style="border:1px solid black;">  </td>
 +
<td style="border:1px solid black;">  </td>
 +
<td style="border:1px solid black;"></td>
 
</tr>
 
</tr>
 
<tr>
 
<tr>
<td style="border:1px solid black;">CPU with GPU </td>
+
<td style="border:1px solid black;">8</td>
 +
<td style="border:1px solid black;"></td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> Easily integrated into the NJIT AFS software environment </td>
 +
<td style="border:1px solid black;">  </td>
 +
<td style="border:1px solid black;">  </td>
 +
<td style="border:1px solid black;"></td>
 +
</tr>
 +
<tr>
 +
<td style="border:1px solid black;">9</td>
 +
<td style="border:1px solid black;"></td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> Can be expanded into the cloud as needed and funded </td>
 +
<td style="border:1px solid black;">  </td>
 +
<td style="border:1px solid black;">  </td>
 +
<td style="border:1px solid black;"></td>
 +
</tr>
 +
<tr>
 +
<td style="border:1px solid black;">10</td>
 +
<td style="border:1px solid black;"></td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> Colorado School of Mines (peer institution) [https://docs.google.com/document/d/1QW83O9n26o2mQmhxq5T6pDEzrh2mROLmu0HMPeqnBwE/edit?ts=5c6c2ed2 stance] </td>
 +
<td style="border:1px solid black;">  </td>
 +
<td style="border:1px solid black;"></td>
 +
<td style="border:1px solid black;"></td>
 +
</tr>
 +
<tr>
 +
<td style="border:1px solid black;">11</td>
 +
<td style="border:1px solid black;"></td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> [https://wiki.hpc.arcs.njit.edu/external/surveys/campchamp/Jan2019.results.pdf Campus Champions On/off-premise HPC] </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> See: 1)Last bullet in "Section 1 responses"<br />2) Next to last bullet in "Section 2 responses"  </td>
 +
<td style="border:1px solid black;"></td>
 
</tr>
 
</tr>
 
<tr>
 
<tr>
 +
<td style="border:1px solid black;">12</td>
 
<td style="border:1px solid black;"></td>
 
<td style="border:1px solid black;"></td>
<td style="border:1px solid black;"> Nodes </td>
+
<td style="border:1px solid black;"> Off-campus </td>
<td style="border:1px solid black;"> 2 </td>
+
<td style="border:1px solid black;"> >10: <font color="red">0</font><br />3-10: <font color="red">100</font><br />0-3: <font color="green">0</font> </td>
+
<td style="border:1px solid black;"> 2 </td>
+
<td style="border:1px solid black;"> >10: <font color="red">0</font><br />3-10: <font color="red">100</font><br />0-3: <font color="green">0</font> </td>
+
<td style="border:1px solid black;"> 8 </td>
+
<td style="border:1px solid black;"> >10: <font color="red">0</font><br />3-10: <font color="red">0</font><br />0-3: <font color="green">100</font> </td>
+
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"></td>
 
</tr>
 
</tr>
 
<tr>
 
<tr>
 +
<td style="border:1px solid black;">13</td>
 
<td style="border:1px solid black;"></td>
 
<td style="border:1px solid black;"></td>
<td style="border:1px solid black;"> GPU Cores </td>
 
<td style="border:1px solid black;"> 10,752 </td>
 
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
<td style="border:1px solid black;"> 15,320 </td>
+
<td style="border:1px solid black;"> Co-location </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
<td style="border:1px solid black;"> 64,512 </td>
 
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"></td>
 
</tr>
 
</tr>
 
<tr>
 
<tr>
 +
<td style="border:1px solid black;">14</td>
 
<td style="border:1px solid black;"></td>
 
<td style="border:1px solid black;"></td>
<td style="border:1px solid black;"> CPU Cores </td>
 
<td style="border:1px solid black;"> 40 </td>
 
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
<td style="border:1px solid black;"> 44 </td>
 
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
<td style="border:1px solid black;"> 10 </td>
+
<td style="border:1px solid black;"> Differences from "On-campus": Item 4, Diminished control for administrators; Item 8, Integration may be problematical </td>
 +
<td style="border:1px solid black;"> Differences from "On-campus" None </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"></td>
 +
</tr>
 +
<tr>
 +
<td style="border:1px solid black;">15</td>
 +
<td style="border:1px solid black;"></td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> Commercial </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"></td>
 
</tr>
 
</tr>
 
<tr>
 
<tr>
 +
<td style="border:1px solid black;">16</td>
 
<td style="border:1px solid black;"></td>
 
<td style="border:1px solid black;"></td>
<td style="border:1px solid black;"> RAM, TB </td>
 
<td style="border:1px solid black;"> 0.25 </td>
 
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
<td style="border:1px solid black;"> 0.26 </td>
 
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
<td style="border:1px solid black;"> 2.0 </td>
+
<td style="border:1px solid black;"> Differences from "On-campus": Item 4, Significantly diminished control for administrators; Item 8, Integration likely to be problematical </td>
 +
<td style="border:1px solid black;"> Differences from "On-campus" Items 3, 4, 5 do not apply </td>
 +
<td style="border:1px solid black;"> Costs are unlnown </td>
 +
<td style="border:1px solid black;"></td>
 +
</tr>
 +
<tr>
 +
<td style="border:1px solid black;">17</td>
 +
<td style="border:1px solid black;">Hardware not owned<br />by institution </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"></td>
 
</tr>
 
</tr>
 
<tr>
 
<tr>
<td style="border:1px solid black;">Node interconnect </td>
+
<td style="border:1px solid black;">18</td>
 +
<td style="border:1px solid black;"></td>
 +
<td style="border:1px solid black;"> Free cloud [1] </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
<td style="border:1px solid black;"> 13 of 10Gb/sec, 12 of 56 Gb/sec; rest 1Gb/sec </td>
 
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
<td style="border:1px solid black;"> 40 Gb/sec </td>
 
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
<td style="border:1px solid black;"> 12 of 56Gb/sec; rest 10Gb/sec </td>
 
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"></td>
 +
</tr>
 +
<tr>
 +
<td style="border:1px solid black;">19</td>
 +
<td style="border:1px solid black;"></td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> "Evergreen" - new processors readily available </td>
 +
<td style="border:1px solid black;"> Successful proposal needed </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"></td>
 
</tr>
 
</tr>
 
<tr>
 
<tr>
<td style="border:1px solid black;">Parallel file system<br />(PFS) </td>
+
<td style="border:1px solid black;">20</td>
 +
<td style="border:1px solid black;"></td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
<td style="border:1px solid black;"> None </td>
 
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
<td style="border:1px solid black;"> None </td>
+
<td style="border:1px solid black;"> Flexible configuration </td>
 +
<td style="border:1px solid black;">  Access ends when proposal period ends </td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
<td style="border:1px solid black;"> None </td>
+
<td style="border:1px solid black;"></td>
 +
</tr>
 +
<tr>
 +
<td style="border:1px solid black;">21</td>
 +
<td style="border:1px solid black;"></td>
 
<td style="border:1px solid black;"> </td>
 
<td style="border:1px solid black;"> </td>
<td style="border:1px solid black;"> Traditionally used for temporary files, PFS's are now used for all kinds of storage. Researcher request s for storage are routinely 10 to 20TB, compared to 50 to 100GB two to three years ago </td>
+
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> Expandable, within propsal limits </td>
 +
<td style="border:1px solid black;"> Loss of common software environment compared to owned </td>
 +
<td style="border:1px solid black;"></td>
 +
<td style="border:1px solid black;"></td>
 +
</tr>
 +
<tr>
 +
<td style="border:1px solid black;">22</td>
 +
<td style="border:1px solid black;"></td>
 +
<td style="border:1px solid black;"> Commercial cloud [2] </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"></td>
 +
</tr>
 +
<tr>
 +
<td style="border:1px solid black;">23</td>
 +
<td style="border:1px solid black;"></td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> "Evergreen" - new processors readily available </td>
 +
<td style="border:1px solid black;"> Generally more expensive for HPC than owned hardware </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"></td>
 +
</tr>
 +
<tr>
 +
<td style="border:1px solid black;">24</td>
 +
<td style="border:1px solid black;"></td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> Flexible configuration </td>
 +
<td style="border:1px solid black;"> Loss of common software environment compared to owned </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"></td>
 +
</tr>
 +
<tr>
 +
<td style="border:1px solid black;">25</td>
 +
<td style="border:1px solid black;"></td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"> Rapidly expandable </td>
 +
<td style="border:1px solid black;"> Access ends when funds run out </td>
 +
<td style="border:1px solid black;"> </td>
 +
<td style="border:1px solid black;"></td>
 
</tr>
 
</tr>
 
</table>
 
</table>
 
<p></p>
 
<p></p>
 
<strong>Legend</strong><br />
 
<strong>Legend</strong><br />
[1] :
+
[1] E.g., XSEDE
<ul>
+
<br />
<li>Applies also to Cores and RAM</li>
+
[2] E.g., AWS, Azure, GCP, Penguin
<li><font color="red">Red numbers</font> mean that the nodes are <strong>Out of Warranty</strong></li>
+
<li><font color="green">Green numbers</font> mean that the nodes are <strong>In Warranty</strong></li>
+
</ul>
+
 
+
[2] :
+
"DMS" refers to the Department of Mathematical Sciences. DMS owns the <em>Stheno</em> cluster.
+
 
+
[3] :
+
"Private Access" refers to cluster hardware purchased by individual researchers. That hardware is
+
dedicated to those researchers.
+
 
+
== High-speed node interconnect and parallel file system ==
+
[[ IBandPFS | Roles of internal Network and PFS ]]
+
 
+
== HPC cluster storage ==
+
[[ ClusterStorage | Storage accessible to HPC clusters ]]
+
 
+
 
----
 
----

Latest revision as of 16:33, 5 October 2020

HPC Location Matrix

This matrix lists the pros and cons of types of HPC hardware ownership and hardware locations.


HPC Location Matrix
Item Category Subcategory A Subcategory B Pros Cons Notes
1 Hardware owned
by institution
2 On-campus
3 Strongly preferred by NJIT researchers using HPC Hardware will not stay current without funded timely re-fresh
4 Provides maximum control for researchers and administrators Expansion is limited by physical space, and power and HVAC requirements
5 HPCBaseline provides guaranteed capability for researchers without and with their own funding Expansion of resources takes months, compared to typically less than 1 hour for cloud
6 Researchers who own hardware have continued use of that hardware when their funding runs out
7 For HPC use patterns, less expensive than commercial cloud
8 Easily integrated into the NJIT AFS software environment
9 Can be expanded into the cloud as needed and funded
10 Colorado School of Mines (peer institution) stance
11 Campus Champions On/off-premise HPC See: 1)Last bullet in "Section 1 responses"
2) Next to last bullet in "Section 2 responses"
12 Off-campus
13 Co-location
14 Differences from "On-campus": Item 4, Diminished control for administrators; Item 8, Integration may be problematical Differences from "On-campus" None
15 Commercial
16 Differences from "On-campus": Item 4, Significantly diminished control for administrators; Item 8, Integration likely to be problematical Differences from "On-campus" Items 3, 4, 5 do not apply Costs are unlnown
17 Hardware not owned
by institution
18 Free cloud [1]
19 "Evergreen" - new processors readily available Successful proposal needed
20 Flexible configuration Access ends when proposal period ends
21 Expandable, within propsal limits Loss of common software environment compared to owned
22 Commercial cloud [2]
23 "Evergreen" - new processors readily available Generally more expensive for HPC than owned hardware
24 Flexible configuration Loss of common software environment compared to owned
25 Rapidly expandable Access ends when funds run out

Legend
[1] E.g., XSEDE
[2] E.g., AWS, Azure, GCP, Penguin