HPSC -- High Performance Statistical Computing for Data Intensive Research
Section: About HPSC
``Read in Distributed, Compute in Distributed, Statistics in Distributed,
Output in Distributed.''
This web page introduces a simple computing framework for "Big Data" called
single program multiple data (SPMD). Many
statistical methods can be redesigned fairly easily to follow the same framework.
We aim to introduce these ideas from the viewpoint of
STATISTICS,
and to provide a
Cookbook that illustrates the framework, covering topics from
fundamental statistics to advanced methodology.
Tentatively, the pages will cover basic ideas of parallel computing,
statistical computing, and R programming, and they will be illustrated
in a simple manner.
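
As a rough illustration of the SPMD idea, the following minimal sketch computes a mean in the "read in distributed, compute in distributed" style. It is written in Python for illustration only and is a serial simulation: no MPI library is used, and the loop over ranks stands in for processes that an MPI runtime would launch. The names local_block, spmd_program, and distributed_mean are hypothetical and do not come from this website or from any package.

```python
# Serial simulation of SPMD: every "rank" runs the same program on its own
# block of the data. No MPI is involved; the loop over ranks is a stand-in
# for processes started by an MPI runtime (an assumption for illustration).

def local_block(n, rank, size):
    """Return the index range [start, stop) owned by this rank (block distribution)."""
    base, extra = divmod(n, size)
    start = rank * base + min(rank, extra)
    stop = start + base + (1 if rank < extra else 0)
    return start, stop

def spmd_program(data, rank, size):
    """The single program: compute local sufficient statistics for the mean."""
    start, stop = local_block(len(data), rank, size)
    chunk = data[start:stop]
    return sum(chunk), len(chunk)

def distributed_mean(data, size):
    """Run the same program on every rank, then combine the partial results."""
    partials = [spmd_program(data, rank, size) for rank in range(size)]
    # Stand-in for a reduction step: add up the partial sums and counts.
    total = sum(s for s, _ in partials)
    count = sum(c for _, c in partials)
    return total / count

data = [float(x) for x in range(1, 11)]
result = distributed_mean(data, size=4)
print(result)
```

In a real SPMD run, each process would hold only its own chunk, and the combining step would be a collective reduction across processes rather than a loop in one process.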
"Have a Big dream of Bigger than Big."
By default, all examples of this website are illustrated in the Unix/Linux
system with
For Mac users,
OpenMPI is suggested for
If you don't have many machines/processors,
the easiest way to test and learn is to install
VirtualBox
with a Unix/Linux system. VirtualBox can run
multiple virtual computers simultaneously on most common systems.
You can duplicate the virtual machines/processors
inside VirtualBox as many times as you want.
Therefore, a parallel
computing environment can be set up on a single machine.
Setting computing performance aside,
this is helpful for testing programs and for building projects in
a consistent environment.
When mentioning the HPSC, please cite:
Need Help
Wei-Chen Chen and
George Ostrouchov.
Wei-Chen thanks Dr. George Ostrouchov of Oak Ridge National Laboratory for helpful discussions and for providing insightful suggestions and materials about general parallel computing. The contents are partly outcomes of the project "Visual Data Exploration and Analysis of Ultra-large Climate Data" supported by the U.S. DOE Office of Science.
Wei-Chen also thanks
Dr. Hao Yu,
the author of
Rmpi,
for great discussions about Rmpi design and parallel programming in Rmpi.
Also, Wei-Chen thanks
Stephen Weston,
one of the authors of
Parallel R: Data Analysis in the Distributed World,
for sharing MPI and
This website is built on a machine located in
the
Department of Statistics
at Iowa State University
in Ames,
Iowa,
USA.
Created: Oct 19 2011; Last Revised: Feb 13 2013, 12:20 (CDT Ames, IA, USA); Maintained: Wei-Chen Chen; E-Mail: wccsnow @ gmail.com