Wirawan Purwanto
18d79dd34b
* On custom sacct1.sh, include the job's most recent state as well.
4 years ago
Wirawan Purwanto
d384d0320d
* Added explicit headless option (--headless).
* Increased wait time to 5 minutes.
5 years ago
Wirawan Purwanto
c2a5ae8863
* slurm: Modified fields to print by default.
5 years ago
Wirawan Purwanto
32b82db7a3
* slurm: Added custom sacct wrapper script which contains my preferences.
5 years ago
Wirawan Purwanto
db2ca075ed
* Must re-add SLURM if it is not loaded.
6 years ago
Wirawan Purwanto
a65338a8bf
* Added "wgo" (what's going on) to check the status of processes
on login node. Currently supposed to be used for Turing only.
6 years ago
Wirawan Purwanto
1387997010
* Prints an error message and quits in case Jupyter
did not start after 2 minutes.
6 years ago
Wirawan Purwanto
ba6c9f53ed
* Added accommodation for Anaconda as well as site-provided python.
* Added support for fully headless mode (connect via local browser).
* Added some safeguards against job failing to start due to executable
not found, etc.
6 years ago
Wirawan Purwanto
1c8a5da492
* Imported launch_jupyter from Turing.
Originally furnished by Min Dong, 2019-03-01 14:30 EST.
6 years ago
Wirawan Purwanto
28eb7a0d98
* sq: Customizable squeue wrapper: introduce new defaults / default
behavior on squeue.
6 years ago
Wirawan Purwanto
94e0aa9490
* jupyter-anaconda2: A script that will start Jupyter notebook process
for Anaconda2 distribution.
Note: for now it is a script that has to be submitted in the compute
node. I will upgrade this to become a self-submitting script eventually.
6 years ago
Wirawan Purwanto
95034685ff
* interact: Tool to allocate an interactive session on a regular
compute node under SLURM.
7 years ago
Wirawan Purwanto
dbce662c5a
* interact-gpu: Tool to allocate an interactive session on a GPU
compute node under SLURM.
7 years ago
Wirawan Purwanto
82ea3bc689
* Archived: SGE version of pwscf-5.3 script.
8 years ago
Wirawan Purwanto
9c82c4d465
* Update bash module support: with recent changes on Turing,
'module' seems to be supported out of the box for bash.
If 'module' environment is detected, we skip the initiation step.
8 years ago
Wirawan Purwanto
68c0e70d4d
* Added "regular" runsas which runs with more limited memory.
8 years ago
Wirawan Purwanto
27b8ccd6ae
* Added runsas-himem from earlier consultation this year.
8 years ago
Wirawan Purwanto
df6facce86
* pwscf: Ad-hoc fix for Turing after 2016 upgrade.
We force using the old (TCL) module system since the new module
system (LMOD) always executes itself whenever a bash batch script
is executed on Turing right now.
8 years ago
Wirawan Purwanto
739d765f53
* Added convenience for gathering & analyzing CPUs on the cluster.
* Documentation update.
8 years ago
Wirawan Purwanto
aa597b907c
* In hoststats subcommand: Also print node status flags if they exist.
8 years ago
Wirawan Purwanto
999fe5f571
* Add more info to gather.
8 years ago
Wirawan Purwanto
cabacb58cb
* Also added dump for mount points and disk free for compute nodes.
8 years ago
Wirawan Purwanto
b6d22cf68b
* Added "hoststats" subcommand for summarizing host occupancy statistics
irrespective of queue.
8 years ago
Wirawan Purwanto
ebdc93e80f
* Also collect dmesg snapshot.
8 years ago
Wirawan Purwanto
879927f16e
* Added python workbench hpl_timing, for estimating/analyzing HPL timing.
8 years ago
Wirawan Purwanto
d6d71364de
* Imported initial tools for extracting HPL benchmark results.
8 years ago
Wirawan Purwanto
483c6874c0
* Use getopt to handle command-line options.
* Include a help command.
8 years ago
Wirawan Purwanto
67bc899f4a
* Fixes for unhandled/unrecognized command options.
* Documentation update.
* Added help command.
8 years ago
Wirawan Purwanto
850bd34377
* Documentation update.
8 years ago
Wirawan Purwanto
e382a5eb35
* Minor fix to strip domain name (can be truncated).
8 years ago
Wirawan Purwanto
f06803ba6c
* show-node-status.py: A toolbox to analyze node status returned by SGE.
8 years ago
Wirawan Purwanto
acfb11e010
* Initial form of documentation.
8 years ago
Wirawan Purwanto
04515dcd35
* Allow external qstat-f file for raw node status dump.
8 years ago
Wirawan Purwanto
7957b28a05
* show-node-status.py: Initial tool to replace node-slot-status.sh.
This initial edition contains only "--raw" command.
8 years ago
Wirawan Purwanto
34a7659f3d
* show-cluster-usage.py: A tool to summarize the usage of an
SGE cluster at a given snapshot in time.
At present the usage is broken down by the user;
other categories can be added in the future.
8 years ago
Wirawan Purwanto
4f28615bf0
* Added variants of node status to display (still work in progress).
8 years ago
Wirawan Purwanto
a0ad7c25bc
* Added analysis tool to summarize CPUs or group compute nodes based
on their CPUs.
8 years ago
Wirawan Purwanto
52619c3688
* Added tools to dump compute node info in batch.
8 years ago
Wirawan Purwanto
f1327c9562
* SGE: Added qconf dump tools.
Added from my hpc-explore/sge tools from late 2015 time frame.
8 years ago
Wirawan Purwanto
79e5b77df2
* bash-module-env.sh: Update was required due to incomplete pre-existing
MODULEPATH in some of Turing's compute nodes.
8 years ago
Wirawan Purwanto
6f0880c547
* bash-module-env.sh: an effort to facilitate bash support for batch
scripts.
Last modified date of this script: 2016-06-23.
8 years ago
Wirawan Purwanto
bf43a3b0b5
* sge-dump-job-status.sh: Initial version of a tool to dump desirable
SGE info from a running job.
8 years ago
Wirawan Purwanto
8b99995409
* Added find-run-hosts.sh: swiss-army tool to find hosts where a job
runs, dump the process trees, etc.
8 years ago
Wirawan Purwanto
8ae0841ca6
* pwscf: Initial version of self-submitting script to launch pwscf
calculation (version 5.3).
8 years ago
Wirawan Purwanto
7f83f897c8
* sge: Added node-slot-status.sh to aggregate slot availability per node type.
9 years ago