UCSD Logo UCSD Logo For Printing Skip navigation links

Navigation

UCSD Triton Resource @ SDSC

Quick Status

Triton Resource Node Status

Saturday, November 21st 2009 02:01:01 PM PST


TCC Rack 2 Nodes Down (1)

tcc-2-72.local

TCC Rack 3 Nodes Down (1)

tcc-3-71.local

Total TCC Nodes Up: 246

Total 256GB (PDAF) Nodes Up: 20

Total 512GB (PDAFM) Nodes Up: 8

Rack 2 Up Count: 79

Rack 3 Up Count: 77

Rack 4 Up Count: 11

Rack 5 Up Count: 79

Running Interactive Parallel Batch Jobs on Triton

Interactive batch is for developing and debugging parallel codes or interactive applications like IDL. This page discusses how to make use of this capability.

Parallel Interactive Batch Job Execution


Note: The Triton Resource is now available to users in full production mode. Configuration and testing of Triton is complete. The Triton Compute Cluster (TCC) and Petascale Data Analysis Facility (PDAF) are using TAPP accounts to charge users for compute time as of Monday, October 5, 2009.

Early Adopter accounts have been converted to trial accounts and provisioned with 1000 complimentary SUs.

TAPP, the Triton Affiliates and Partners Program, is the prescribed way to manage your access.

Triton staff maintain a Discussion List to which all Triton users are encouraged to subscribe. Members can post questions and comments to Triton Discussion List (triton-discuss@sdsc.edu) to obtain help and support for issues and community feedback.

  1. Commands
  2. In order to run interactive parallel batch jobs on Triton, use the command:

    qsub -I

    which will provide a login to the launch node as well as the PBS_NODEFILE file with all nodes assigned to the interactive job.

    Other qsub options can be used, such as those described by the
    man qsub command.

    As with any job, the interactive job will wait in the queue until the specified number of nodes become available. Requesting fewer nodes and shorter wall clock times may reduce the wait time because the job can more easily backfill among larger jobs.

    The showbf command gives information on available time slots:

    Partition     Tasks  Nodes      Duration   StartOffset       StartDate
    ---------     -----  -----  ------------  ------------  --------------
    ALL               8      8      INFINITY      00:00:00  13:45:30_04/03
    
    
    This command will provide an accurate prediction of when the submitted job will be allowed to run.

    The exit command will end the interactive job.

  3. Example
  4. To run an interactive job with a wall clock limit of 30 minutes, using two nodes and two processors per node:

    $ qsub -I -V -l walltime=00:30:00,nodes=2:ppn=2
    qsub: waiting for job 75.triton-42.sdsc.edu to start
    qsub: job 75.triton-42.sdsc.edu ready
    
    $ echo $PBS_NODEFILE
    /opt/torque/aux/75.triton-42.sdsc.edu
    
    $ more /opt/torque/aux/75.triton-42.sdsc.edu
    tcc-2-31
    tcc-2-31
    tcc-2-25
    tcc-2-25
    
    $ mpirun -machinefile /opt/torque/aux/75.triton-42.sdsc.edu -np 4 hostname
    tcc-2-25.local
    tcc-2-25.local
    tcc-2-31.local
    tcc-2-31.local
    

On This Page

Contact Us

Open a Ticket with Triton Resource Support using the Support Ticket Form.

Join the Discussion Forum Sign up for our Email Discussion List.

Follow Triton on Twitter

FAQ Read the FAQ Page.

Terms of Use | Privacy