Caltech Center for Advanced Computing Research

Job Monitoring in an Interactive Grid Analysis Environment

Arshad Ali, Ashiq Anjum, Julian Bunn, Richard Cavanaugh, Frank van Lingen, Richard McClatchey, Harvey Newman, Wahas ur Rehman, Conrad Steenberg, Michael Thomas and Ian Willers (2004) Job Monitoring in an Interactive Grid Analysis Environment. In: Computing in High Energy Physics 2004, 2004, Interlaken, Switzerland. [CaltechCACR:2004.209]

Full text available as:

PDF - Requires Adobe Acrobat Reader or other PDF viewer.

Abstract

The grid is emerging as a great computational resource but its dynamic behavior makes the Grid environment unpredictable. Systems and networks can fail, and the introduction of more users can result in resource starvation. Once a job has been submitted for execution on the grid, monitoring becomes essential for a user to see that the job is completed in an efficient way, and to detect any problems that occur while the job is running. In current environments once a user submits a job he loses direct control over the job and the system behaves like a batch system: the user submits the job and later gets a result back. The only information a user can obtain about a job is whether it is scheduled, running, cancelled or finished. Today users are becoming increasingly interested in such analysis grid environments in which they can check the progress of the job, obtain intermediate results, terminate the job based on the progress of job or intermediate results, steer the job to other nodes to achieve better performance and check the resources consumed by the job. In order to fulfill their requirements of interactivity a mechanism is needed that can provide the user with real time access to information about different attributes of a job. In this paper we present the design of a Job Monitoring Service, a web service that will provide interactive remote job monitoring by allowing users to access different attributes of a job once it has been submitted to the interactive Grid Analysis Environment.

EPrint Type:Conference or Workshop Item (Paper)
Subjects:All Records
ID Code:54
Deposited By:Sarah M. Emery
Deposited On:16 November 2004
Record Number:CaltechCACR:2004.209
Official Persistent URL:http://resolver.caltech.edu/CaltechCACR:2004.209
Usage Policy:You are granted permission for individual, educational, research and non-commercial reproduction, distribution, display and performance of this work in any format.

Archive Staff Only: edit this record