Version 1.8.10
- Made condor the default dispatcher at BNL (LH)
- Reshaped configuration format around gatekeeper (LH)
- Added a class of policy that will dynamically configure its self from an information service (LH)
- Added proxy setup for small site config (LH)
- Starting running the grid setup.csh script by default for all grid jobs.(LH\JL)
- Added buffer to vors information service in case of outage.(LH)
- Changed recovery of OSG variables to check local environment first.(LH)
- Made all local jobs run the users .cshrc.(LH)
- Modification to file splitting algorithm to get closer to optimal job size.(LH)
- Started dropping non-MuDst file from jobs with xrootd fileListSyntax (LH)
- Bug fixed, input from catalog and fileslist at the same time could cause program to exit.(LH)
- Added xrootddev file syntax, reshaped file syntax handling by site in config file.(JL/LH/PJ)
- Small change to virtual resources for BNL LSF cluster (LH\JL).
- Changed the ordering of the INPUTFILE[n] variables in SUMS from a lexigraphic sorting order to a numeric sort.(LH\GV)
- Bug fix for SUMS putting one too many slashes in xrootd url if it comes from the filelist input.(LH)
- Bug fix for min and max memory getting flipped when a job is resubmitted.(LH)
- Added CSH variable to display the number of times a job was resubmitted.(LH)
- Modified queue testing to work with new configuration(LH)
Version 1.8.9
- Complete ground up redesign of catalog and dataset processing (LH)
- Allowed xrootd to take HPSS files (PJ/LH)
- Added opportunistic condor policy bnl_condor_rcf (LH)
- Reshaped the bnl_condor_xxx policys to reflect changes done by the RCF to the pools (LH)
- Printed out location of log file, so users can find it faster (LH)
- Removed -m option when submitting with bnl_lsf_prod (LH)
- Added -debug [level] option to start submit command (LH)
- Added more pie charts for catalog queries on the statistics page (LH)
- Reshaped local sandbox to mirror directories and link only files (LH)
- Added "-u ie" option to ignore most errors when trying to submit a job (LH)
- Allowed the use of xrootd type syntax of filelist (PJ/LH)
- The default file name $JOBID.out or $JOBID.err will be added to the output or error if the user only specifies the path without the file name (LH)
- Changed the Process ID environment variable $PROCESSID to $JOBINDEX so it is not confused with the process ID on the OS or the LSF process ID (LH)
- Added environment variables for submitting node, submitting path, submitting time(GMT) and grid cert (LH)
- Added SUMS version information to session file, so that user must use same version to resubmit (LH)
Version 1.8.8
- Changed logger for java sdk logger to log4J (LH)
- Added several new policies (LH)
- Added a queue checking mechanism (LH)
- Change passive policy to send rootd and Xrood jobs to the local queue (LH)
- The resubmit command now works with just the job ID (no need to type full file name) but must still be used in the same directory (LH)
Version 1.8.7
- U-JDL change (SandBox added) (JL)
- Implementation of zip / local / packman sandboxes (LH)
- Added allowance for both rhic.bnl.gov and rcf.bnl.gov to be used as the site identifier. (JL \ LH)
- Changed FILEBASENAME to blank when not initialized. (LH)
- Multiple bug fixs to stop file splitting algorithm from hanging.(LH)
- Catalog queries are collected as statistics information.(LH)
- Added a -u grid option to force grid dispatching even when the destination is local.(LH)
- The -h option needed to be fixed (JL)
- The number of retries for submitting a job on the RCAS cluster at BNL has been dropped to zero.(LH)
Version 1.8.6
- Updated MySQL drivers so they are compatible with MySQL version 4.1 (LH)
- A �site [domainName] switch is now required to lunch the scheduler jar (see star-submit script)(LH)
- Reshape of layout of config file, changed much logic, converted old config files ,and added new classes (site, GateKeeper, BatchSystem, ConfigToolkit)(LH)
- The elements\objects logConf, statisticsConf, programLocations, and defaultPolicy have been moved inside the site object, see set/get function.
- The $SCRATCH is now located in programLocations. The job references it via queue --> batchSystem --> site --> ProgramLocations --> $SCRATCH
- The Condor-g and Condor_gLSF dispatchers have there globusscheduler priority set indirectly (not with set and get functions) by parsing the site structure.
- Improved the way domainName name is acquired in star-submit script (JL)
- Fix div by zero error (LH)
- Added an xrootd fileListSyntax option. (PJ)
- Set the default startup directory to $SCRATCH, with all dispatchers.(LH)
- Unified CondorGDispatcher and CondorGLSFDispatcher into the CondorGRSLDispatcher. (LH)
- Changed default limitation of the number of INPUTFILE(n) environment variables to 200 and added warning to outputstream.(LH)
- From UNIX the path is now recovered using the �echo $PWD� command instead of the java.io.File.getAbsolutePath() method. (JL + LH)
- The filelist syntax and the INPUTFILE(n) environment variables syntax have been made to be identical all the time, and jobs that use the rootd syntax are no longer sent to a specific node.(LH)
- Fixed bug in syntax of memory requirement for LSF dispatcher.(LH)
Version 1.8.2
- Fixed bug : nFiles ignored when used with the file list in the input tag. (LH)
- Appended to the fromScratch syntax a copy, link and register, these are mostly intended for grid use. (LH)
- U-JDL Schema changes for accommodating fromScratch syntax expansion. (JL)
- Change the behavior of the SGE, LSF and PDS dispatchers so that if the queue name is null or of zero length no queue option is used in the submit command.(LH)
- Stripped directory out of executable, because of inconsistency of directory path on some nodes.
- Modified the statistics collection module to send the queue ID instead of the queue name.
Version 1.8.0
- Grid config file has been reshaped with policies (Step1Policy, Step2Policy, Step3Policy) to better reflect our 3 step process for running on the grid.(LH)
- Object names in the RCAS config have been changed for less ambiguity.(LH)
- Gave each dispatcher its own set and get functions from there CSHApplication objects.(LH)
- Bug fix for: fromScratch tag was ignored during resubmitting or jobs. (LH)
- Behavior from fromScratch modified, a �./� before the file name means look in `pwd` and not in $SCRATCH for the files. (LH)
- This is the first deployed version of SUMS to include the RDL frame work. (DA,PH,JL,LB)
- A new Resource Strategy class working off of Resource Strategies was added, plus a base interface for Resource Strategies. (LH)
Version 1.7.9
- Filelist now supports files on distributed disk(LH)
- Bug fix for using quotes (") in catalog Queries (LH) (This patch failed and had to be pulled from the code.)
- In PDSF configuration, SGE dispatcher has been set as default (LH)
Version 1.7.7
- More reshaping of threads and streams used for dispatching. Fixes above normal rate of multiply dispatching seen in version 1.7.6. (JL)
Version 1.7.6
- Moved chmod to config file instead of hardcoded (+ full path specified) (LH)
- Logic reshape for Runtime().exec(). Especially, waitFor()
fixed and hack removed (JL)
Version 1.7.5
- star-submit-template would now get document starting with comments
or blank lines (JL)
- Our usage and monitoring JSP's / web pages and db still need updating.
- Fixed globus-url-copy and undid hard coding of globus-url-copy and cp (This may be modified yet again.)(LH)
- Added maxWallTime to the PBS Dispatcher (LH)
- Added SubmitTime, DispatchTime, and SubmitSuccessful(y/n) to report file. The report file has new layout with a global view of the scheduler. (LH)
- Added a new subsystem for killing and resubmitting jobs.(LH)
- Modified star-submit to work with resubmission syntax(LH)
- Modified star-submit-template to accept multiple -entities. (JL)
Added -simulate and -debug (JL)
- Added SGE Dispatcher (LH)
- Added a max elapse time to the LSF and SGE Dispatchers (at PDSF the defualt was moved 5000ms -> 9000ms ) (LH)
Version 1.7.0
- WARNING: The statistic table has a few more columns. Please,
check res/ext/createStatDB for more information (JL)
- Extend report file information (LH)
- Modified JobID scheme for better unicity (LH/JL)
- Modified CSH wrapper for other experiment support (JL)
- Restructured source code for more efficient development (LH)
- Swapped schema validation class with a class based on the sun multi schema validator (LH)
- Extended schema / code with minMemory, maxMemory, minStorageSpace, maxStorageSpace (JL/LH)
- Added new Queue objects, for more detailed definition of queues (LH)
- Modified PassivePolicy to take advantage of new queue objects and extended schema (LH)
- Changed PassivePolicy queue assignment algorithm (LH)
- Modified CondorGLSFDispatcher to take advantage of new schema (LH)
- Modified LSFDispatcher to take advantage of new schema (LH)
- Added the install.pl script (JL)
- Added CondorGPBSDispatcher (AW/JL)
- Added version number and date of submits to the log file (LH)
Version 1.6.2
- Display submit error to STDOUT (user feedback)
- Added XML error checking that works off of an internal xml schema.
- Added environment variable FILEBASENAME
- Changes EnvVariableLimit (still unused for now)
- Changes to pdsfConfig.xml
- Developper issue. Fixes in Ant script now takes care of the memory
leak and an issue with installing script files from Windows to
Unix.
Version 1.6.1
- Extended with a new mode nameEqualValueColumnSeparated for
handling PDSF style resources.
- Added EnvVariableLimit (unused for now)
- Added message when file are not found (was confusing to users when a list
was passed)
- Change a few message typos, updated URL
Version 1.6
- First GRID implementation ready
- Review of policies and dispatchers
- Enables to submit multiple processes with no inputs (see nProcesses)
- BUGFIX: the variable substitution didn't work when the variable was at the
end of the filenames
Version 1.5.2
- Enables trim of all letters of disk vault names (fixes PDSF resources)
- Allows to specify a directory as an output
Version 1.5.1
- Enables trim of the decimal part of disk vault names (fixes PDSF resources)
Version 1.5.0_01
User visible changes:
- Fixed a bug with LSF resources that would give: String index out of range:
-1
Version 1.5.0
User visible changes:
- The scripts produced by the scheduler will use the full path for /bin/cp
and /bin/rm
- Condor submission has been fixed
- Filelists are enabled within the request, by using the filelist:/path/name
URL
- For the queries, preferStorage is set automatically to local for big
requests, and to NFS for small requests (<= 100 files)
- Added orderInput keyword to order the input files within a list
Developer visible changes:
- XML scheduler configuration based on java bean XML serialization
- Source added in the deployed jar
Version 1.3.2
- Refactoring code for the CSH application.
- CondorGLSFDispatcher added: uses CondorG to dispatch on LSF, with some
extra variable added to the globus lsf-jobmanager to handle extended options.
- CondorGDispatcher added: to be used on the RCF Condor pool.
- LSF Resources revised.
- Bugfix - Checks whether the output directory exists before dispatching.
Version 1.3
- filesPerHour tag added: the scheduler can now decide on which queue to dispatch.
It will try to dispatch on a short queue, if minFilesPerProcess allows it.
- fileListSyntax tag added: we call "paths" the syntax the scheduler has been
using; a new one "rootd" is available. This is recognized by the MuDST maker,
and allows some advanced features. Consul the manual for details.
- Bugfix - the wildcard resolution wouldn't work if ls was aliased to something
else.
Version 1.2
- It is possible to discard the stdout and the stderr, by using discard="true"
- Entities are now allowed in the command line.
- Writes a report containing all the nodes, the number of files assigned to
each location and the number of processes
- Added minFilesPerProcess: it's doesn't provide the perfect solution, but it
might be sufficient.
- Bugfix - statistics are recorded in chunks (no more OutOfMemoryError at the
end)
- Bugfix - simulated submission wasn't formatted properly
- ADMIN - more data is being reported through the statistics
- ADMIN - file catalog implementation is set through the configuration file
- ADMIN - different LSF resource strategies for RCF and PDSF
- ADMIN - jobs can be dispatched on different queues depending on whether they
are using local files
Version 1.1
- Retries bsub up to 5 times. If it doesn't succeed goes ahead to the next process
- "name" property in the job tag (mapped to LSF Job name "-J")
- Bugfix - Catalog queries with < or > were not possible
- ADMIN - Log level in properties
- ADIMN - Added LSF resource usage
- ADIMN - New bsub retry configuration properties
- DEV - Process can have a different command line than the job request
- DEV - CondorG dispatching (experimental)
- DEV - SiteForwardPolicy
Version 1.0 RC 2
User
- Message displayed when directory for stderr doesn't exist
- Fixed a bug that prevent to specify the stdin
Version 1.0 RC 1
User
- Reversed order for process submission: displays the biggest number first and
counts to 0
- Allows preferStorage: when multiple copies of the same files are found, you
can specify which copy you prefer depending on it's storage type; if more than
one files are found in the preferred storage, a random one is chosen. (Consult
the manual at "input" element "preferStorage" attribute)
- Better comments in the script: each script also contain the bsub command used
to execute it; makes it easier to resubmit in case of problems.
- Job output specification and scratch space: each process will have a local
scratch directory to work into ($SCRATCH) and in the XML file you can use the
<output> tag to specify which files to bring back after the process is finished.
(Consult the manual on the "output" element)
- Name scheme changed: sched$JOBID.csh for the script and sched$JOBID.list for
the file list.
Developer
- Revised build and development environment
- Better separation of star catalog specifics
- LSFDispather refactoring
- Old code eliminated
- Inilitializer, Policy and Dispatcher are set from the properties file
- Log directory set in the properties file
- $SCRATCH directory set in the properties file
Version 1.0 beta 9
- LSF queue name and bsub extra option can be set through scheduler.properties
- Timeout on the bsub command
Version 1.0 beta 8
- job not submitted if queries and wildcards return no input
- Wildcard added for files on AFS/NFS
- attribute simulation changed to simulateSubmission
Version 1.0 beta 7
- Dispatching failures are now reported
- Dots are displayed while bsub is being called
- Dots are displayed while the catalog query is being executed
Version 1.0 beta 6
- changed filename for script and fileList
- nFiles tag added
- singleCopy tag added
Version 1.0 beta 5
- logging revised
- added the tag maxFilesPerProcess
- changed the queue on which to submit
Version 1.0 beta 4
- checks whether input files and output directories exist
- enforced XML grammar
Version 1.0 beta 3
- mail attribute added
- output stream declaration is enforced
- variable substitution for the I/O stream file names
- added exception logging
- added simulated submission
Levente Hajdu - page was last modified