adding number of open runs to monitoring
[u/mrichter/AliRoot.git] / SHUTTLE / AliShuttle.cxx
CommitLineData
73abe331 1/**************************************************************************
2 * Copyright(c) 1998-1999, ALICE Experiment at CERN, All rights reserved. *
3 * *
4 * Author: The ALICE Off-line Project. *
5 * Contributors are mentioned in the code where appropriate. *
6 * *
7 * Permission to use, copy, modify and distribute this software and its *
8 * documentation strictly for non-commercial purposes is hereby granted *
9 * without fee, provided that the above copyright notice appears in all *
10 * copies and that both the copyright notice and this permission notice *
11 * appear in the supporting documentation. The authors make no claims *
12 * about the suitability of this software for any purpose. It is *
13 * provided "as is" without express or implied warranty. *
14 **************************************************************************/
15
16/*
17$Log$
fbc112e3 18Revision 1.74 2007/12/17 03:23:32 jgrosseo
19several bugfixes
20added "empty preprocessor" as placeholder for Acorde in FDR
21
4508ab8b 22Revision 1.73 2007/12/14 19:31:36 acolla
23Sending email to DCS experts is temporarily commented
24
b5f1c82e 25Revision 1.72 2007/12/13 15:44:28 acolla
26Run type added in mail sent to detector expert (eases understanding)
27
6a1146c4 28Revision 1.71 2007/12/12 14:56:14 jgrosseo
29sending shuttle_ignore to ML also in case of 0 events
30
97e3c167 31Revision 1.70 2007/12/12 13:45:35 acolla
32Monalisa started in Collect() function. Alive message to monitor is sent at each Collect and every minute during preprocessor processing.
33
ee6f7523 34Revision 1.69 2007/12/12 10:06:29 acolla
35in AliShuttle.cxx: SHUTTLE logbook is updated in case of invalid run times:
36
37time_start==0 && time_end==0
38
39logbook is NOT updated if time_start != 0 && time_end == 0, because it may mean that the run is still ongoing.
40
1abfbb60 41Revision 1.68 2007/12/11 10:15:17 acolla
42Added marking SHUTTLE=DONE for invalid runs
43(invalid start time or end time) and runs with totalEvents < 1
44
b0e53b15 45Revision 1.67 2007/12/07 19:14:36 acolla
46in AliShuttleTrigger:
47
48Added automatic collection of new runs on a regular time basis (settable from the configuration)
49
50in AliShuttleConfig: new members
51
52- triggerWait: time to wait for DIM trigger (s) before starting automatic collection of new runs
53- mode: run mode (test, prod) -> used to build log folder (logs or logs_PROD)
54
55in AliShuttle:
56
57- logs now stored in logs/#RUN/DET_#RUN.log
58
7d4cf768 59Revision 1.66 2007/12/05 10:45:19 jgrosseo
60changed order of arguments to TMonaLisaWriter
61
468df1ce 62Revision 1.65 2007/11/26 16:58:37 acolla
63Monalisa configuration added: host and table name
64
b832ec02 65Revision 1.64 2007/11/13 16:15:47 acolla
66DCS map is stored in a file in the temp folder where the detector is processed.
67If the preprocessor fails, the temp folder is not removed. This will help the debugging of the problem.
68
b1d18693 69Revision 1.63 2007/11/02 10:53:16 acolla
70Protection added to AliShuttle::CopyFileLocally
71
7d43a416 72Revision 1.62 2007/10/31 18:23:13 acolla
73Furter developement on the Shuttle:
74
75- Shuttle now connects to the Grid as alidaq. The OCDB and Reference folders
76are now built from /alice/data, e.g.:
77/alice/data/2007/LHC07a/OCDB
78
79the year and LHC period are taken from the Shuttle.
80Raw metadata files are stored by GRP to:
81/alice/data/2007/LHC07a/<runNb>/Raw/RunMetadata.root
82
83- Shuttle sends a mail to DCS experts each time DP retrieval fails.
84
675f64cd 85Revision 1.61 2007/10/30 20:33:51 acolla
86Improved managing of temporary folders, which weren't correctly handled.
87Resolved bug introduced in StoreReferenceFile, which caused SPD preprocessor fail.
88
d524ade6 89Revision 1.60 2007/10/29 18:06:16 acolla
90
91New function StoreRunMetadataFile added to preprocessor and Shuttle interface
92This function can be used by GRP only. It stores raw data tags merged file to the
93raw data folder (e.g. /alice/data/2008/LHC08a/000099999/Raw).
94
95KNOWN ISSUES:
96
971. Shuttle cannot write to /alice/data/ because it belongs to alidaq. Tag file is stored in /alice/simulation/... for the time being.
982. Due to a bug in TAlien::Mkdir, the creation of a folder in recursive mode (-p option) does not work. The problem
99has been corrected in the root package on the Shuttle machine.
100
c88ad5db 101Revision 1.59 2007/10/05 12:40:55 acolla
102
103Result error code added to AliDCSClient data members (it was "lost" with the new implementation of TMap* GetAliasValues and GetDPValues).
104
1790d4b7 105Revision 1.58 2007/09/28 15:27:40 acolla
106
107AliDCSClient "multiSplit" option added in the DCS configuration
108in AliDCSMessage: variable MAX_BODY_SIZE set to 500000
109
542b6cc8 110Revision 1.57 2007/09/27 16:53:13 acolla
111Detectors can have more than one AMANDA server. SHUTTLE queries the servers sequentially,
112merges the dcs aliases/DPs in one TMap and sends it to the preprocessor.
113
1d172743 114Revision 1.56 2007/09/14 16:46:14 jgrosseo
1151) Connect and Close are called before and after each query, so one can
116keep the same AliDCSClient object.
1172) The splitting of a query is moved to GetDPValues/GetAliasValues.
1183) Splitting interval can be specified in constructor
119
b41b252a 120Revision 1.55 2007/08/06 12:26:40 acolla
121Function Bool_t GetHLTStatus added to preprocessor. It returns the status of HLT
122read from the run logbook.
123
4859271b 124Revision 1.54 2007/07/12 09:51:25 jgrosseo
125removed duplicated log message in GetFile
126
4f0749a8 127Revision 1.53 2007/07/12 09:26:28 jgrosseo
128updating hlt fxs base path
129
42fde080 130Revision 1.52 2007/07/12 08:06:45 jgrosseo
131adding log messages in getfile... functions
132adding not implemented copy constructor in alishuttleconfigholder
133
1bcd28db 134Revision 1.51 2007/07/03 17:24:52 acolla
135root moved to v5-16-00. TFileMerger->Cp moved to TFile::Cp.
136
a986b218 137Revision 1.50 2007/07/02 17:19:32 acolla
138preprocessor is run in a temp directory that is removed when process is finished.
139
5bac2bde 140Revision 1.49 2007/06/29 10:45:06 acolla
141Number of columns in MySql Shuttle logbook increased by one (HLT added)
142
db99d43e 143Revision 1.48 2007/06/21 13:06:19 acolla
144GetFileSources returns dummy list with 1 source if system=DCS (better than
145returning error as it was)
146
6297b37d 147Revision 1.47 2007/06/19 17:28:56 acolla
148HLT updated; missing map bug removed.
149
dc25836b 150Revision 1.46 2007/06/09 13:01:09 jgrosseo
151Switching to retrieval of several DCS DPs at a time (multiDPrequest)
152
a038aa70 153Revision 1.45 2007/05/30 06:35:20 jgrosseo
154Adding functionality to the Shuttle/TestShuttle:
155o) Function to retrieve list of sources from a given system (GetFileSources with id=0)
156o) Function to retrieve list of IDs for a given source (GetFileIDs)
157These functions are needed for dealing with the tag files that are saved for the GRP preprocessor
158Example code has been added to the TestProcessor in TestShuttle
159
4a33bdd9 160Revision 1.44 2007/05/11 16:09:32 acolla
161Reference files for ITS, MUON and PHOS are now stored in OfflineDetName/OnlineDetName/run_...
162example: ITS/SPD/100_filename.root
163
2d9019b4 164Revision 1.43 2007/05/10 09:59:51 acolla
165Various bug fixes in StoreRefFilesToGrid; Cleaning of reference storage before processing detector (CleanReferenceStorage)
166
546242fb 167Revision 1.42 2007/05/03 08:01:39 jgrosseo
168typo in last commit :-(
169
8b739301 170Revision 1.41 2007/05/03 08:00:48 jgrosseo
171fixing log message when pp want to skip dcs value retrieval
172
651fdaab 173Revision 1.40 2007/04/27 07:06:48 jgrosseo
174GetFileSources returns empty list in case of no files, but successful query
175No mails sent in testmode
176
86aa42c3 177Revision 1.39 2007/04/17 12:43:57 acolla
178Correction in StoreOCDB; change of text in mail to detector expert
179
26758fce 180Revision 1.38 2007/04/12 08:26:18 jgrosseo
181updated comment
182
3c2a21c8 183Revision 1.37 2007/04/10 16:53:14 jgrosseo
184redirecting sub detector stdout, stderr to sub detector log file
185
3d8bc902 186Revision 1.35 2007/04/04 16:26:38 acolla
1871. Re-organization of function calls in TestPreprocessor to make it more meaningful.
1882. Added missing dependency in test preprocessors.
1893. in AliShuttle.cxx: processing time and memory consumption info on a single line.
190
886d60e6 191Revision 1.34 2007/04/04 10:33:36 jgrosseo
1921) Storing of files to the Grid is now done _after_ your preprocessors succeeded. This is transparent, which means that you can still use the same functions (Store, StoreReferenceData) to store files to the Grid. However, the Shuttle first stores them locally and transfers them after the preprocessor finished. The return code of these two functions has changed from UInt_t to Bool_t which gives you the success of the storing.
193In case of an error with the Grid, the Shuttle will retry the storing later, the preprocessor does not need to be run again.
194
1952) The meaning of the return code of the preprocessor has changed. 0 is now success and any other value means failure. This value is stored in the log and you can use it to keep details about the error condition.
196
1973) New function StoreReferenceFile to _directly_ store a file (without opening it) to the reference storage.
198
1994) The memory usage of the preprocessor is monitored. If it exceeds 2 GB it is terminated.
200
2015) New function AliPreprocessor::ProcessDCS(). If you do not need to have DCS data in all cases, you can skip the processing by implemting this function and returning kFALSE under certain conditions. E.g. if there is a certain run type.
202If you always need DCS data (like before), you do not need to implement it.
203
2046) The run type has been added to the monitoring page
205
9827400b 206Revision 1.33 2007/04/03 13:56:01 acolla
207Grid Storage at the end of preprocessing. Added virtual method to disable DCS query according to the
208run type.
209
3301427a 210Revision 1.32 2007/02/28 10:41:56 acolla
211Run type field added in SHUTTLE framework. Run type is read from "run type" logbook and retrieved by
212AliPreprocessor::GetRunType() function.
213Added some ldap definition files.
214
d386d623 215Revision 1.30 2007/02/13 11:23:21 acolla
216Moved getters and setters of Shuttle's main OCDB/Reference, local
217OCDB/Reference, temp and log folders to AliShuttleInterface
218
9d733021 219Revision 1.27 2007/01/30 17:52:42 jgrosseo
220adding monalisa monitoring
221
e7f62f16 222Revision 1.26 2007/01/23 19:20:03 acolla
223Removed old ldif files, added TOF, MCH ldif files. Added some options in
224AliShuttleConfig::Print. Added in Ali Shuttle: SetShuttleTempDir and
225SetShuttleLogDir
226
36c99a6a 227Revision 1.25 2007/01/15 19:13:52 acolla
228Moved some AliInfo to AliDebug in SendMail function
229
fc5a4708 230Revision 1.21 2006/12/07 08:51:26 jgrosseo
231update (alberto):
232table, db names in ldap configuration
233added GRP preprocessor
234DCS data can also be retrieved by data point
235
2c15234c 236Revision 1.20 2006/11/16 16:16:48 jgrosseo
237introducing strict run ordering flag
238removed giving preprocessor name to preprocessor, they have to know their name themselves ;-)
239
be48e3ea 240Revision 1.19 2006/11/06 14:23:04 jgrosseo
241major update (Alberto)
242o) reading of run parameters from the logbook
243o) online offline naming conversion
244o) standalone DCSclient package
245
eba76848 246Revision 1.18 2006/10/20 15:22:59 jgrosseo
247o) Adding time out to the execution of the preprocessors: The Shuttle forks and the parent process monitors the child
248o) Merging Collect, CollectAll, CollectNew function
249o) Removing implementation of empty copy constructors (declaration still there!)
250
cb343cfd 251Revision 1.17 2006/10/05 16:20:55 jgrosseo
252adapting to new CDB classes
253
6ec0e06c 254Revision 1.16 2006/10/05 15:46:26 jgrosseo
255applying to the new interface
256
481441a2 257Revision 1.15 2006/10/02 16:38:39 jgrosseo
258update (alberto):
259fixed memory leaks
260storing of objects that failed to be stored to the grid before
261interfacing of shuttle status table in daq system
262
2bb7b766 263Revision 1.14 2006/08/29 09:16:05 jgrosseo
264small update
265
85a80aa9 266Revision 1.13 2006/08/15 10:50:00 jgrosseo
267effc++ corrections (alberto)
268
4f0ab988 269Revision 1.12 2006/08/08 14:19:29 jgrosseo
270Update to shuttle classes (Alberto)
271
272- Possibility to set the full object's path in the Preprocessor's and
273Shuttle's Store functions
274- Possibility to extend the object's run validity in the same classes
275("startValidity" and "validityInfinite" parameters)
276- Implementation of the StoreReferenceData function to store reference
277data in a dedicated CDB storage.
278
84090f85 279Revision 1.11 2006/07/21 07:37:20 jgrosseo
280last run is stored after each run
281
7bfb2090 282Revision 1.10 2006/07/20 09:54:40 jgrosseo
283introducing status management: The processing per subdetector is divided into several steps,
284after each step the status is stored on disk. If the system crashes in any of the steps the Shuttle
285can keep track of the number of failures and skips further processing after a certain threshold is
286exceeded. These thresholds can be configured in LDAP.
287
5164a766 288Revision 1.9 2006/07/19 10:09:55 jgrosseo
289new configuration, accesst to DAQ FES (Alberto)
290
57f50b3c 291Revision 1.8 2006/07/11 12:44:36 jgrosseo
292adding parameters for extended validity range of data produced by preprocessor
293
17111222 294Revision 1.7 2006/07/10 14:37:09 jgrosseo
295small fix + todo comment
296
e090413b 297Revision 1.6 2006/07/10 13:01:41 jgrosseo
298enhanced storing of last sucessfully processed run (alberto)
299
a7160fe9 300Revision 1.5 2006/07/04 14:59:57 jgrosseo
301revision of AliDCSValue: Removed wrapper classes, reduced storage size per value by factor 2
302
45a493ce 303Revision 1.4 2006/06/12 09:11:16 jgrosseo
304coding conventions (Alberto)
305
58bc3020 306Revision 1.3 2006/06/06 14:26:40 jgrosseo
307o) removed files that were moved to STEER
308o) shuttle updated to follow the new interface (Alberto)
309
b948db8d 310Revision 1.2 2006/03/07 07:52:34 hristov
311New version (B.Yordanov)
312
d477ad88 313Revision 1.6 2005/11/19 17:19:14 byordano
314RetrieveDATEEntries and RetrieveConditionsData added
315
316Revision 1.5 2005/11/19 11:09:27 byordano
317AliShuttle declaration added
318
319Revision 1.4 2005/11/17 17:47:34 byordano
320TList changed to TObjArray
321
322Revision 1.3 2005/11/17 14:43:23 byordano
323import to local CVS
324
325Revision 1.1.1.1 2005/10/28 07:33:58 hristov
326Initial import as subdirectory in AliRoot
327
73abe331 328Revision 1.2 2005/09/13 08:41:15 byordano
329default startTime endTime added
330
331Revision 1.4 2005/08/30 09:13:02 byordano
332some docs added
333
334Revision 1.3 2005/08/29 21:15:47 byordano
335some docs added
336
337*/
338
339//
340// This class is the main manager for AliShuttle.
341// It organizes the data retrieval from DCS and call the
b948db8d 342// interface methods of AliPreprocessor.
73abe331 343// For every detector in AliShuttleConfgi (see AliShuttleConfig),
344// data for its set of aliases is retrieved. If there is registered
b948db8d 345// AliPreprocessor for this detector then it will be used
346// accroding to the schema (see AliPreprocessor).
347// If there isn't registered AliPreprocessor than the retrieved
73abe331 348// data is stored automatically to the undelying AliCDBStorage.
349// For detSpec is used the alias name.
350//
351
352#include "AliShuttle.h"
353
354#include "AliCDBManager.h"
355#include "AliCDBStorage.h"
356#include "AliCDBId.h"
84090f85 357#include "AliCDBRunRange.h"
358#include "AliCDBPath.h"
5164a766 359#include "AliCDBEntry.h"
73abe331 360#include "AliShuttleConfig.h"
eba76848 361#include "DCSClient/AliDCSClient.h"
73abe331 362#include "AliLog.h"
b948db8d 363#include "AliPreprocessor.h"
5164a766 364#include "AliShuttleStatus.h"
2bb7b766 365#include "AliShuttleLogbookEntry.h"
73abe331 366
57f50b3c 367#include <TSystem.h>
58bc3020 368#include <TObject.h>
b948db8d 369#include <TString.h>
57f50b3c 370#include <TTimeStamp.h>
73abe331 371#include <TObjString.h>
57f50b3c 372#include <TSQLServer.h>
373#include <TSQLResult.h>
374#include <TSQLRow.h>
cb343cfd 375#include <TMutex.h>
9827400b 376#include <TSystemDirectory.h>
377#include <TSystemFile.h>
a986b218 378#include <TFile.h>
9827400b 379#include <TGrid.h>
380#include <TGridResult.h>
73abe331 381
e7f62f16 382#include <TMonaLisaWriter.h>
383
5164a766 384#include <fstream>
385
cb343cfd 386#include <sys/types.h>
387#include <sys/wait.h>
388
73abe331 389ClassImp(AliShuttle)
390
b948db8d 391//______________________________________________________________________________________________
392AliShuttle::AliShuttle(const AliShuttleConfig* config,
393 UInt_t timeout, Int_t retries):
4f0ab988 394fConfig(config),
395fTimeout(timeout), fRetries(retries),
396fPreprocessorMap(),
2bb7b766 397fLogbookEntry(0),
eba76848 398fCurrentDetector(),
85a80aa9 399fStatusEntry(0),
cb343cfd 400fMonitoringMutex(0),
eba76848 401fLastActionTime(0),
e7f62f16 402fLastAction(),
9827400b 403fMonaLisa(0),
404fTestMode(kNone),
ffa29e93 405fReadTestMode(kFALSE),
406fOutputRedirected(kFALSE)
73abe331 407{
408 //
409 // config: AliShuttleConfig used
73abe331 410 // timeout: timeout used for AliDCSClient connection
411 // retries: the number of retries in case of connection error.
412 //
413
57f50b3c 414 if (!fConfig->IsValid()) AliFatal("********** !!!!! Invalid configuration !!!!! **********");
be48e3ea 415 for(int iSys=0;iSys<4;iSys++) {
57f50b3c 416 fServer[iSys]=0;
be48e3ea 417 if (iSys < 3)
2c15234c 418 fFXSlist[iSys].SetOwner(kTRUE);
57f50b3c 419 }
2bb7b766 420 fPreprocessorMap.SetOwner(kTRUE);
be48e3ea 421
422 for (UInt_t iDet=0; iDet<NDetectors(); iDet++)
423 fFirstUnprocessed[iDet] = kFALSE;
424
cb343cfd 425 fMonitoringMutex = new TMutex();
58bc3020 426}
427
b948db8d 428//______________________________________________________________________________________________
57f50b3c 429AliShuttle::~AliShuttle()
58bc3020 430{
9827400b 431 //
432 // destructor
433 //
58bc3020 434
b948db8d 435 fPreprocessorMap.DeleteAll();
be48e3ea 436 for(int iSys=0;iSys<4;iSys++)
57f50b3c 437 if(fServer[iSys]) {
438 fServer[iSys]->Close();
439 delete fServer[iSys];
eba76848 440 fServer[iSys] = 0;
57f50b3c 441 }
2bb7b766 442
443 if (fStatusEntry){
444 delete fStatusEntry;
445 fStatusEntry = 0;
446 }
cb343cfd 447
448 if (fMonitoringMutex)
449 {
450 delete fMonitoringMutex;
451 fMonitoringMutex = 0;
452 }
73abe331 453}
454
b948db8d 455//______________________________________________________________________________________________
57f50b3c 456void AliShuttle::RegisterPreprocessor(AliPreprocessor* preprocessor)
58bc3020 457{
73abe331 458 //
b948db8d 459 // Registers new AliPreprocessor.
73abe331 460 // It uses GetName() for indentificator of the pre processor.
461 // The pre processor is registered it there isn't any other
462 // with the same identificator (GetName()).
463 //
464
eba76848 465 const char* detName = preprocessor->GetName();
466 if(GetDetPos(detName) < 0)
467 AliFatal(Form("********** !!!!! Invalid detector name: %s !!!!! **********", detName));
468
469 if (fPreprocessorMap.GetValue(detName)) {
470 AliWarning(Form("AliPreprocessor %s is already registered!", detName));
73abe331 471 return;
472 }
473
eba76848 474 fPreprocessorMap.Add(new TObjString(detName), preprocessor);
73abe331 475}
b948db8d 476//______________________________________________________________________________________________
3301427a 477Bool_t AliShuttle::Store(const AliCDBPath& path, TObject* object,
84090f85 478 AliCDBMetaData* metaData, Int_t validityStart, Bool_t validityInfinite)
73abe331 479{
9827400b 480 // Stores a CDB object in the storage for offline reconstruction. Objects that are not needed for
481 // offline reconstruction, but should be stored anyway (e.g. for debugging) should NOT be stored
482 // using this function. Use StoreReferenceData instead!
483 // It calls StoreLocally function which temporarily stores the data locally; when the preprocessor
484 // finishes the data are transferred to the main storage (Grid).
b948db8d 485
3301427a 486 return StoreLocally(fgkLocalCDB, path, object, metaData, validityStart, validityInfinite);
84090f85 487}
488
489//______________________________________________________________________________________________
3301427a 490Bool_t AliShuttle::StoreReferenceData(const AliCDBPath& path, TObject* object, AliCDBMetaData* metaData)
84090f85 491{
9827400b 492 // Stores a CDB object in the storage for reference data. This objects will not be available during
493 // offline reconstrunction. Use this function for reference data only!
494 // It calls StoreLocally function which temporarily stores the data locally; when the preprocessor
495 // finishes the data are transferred to the main storage (Grid).
85a80aa9 496
3301427a 497 return StoreLocally(fgkLocalRefStorage, path, object, metaData);
85a80aa9 498}
499
500//______________________________________________________________________________________________
3301427a 501Bool_t AliShuttle::StoreLocally(const TString& localUri,
85a80aa9 502 const AliCDBPath& path, TObject* object, AliCDBMetaData* metaData,
503 Int_t validityStart, Bool_t validityInfinite)
504{
9827400b 505 // Store object temporarily in local storage. Parameters are passed by Store and StoreReferenceData functions.
506 // when the preprocessor finishes the data are transferred to the main storage (Grid).
507 // The parameters are:
508 // 1) Uri of the backup storage (Local)
509 // 2) the object's path.
510 // 3) the object to be stored
511 // 4) the metaData to be associated with the object
512 // 5) the validity start run number w.r.t. the current run,
513 // if the data is valid only for this run leave the default 0
514 // 6) specifies if the calibration data is valid for infinity (this means until updated),
515 // typical for calibration runs, the default is kFALSE
516 //
517 // returns 0 if fail, 1 otherwise
84090f85 518
9827400b 519 if (fTestMode & kErrorStorage)
520 {
521 Log(fCurrentDetector, "StoreLocally - In TESTMODE - Simulating error while storing locally");
522 return kFALSE;
523 }
524
3301427a 525 const char* cdbType = (localUri == fgkLocalCDB) ? "CDB" : "Reference";
2bb7b766 526
85a80aa9 527 Int_t firstRun = GetCurrentRun() - validityStart;
84090f85 528 if(firstRun < 0) {
9827400b 529 AliWarning("First valid run happens to be less than 0! Setting it to 0.");
84090f85 530 firstRun=0;
531 }
532
533 Int_t lastRun = -1;
534 if(validityInfinite) {
535 lastRun = AliCDBRunRange::Infinity();
536 } else {
537 lastRun = GetCurrentRun();
538 }
539
3301427a 540 // Version is set to current run, it will be used later to transfer data to Grid
541 AliCDBId id(path, firstRun, lastRun, GetCurrentRun(), -1);
2bb7b766 542
543 if(! dynamic_cast<TObjString*> (metaData->GetProperty("RunUsed(TObjString)"))){
544 TObjString runUsed = Form("%d", GetCurrentRun());
9e080f92 545 metaData->SetProperty("RunUsed(TObjString)", runUsed.Clone());
2bb7b766 546 }
84090f85 547
3301427a 548 Bool_t result = kFALSE;
84090f85 549
3301427a 550 if (!(AliCDBManager::Instance()->GetStorage(localUri))) {
551 Log("SHUTTLE", Form("StoreLocally - Cannot activate local %s storage", cdbType));
84090f85 552 } else {
3301427a 553 result = AliCDBManager::Instance()->GetStorage(localUri)
84090f85 554 ->Put(object, id, metaData);
555 }
556
557 if(!result) {
558
9827400b 559 Log(fCurrentDetector, Form("StoreLocally - Can't store object <%s>!", id.ToString().Data()));
3301427a 560 }
2bb7b766 561
3301427a 562 return result;
563}
84090f85 564
3301427a 565//______________________________________________________________________________________________
566Bool_t AliShuttle::StoreOCDB()
567{
9827400b 568 //
569 // Called when preprocessor ends successfully or when previous storage attempt failed (kStoreError status)
570 // Calls underlying StoreOCDB(const char*) function twice, for OCDB and Reference storage.
571 // Then calls StoreRefFilesToGrid to store reference files.
572 //
573
574 if (fTestMode & kErrorGrid)
575 {
576 Log("SHUTTLE", "StoreOCDB - In TESTMODE - Simulating error while storing in the Grid");
577 Log(fCurrentDetector, "StoreOCDB - In TESTMODE - Simulating error while storing in the Grid");
578 return kFALSE;
579 }
580
c88ad5db 581 Log("SHUTTLE","StoreOCDB - Storing OCDB data ...");
86aa42c3 582 Bool_t resultCDB = StoreOCDB(fgkMainCDB);
583
c88ad5db 584 Log("SHUTTLE","StoreOCDB - Storing reference data ...");
3301427a 585 Bool_t resultRef = StoreOCDB(fgkMainRefStorage);
9827400b 586
c88ad5db 587 Log("SHUTTLE","StoreOCDB - Storing reference files ...");
588 Bool_t resultRefFiles = CopyFilesToGrid("reference");
589
590 Bool_t resultMetadata = kTRUE;
591 if(fCurrentDetector == "GRP")
592 {
593 Log("StoreOCDB - SHUTTLE","Storing Run Metadata file ...");
594 resultMetadata = CopyFilesToGrid("metadata");
595 }
9827400b 596
c88ad5db 597 return resultCDB && resultRef && resultRefFiles && resultMetadata;
3301427a 598}
599
600//______________________________________________________________________________________________
601Bool_t AliShuttle::StoreOCDB(const TString& gridURI)
602{
603 //
604 // Called by StoreOCDB(), performs actual storage to the main OCDB and reference storages (Grid)
605 //
606
607 TObjArray* gridIds=0;
608
609 Bool_t result = kTRUE;
610
611 const char* type = 0;
612 TString localURI;
613 if(gridURI == fgkMainCDB) {
614 type = "OCDB";
615 localURI = fgkLocalCDB;
616 } else if(gridURI == fgkMainRefStorage) {
617 type = "reference";
618 localURI = fgkLocalRefStorage;
619 } else {
620 AliError(Form("Invalid storage URI: %s", gridURI.Data()));
621 return kFALSE;
622 }
623
624 AliCDBManager* man = AliCDBManager::Instance();
625
626 AliCDBStorage *gridSto = man->GetStorage(gridURI);
627 if(!gridSto) {
628 Log("SHUTTLE",
629 Form("StoreOCDB - cannot activate main %s storage", type));
630 return kFALSE;
631 }
632
633 gridIds = gridSto->GetQueryCDBList();
634
635 // get objects previously stored in local CDB
636 AliCDBStorage *localSto = man->GetStorage(localURI);
637 if(!localSto) {
638 Log("SHUTTLE",
639 Form("StoreOCDB - cannot activate local %s storage", type));
640 return kFALSE;
641 }
642 AliCDBPath aPath(GetOfflineDetName(fCurrentDetector.Data()),"*","*");
643 // Local objects were stored with current run as Grid version!
644 TList* localEntries = localSto->GetAll(aPath.GetPath(), GetCurrentRun(), GetCurrentRun());
645 localEntries->SetOwner(1);
646
647 // loop on local stored objects
648 TIter localIter(localEntries);
649 AliCDBEntry *aLocEntry = 0;
650 while((aLocEntry = dynamic_cast<AliCDBEntry*> (localIter.Next()))){
651 aLocEntry->SetOwner(1);
652 AliCDBId aLocId = aLocEntry->GetId();
653 aLocEntry->SetVersion(-1);
654 aLocEntry->SetSubVersion(-1);
655
656 // If local object is valid up to infinity we store it only if it is
657 // the first unprocessed run!
658 if (aLocId.GetLastRun() == AliCDBRunRange::Infinity() &&
659 !fFirstUnprocessed[GetDetPos(fCurrentDetector)])
660 {
661 Log("SHUTTLE", Form("StoreOCDB - %s: object %s has validity infinite but "
662 "there are previous unprocessed runs!",
663 fCurrentDetector.Data(), aLocId.GetPath().Data()));
4508ab8b 664 result = kFALSE;
3301427a 665 continue;
666 }
667
668 // loop on Grid valid Id's
669 Bool_t store = kTRUE;
670 TIter gridIter(gridIds);
671 AliCDBId* aGridId = 0;
672 while((aGridId = dynamic_cast<AliCDBId*> (gridIter.Next()))){
673 if(aGridId->GetPath() != aLocId.GetPath()) continue;
674 // skip all objects valid up to infinity
675 if(aGridId->GetLastRun() == AliCDBRunRange::Infinity()) continue;
676 // if we get here, it means there's already some more recent object stored on Grid!
677 store = kFALSE;
678 break;
679 }
680
681 // If we get here, the file can be stored!
682 Bool_t storeOk = gridSto->Put(aLocEntry);
683 if(!store || storeOk){
684
685 if (!store)
686 {
687 Log(fCurrentDetector.Data(),
688 Form("StoreOCDB - A more recent object already exists in %s storage: <%s>",
689 type, aGridId->ToString().Data()));
690 } else {
691 Log("SHUTTLE",
692 Form("StoreOCDB - Object <%s> successfully put into %s storage",
693 aLocId.ToString().Data(), type));
2d9019b4 694 Log(fCurrentDetector.Data(),
695 Form("StoreOCDB - Object <%s> successfully put into %s storage",
696 aLocId.ToString().Data(), type));
3301427a 697 }
84090f85 698
3301427a 699 // removing local filename...
700 TString filename;
701 localSto->IdToFilename(aLocId, filename);
c88ad5db 702 Log("SHUTTLE", Form("StoreOCDB - Removing local file %s", filename.Data()));
3301427a 703 RemoveFile(filename.Data());
704 continue;
705 } else {
706 Log("SHUTTLE",
707 Form("StoreOCDB - Grid %s storage of object <%s> failed",
708 type, aLocId.ToString().Data()));
2d9019b4 709 Log(fCurrentDetector.Data(),
710 Form("StoreOCDB - Grid %s storage of object <%s> failed",
711 type, aLocId.ToString().Data()));
3301427a 712 result = kFALSE;
b948db8d 713 }
714 }
3301427a 715 localEntries->Clear();
2bb7b766 716
b948db8d 717 return result;
3301427a 718}
719
546242fb 720//______________________________________________________________________________________________
721Bool_t AliShuttle::CleanReferenceStorage(const char* detector)
722{
2d9019b4 723 // clears the directory used to store reference files of a given subdetector
546242fb 724
725 AliCDBManager* man = AliCDBManager::Instance();
726 AliCDBStorage* sto = man->GetStorage(fgkLocalRefStorage);
2d9019b4 727 TString localBaseFolder = sto->GetBaseFolder();
728
729 TString targetDir = GetRefFilePrefix(localBaseFolder.Data(), detector);
730
d524ade6 731 Log("SHUTTLE", Form("CleanReferenceStorage - Cleaning %s", targetDir.Data()));
2d9019b4 732
733 TString begin;
734 begin.Form("%d_", GetCurrentRun());
735
736 TSystemDirectory* baseDir = new TSystemDirectory("/", targetDir);
737 if (!baseDir)
738 return kTRUE;
739
740 TList* dirList = baseDir->GetListOfFiles();
741 delete baseDir;
742
743 if (!dirList) return kTRUE;
744
745 if (dirList->GetEntries() < 3)
746 {
747 delete dirList;
748 return kTRUE;
749 }
750
751 Int_t nDirs = 0, nDel = 0;
752 TIter dirIter(dirList);
753 TSystemFile* entry = 0;
546242fb 754
2d9019b4 755 Bool_t success = kTRUE;
546242fb 756
2d9019b4 757 while ((entry = dynamic_cast<TSystemFile*> (dirIter.Next())))
758 {
759 if (entry->IsDirectory())
760 continue;
761
762 TString fileName(entry->GetName());
763 if (!fileName.BeginsWith(begin))
764 continue;
765
766 nDirs++;
767
768 // delete file
769 Int_t result = gSystem->Unlink(fileName.Data());
770
771 if (result)
772 {
d524ade6 773 Log("SHUTTLE", Form("CleanReferenceStorage - Could not delete file %s!", fileName.Data()));
2d9019b4 774 success = kFALSE;
775 } else {
776 nDel++;
777 }
778 }
779
780 if(nDirs > 0)
781 Log("SHUTTLE", Form("CleanReferenceStorage - %d (over %d) reference files in folder %s were deleted.",
782 nDel, nDirs, targetDir.Data()));
783
784
785 delete dirList;
786 return success;
787
788
789
790
791
546242fb 792
793 Int_t result = gSystem->GetPathInfo(targetDir, 0, (Long64_t*) 0, 0, 0);
794 if (result == 0)
795 {
796 // delete directory
d524ade6 797 result = gSystem->Exec(Form("rm -rf %s", targetDir.Data()));
546242fb 798 if (result != 0)
799 {
d524ade6 800 Log("SHUTTLE", Form("CleanReferenceStorage - Could not clean directory %s", targetDir.Data()));
546242fb 801 return kFALSE;
802 }
803 }
804
805 result = gSystem->mkdir(targetDir, kTRUE);
806 if (result != 0)
807 {
c88ad5db 808 Log("SHUTTLE", Form("CleanReferenceStorage - Error creating base directory %s", targetDir.Data()));
546242fb 809 return kFALSE;
810 }
811
812 return kTRUE;
813}
814
9827400b 815//______________________________________________________________________________________________
816Bool_t AliShuttle::StoreReferenceFile(const char* detector, const char* localFile, const char* gridFileName)
817{
818 //
3c2a21c8 819 // Stores reference file directly (without opening it). This function stores the file locally.
9827400b 820 //
3c2a21c8 821 // The file is stored under the following location:
822 // <base folder of local reference storage>/<DET>/<RUN#>_<gridFileName>
823 // where <gridFileName> is the second parameter given to the function
824 //
9827400b 825
826 if (fTestMode & kErrorStorage)
827 {
828 Log(fCurrentDetector, "StoreReferenceFile - In TESTMODE - Simulating error while storing locally");
829 return kFALSE;
830 }
831
832 AliCDBManager* man = AliCDBManager::Instance();
833 AliCDBStorage* sto = man->GetStorage(fgkLocalRefStorage);
834
835 TString localBaseFolder = sto->GetBaseFolder();
836
d524ade6 837 TString target = GetRefFilePrefix(localBaseFolder.Data(), detector);
838 target.Append(Form("/%d_%s", GetCurrentRun(), gridFileName));
9827400b 839
d524ade6 840 return CopyFileLocally(localFile, target);
c88ad5db 841}
842
843//______________________________________________________________________________________________
844Bool_t AliShuttle::StoreRunMetadataFile(const char* localFile, const char* gridFileName)
845{
846 //
847 // Stores Run metadata file to the Grid, in the run folder
848 //
849 // Only GRP can call this function.
850
851 if (fTestMode & kErrorStorage)
852 {
853 Log(fCurrentDetector, "StoreRunMetaDataFile - In TESTMODE - Simulating error while storing locally");
854 return kFALSE;
855 }
856
857 AliCDBManager* man = AliCDBManager::Instance();
858 AliCDBStorage* sto = man->GetStorage(fgkLocalRefStorage);
859
860 TString localBaseFolder = sto->GetBaseFolder();
861
862 // Build Run level folder
863 // folder = /alice/data/year/lhcPeriod/runNb/Raw
864
c88ad5db 865
675f64cd 866 TString lhcPeriod = GetLHCPeriod();
c88ad5db 867 if (lhcPeriod.Length() == 0)
868 {
869 Log("SHUTTLE","StoreRunMetaDataFile - LHCPeriod not found in logbook!");
870 return 0;
871 }
872
675f64cd 873 TString target = Form("%s/GRP/RunMetadata/alice/data/%d/%s/%09d/Raw/%s",
874 localBaseFolder.Data(), GetCurrentYear(),
d524ade6 875 lhcPeriod.Data(), GetCurrentRun(), gridFileName);
c88ad5db 876
d524ade6 877 return CopyFileLocally(localFile, target);
c88ad5db 878}
879
880//______________________________________________________________________________________________
d524ade6 881Bool_t AliShuttle::CopyFileLocally(const char* localFile, const TString& target)
c88ad5db 882{
883 //
884 // Stores file locally. Called by StoreReferenceFile and StoreRunMetadataFile
d524ade6 885 // Files are temporarily stored in the local reference storage. When the preprocessor
886 // finishes, the Shuttle calls CopyFilesToGrid to transfer the files to AliEn
887 // (in reference or run level folders)
c88ad5db 888 //
889
d524ade6 890 TString targetDir(target(0, target.Last('/')));
891
892 //try to open base dir folder, if it does not exist
2d9019b4 893 void* dir = gSystem->OpenDirectory(targetDir.Data());
894 if (dir == NULL) {
895 if (gSystem->mkdir(targetDir.Data(), kTRUE)) {
c88ad5db 896 Log("SHUTTLE", Form("StoreFileLocally - Can't open directory <%s>", targetDir.Data()));
2d9019b4 897 return kFALSE;
898 }
899
900 } else {
901 gSystem->FreeDirectory(dir);
902 }
9827400b 903
7d43a416 904 Int_t result = 0;
905
906 result = gSystem->GetPathInfo(localFile, 0, (Long64_t*) 0, 0, 0);
9827400b 907 if (result)
908 {
c88ad5db 909 Log("SHUTTLE", Form("StoreFileLocally - %s does not exist", localFile));
546242fb 910 return kFALSE;
9827400b 911 }
546242fb 912
7d43a416 913 result = gSystem->GetPathInfo(target, 0, (Long64_t*) 0, 0, 0);
914 if (!result)
915 {
916 Log("SHUTTLE", Form("StoreFileLocally - target file %s already exist, removing...", target.Data()));
917 if (gSystem->Unlink(target.Data()))
918 {
919 Log("SHUTTLE", Form("StoreFileLocally - Could not remove existing target file %s!", target.Data()));
920 return kFALSE;
921 }
922 }
923
9827400b 924 result = gSystem->CopyFile(localFile, target);
925
926 if (result == 0)
927 {
c88ad5db 928 Log("SHUTTLE", Form("StoreFileLocally - File %s stored locally to %s", localFile, target.Data()));
9827400b 929 return kTRUE;
930 }
931 else
932 {
d524ade6 933 Log("SHUTTLE", Form("StoreFileLocally - Could not store file %s to %s! Error code = %d",
546242fb 934 localFile, target.Data(), result));
9827400b 935 return kFALSE;
936 }
c88ad5db 937
938
939
9827400b 940}
941
942//______________________________________________________________________________________________
c88ad5db 943Bool_t AliShuttle::CopyFilesToGrid(const char* type)
9827400b 944{
945 //
c88ad5db 946 // Transfers local files to the Grid. Local files can be reference files
947 // or run metadata file (from GRP only).
9827400b 948 //
c88ad5db 949 // According to the type (ref, metadata) the files are stored under the following location:
950 // ref --> <base folder of reference storage>/<DET>/<RUN#>_<gridFileName>
951 // metadata --> <run data folder>/<MetadataFileName>
86aa42c3 952 //
c88ad5db 953
9827400b 954 AliCDBManager* man = AliCDBManager::Instance();
955 AliCDBStorage* sto = man->GetStorage(fgkLocalRefStorage);
956 if (!sto)
957 return kFALSE;
958 TString localBaseFolder = sto->GetBaseFolder();
9827400b 959
c88ad5db 960 TString dir;
961 TString alienDir;
9827400b 962 TString begin;
9827400b 963
c88ad5db 964 if (strcmp(type, "reference") == 0)
965 {
966 dir = GetRefFilePrefix(localBaseFolder.Data(), fCurrentDetector.Data());
967 AliCDBStorage* gridSto = man->GetStorage(fgkMainRefStorage);
968 if (!gridSto)
969 return kFALSE;
970 TString gridBaseFolder = gridSto->GetBaseFolder();
971 alienDir = GetRefFilePrefix(gridBaseFolder.Data(), fCurrentDetector.Data());
972 begin = Form("%d_", GetCurrentRun());
973 }
974 else if (strcmp(type, "metadata") == 0)
975 {
c88ad5db 976
675f64cd 977 TString lhcPeriod = GetLHCPeriod();
c88ad5db 978
979 if (lhcPeriod.Length() == 0)
980 {
981 Log("SHUTTLE","CopyFilesToGrid - LHCPeriod not found in logbook!");
982 return 0;
983 }
984
675f64cd 985 dir = Form("%s/GRP/RunMetadata/alice/data/%d/%s/%09d/Raw",
986 localBaseFolder.Data(), GetCurrentYear(),
c88ad5db 987 lhcPeriod.Data(), GetCurrentRun());
675f64cd 988 alienDir = dir(dir.Index("/alice/data/"), dir.Length());
989
c88ad5db 990 begin = "";
991 }
992 else
993 {
994 Log("SHUTTLE", "CopyFilesToGrid - Unexpected: type label must be reference or metadata!");
995 return kFALSE;
996 }
997
9827400b 998 TSystemDirectory* baseDir = new TSystemDirectory("/", dir);
3d8bc902 999 if (!baseDir)
1000 return kTRUE;
1001
2d9019b4 1002 TList* dirList = baseDir->GetListOfFiles();
1003 delete baseDir;
1004
1005 if (!dirList) return kTRUE;
1006
1007 if (dirList->GetEntries() < 3)
3d8bc902 1008 {
2d9019b4 1009 delete dirList;
9827400b 1010 return kTRUE;
3d8bc902 1011 }
2d9019b4 1012
546242fb 1013 if (!gGrid)
1014 {
c88ad5db 1015 Log("SHUTTLE", "CopyFilesToGrid - Connection to Grid failed: Cannot continue!");
2d9019b4 1016 delete dirList;
546242fb 1017 return kFALSE;
1018 }
1019
2d9019b4 1020 Int_t nDirs = 0, nTransfer = 0;
1021 TIter dirIter(dirList);
1022 TSystemFile* entry = 0;
1023
9827400b 1024 Bool_t success = kTRUE;
3d8bc902 1025 Bool_t first = kTRUE;
9827400b 1026
2d9019b4 1027 while ((entry = dynamic_cast<TSystemFile*> (dirIter.Next())))
1028 {
9827400b 1029 if (entry->IsDirectory())
1030 continue;
1031
1032 TString fileName(entry->GetName());
1033 if (!fileName.BeginsWith(begin))
1034 continue;
1035
2d9019b4 1036 nDirs++;
1037
3d8bc902 1038 if (first)
1039 {
1040 first = kFALSE;
c88ad5db 1041 // check that folder exists, otherwise create it
3d8bc902 1042 TGridResult* result = gGrid->Ls(alienDir.Data(), "a");
1043
1044 if (!result)
2d9019b4 1045 {
1046 delete dirList;
3d8bc902 1047 return kFALSE;
2d9019b4 1048 }
3d8bc902 1049
546242fb 1050 if (!result->GetFileName(1)) // TODO: It looks like element 0 is always 0!!
3d8bc902 1051 {
675f64cd 1052 // TODO It does not work currently! Bug in TAliEn::Mkdir
1053 // TODO Manually fixed in local root v5-16-00
c88ad5db 1054 if (!gGrid->Mkdir(alienDir.Data(),"-p",0))
3d8bc902 1055 {
c88ad5db 1056 Log("SHUTTLE", Form("CopyFilesToGrid - Cannot create directory %s",
3d8bc902 1057 alienDir.Data()));
2d9019b4 1058 delete dirList;
3d8bc902 1059 return kFALSE;
546242fb 1060 } else {
c88ad5db 1061 Log("SHUTTLE",Form("CopyFilesToGrid - Folder %s created", alienDir.Data()));
3d8bc902 1062 }
1063
546242fb 1064 } else {
c88ad5db 1065 Log("SHUTTLE",Form("CopyFilesToGrid - Folder %s found", alienDir.Data()));
3d8bc902 1066 }
1067 }
1068
9827400b 1069 TString fullLocalPath;
1070 fullLocalPath.Form("%s/%s", dir.Data(), fileName.Data());
1071
1072 TString fullGridPath;
1073 fullGridPath.Form("alien://%s/%s", alienDir.Data(), fileName.Data());
1074
a986b218 1075 Bool_t result = TFile::Cp(fullLocalPath, fullGridPath);
9827400b 1076
1077 if (result)
1078 {
c88ad5db 1079 Log("SHUTTLE", Form("CopyFilesToGrid - Copying local file %s to %s succeeded!",
1080 fullLocalPath.Data(), fullGridPath.Data()));
9827400b 1081 RemoveFile(fullLocalPath);
2d9019b4 1082 nTransfer++;
9827400b 1083 }
1084 else
1085 {
c88ad5db 1086 Log("SHUTTLE", Form("CopyFilesToGrid - Copying local file %s to %s FAILED!",
1087 fullLocalPath.Data(), fullGridPath.Data()));
9827400b 1088 success = kFALSE;
1089 }
1090 }
2d9019b4 1091
c88ad5db 1092 Log("SHUTTLE", Form("CopyFilesToGrid - %d (over %d) files in folder %s copied to Grid.",
1093 nTransfer, nDirs, dir.Data()));
2d9019b4 1094
1095
1096 delete dirList;
9827400b 1097 return success;
1098}
1099
2d9019b4 1100//______________________________________________________________________________________________
1101const char* AliShuttle::GetRefFilePrefix(const char* base, const char* detector)
1102{
1103 //
1104 // Get folder name of reference files
1105 //
1106
1107 TString offDetStr(GetOfflineDetName(detector));
1108 TString dir;
1109 if (offDetStr == "ITS" || offDetStr == "MUON" || offDetStr == "PHOS")
1110 {
1111 dir.Form("%s/%s/%s", base, offDetStr.Data(), detector);
1112 } else {
1113 dir.Form("%s/%s", base, offDetStr.Data());
1114 }
1115
1116 return dir.Data();
1117
1118
1119}
c88ad5db 1120
3301427a 1121//______________________________________________________________________________________________
1122void AliShuttle::CleanLocalStorage(const TString& uri)
1123{
9827400b 1124 //
1125 // Called in case the preprocessor is declared failed. Remove remaining objects from the local storages.
1126 //
3301427a 1127
1128 const char* type = 0;
1129 if(uri == fgkLocalCDB) {
1130 type = "OCDB";
1131 } else if(uri == fgkLocalRefStorage) {
546242fb 1132 type = "Reference";
3301427a 1133 } else {
1134 AliError(Form("Invalid storage URI: %s", uri.Data()));
1135 return;
1136 }
1137
1138 AliCDBManager* man = AliCDBManager::Instance();
b948db8d 1139
3301427a 1140 // open local storage
1141 AliCDBStorage *localSto = man->GetStorage(uri);
1142 if(!localSto) {
1143 Log("SHUTTLE",
1144 Form("CleanLocalStorage - cannot activate local %s storage", type));
1145 return;
1146 }
1147
1148 TString filename(Form("%s/%s/*/Run*_v%d_s*.root",
546242fb 1149 localSto->GetBaseFolder().Data(), GetOfflineDetName(fCurrentDetector.Data()), GetCurrentRun()));
3301427a 1150
c88ad5db 1151 AliDebug(2, Form("filename = %s", filename.Data()));
3301427a 1152
c88ad5db 1153 Log("SHUTTLE", Form("Removing remaining local files for run %d and detector %s ...",
3301427a 1154 GetCurrentRun(), fCurrentDetector.Data()));
1155
1156 RemoveFile(filename.Data());
1157
1158}
1159
1160//______________________________________________________________________________________________
1161void AliShuttle::RemoveFile(const char* filename)
1162{
9827400b 1163 //
1164 // removes local file
1165 //
3301427a 1166
1167 TString command(Form("rm -f %s", filename));
1168
1169 Int_t result = gSystem->Exec(command.Data());
1170 if(result != 0)
1171 {
1172 Log("SHUTTLE", Form("RemoveFile - %s: Cannot remove file %s!",
1173 fCurrentDetector.Data(), filename));
1174 }
73abe331 1175}
1176
b948db8d 1177//______________________________________________________________________________________________
5164a766 1178AliShuttleStatus* AliShuttle::ReadShuttleStatus()
1179{
9827400b 1180 //
1181 // Reads the AliShuttleStatus from the CDB
1182 //
5164a766 1183
2bb7b766 1184 if (fStatusEntry){
1185 delete fStatusEntry;
1186 fStatusEntry = 0;
1187 }
5164a766 1188
10a5a932 1189 fStatusEntry = AliCDBManager::Instance()->GetStorage(GetLocalCDB())
2bb7b766 1190 ->Get(Form("/SHUTTLE/STATUS/%s", fCurrentDetector.Data()), GetCurrentRun());
5164a766 1191
2bb7b766 1192 if (!fStatusEntry) return 0;
1193 fStatusEntry->SetOwner(1);
5164a766 1194
2bb7b766 1195 AliShuttleStatus* status = dynamic_cast<AliShuttleStatus*> (fStatusEntry->GetObject());
1196 if (!status) {
1197 AliError("Invalid object stored to CDB!");
1198 return 0;
1199 }
5164a766 1200
2bb7b766 1201 return status;
5164a766 1202}
1203
1204//______________________________________________________________________________________________
7bfb2090 1205Bool_t AliShuttle::WriteShuttleStatus(AliShuttleStatus* status)
5164a766 1206{
9827400b 1207 //
1208 // writes the status for one subdetector
1209 //
2bb7b766 1210
1211 if (fStatusEntry){
1212 delete fStatusEntry;
1213 fStatusEntry = 0;
1214 }
5164a766 1215
2bb7b766 1216 Int_t run = GetCurrentRun();
5164a766 1217
2bb7b766 1218 AliCDBId id(AliCDBPath("SHUTTLE", "STATUS", fCurrentDetector), run, run);
5164a766 1219
2bb7b766 1220 fStatusEntry = new AliCDBEntry(status, id, new AliCDBMetaData);
1221 fStatusEntry->SetOwner(1);
5164a766 1222
2bb7b766 1223 UInt_t result = AliCDBManager::Instance()->GetStorage(fgkLocalCDB)->Put(fStatusEntry);
7bfb2090 1224
2bb7b766 1225 if (!result) {
3301427a 1226 Log("SHUTTLE", Form("WriteShuttleStatus - Failed for %s, run %d",
1227 fCurrentDetector.Data(), run));
2bb7b766 1228 return kFALSE;
1229 }
e7f62f16 1230
1231 SendMLInfo();
7bfb2090 1232
2bb7b766 1233 return kTRUE;
5164a766 1234}
1235
1236//______________________________________________________________________________________________
1237void AliShuttle::UpdateShuttleStatus(AliShuttleStatus::Status newStatus, Bool_t increaseCount)
1238{
9827400b 1239 //
1240 // changes the AliShuttleStatus for the given detector and run to the given status
1241 //
5164a766 1242
2bb7b766 1243 if (!fStatusEntry){
1244 AliError("UNEXPECTED: fStatusEntry empty");
1245 return;
1246 }
5164a766 1247
2bb7b766 1248 AliShuttleStatus* status = dynamic_cast<AliShuttleStatus*> (fStatusEntry->GetObject());
5164a766 1249
2bb7b766 1250 if (!status){
c88ad5db 1251 Log("SHUTTLE", "UpdateShuttleStatus - UNEXPECTED: status could not be read from current CDB entry");
2bb7b766 1252 return;
1253 }
5164a766 1254
2c15234c 1255 TString actionStr = Form("UpdateShuttleStatus - %s: Changing state from %s to %s",
eba76848 1256 fCurrentDetector.Data(),
36c99a6a 1257 status->GetStatusName(),
eba76848 1258 status->GetStatusName(newStatus));
cb343cfd 1259 Log("SHUTTLE", actionStr);
1260 SetLastAction(actionStr);
5164a766 1261
2bb7b766 1262 status->SetStatus(newStatus);
1263 if (increaseCount) status->IncreaseCount();
5164a766 1264
2bb7b766 1265 AliCDBManager::Instance()->GetStorage(fgkLocalCDB)->Put(fStatusEntry);
e7f62f16 1266
1267 SendMLInfo();
5164a766 1268}
e7f62f16 1269
1270//______________________________________________________________________________________________
1271void AliShuttle::SendMLInfo()
1272{
1273 //
1274 // sends ML information about the current status of the current detector being processed
1275 //
1276
1277 AliShuttleStatus* status = dynamic_cast<AliShuttleStatus*> (fStatusEntry->GetObject());
1278
1279 if (!status){
3301427a 1280 Log("SHUTTLE", "SendMLInfo - UNEXPECTED: status could not be read from current CDB entry");
e7f62f16 1281 return;
1282 }
1283
1284 TMonaLisaText mlStatus(Form("%s_status", fCurrentDetector.Data()), status->GetStatusName());
1285 TMonaLisaValue mlRetryCount(Form("%s_count", fCurrentDetector.Data()), status->GetCount());
1286
1287 TList mlList;
1288 mlList.Add(&mlStatus);
1289 mlList.Add(&mlRetryCount);
1290
ee6f7523 1291 TString mlID;
1292 mlID.Form("%d", GetCurrentRun());
1293 fMonaLisa->SendParameters(&mlList, mlID);
e7f62f16 1294}
1295
5164a766 1296//______________________________________________________________________________________________
1297Bool_t AliShuttle::ContinueProcessing()
1298{
9827400b 1299 // this function reads the AliShuttleStatus information from CDB and
1300 // checks if the processing should be continued
1301 // if yes it returns kTRUE and updates the AliShuttleStatus with nextStatus
2bb7b766 1302
57c1a579 1303 if (!fConfig->HostProcessDetector(fCurrentDetector)) return kFALSE;
1304
1305 AliPreprocessor* aPreprocessor =
1306 dynamic_cast<AliPreprocessor*> (fPreprocessorMap.GetValue(fCurrentDetector));
1307 if (!aPreprocessor)
1308 {
c88ad5db 1309 Log("SHUTTLE", Form("ContinueProcessing - %s: no preprocessor registered", fCurrentDetector.Data()));
57c1a579 1310 return kFALSE;
1311 }
1312
2bb7b766 1313 AliShuttleLogbookEntry::Status entryStatus =
eba76848 1314 fLogbookEntry->GetDetectorStatus(fCurrentDetector);
2bb7b766 1315
1316 if(entryStatus != AliShuttleLogbookEntry::kUnprocessed) {
c88ad5db 1317 Log("SHUTTLE", Form("ContinueProcessing - %s is %s",
2bb7b766 1318 fCurrentDetector.Data(),
1319 fLogbookEntry->GetDetectorStatusName(entryStatus)));
1320 return kFALSE;
1321 }
1322
1323 // if we get here, according to Shuttle logbook subdetector is in UNPROCESSED state
be48e3ea 1324
1325 // check if current run is first unprocessed run for current detector
1326 if (fConfig->StrictRunOrder(fCurrentDetector) &&
1327 !fFirstUnprocessed[GetDetPos(fCurrentDetector)])
1328 {
86aa42c3 1329 if (fTestMode == kNone)
1330 {
c88ad5db 1331 Log("SHUTTLE", Form("ContinueProcessing - %s requires strict run ordering"
1332 " but this is not the first unprocessed run!"));
86aa42c3 1333 return kFALSE;
1334 }
1335 else
1336 {
c88ad5db 1337 Log("SHUTTLE", Form("ContinueProcessing - In TESTMODE - "
1338 "Although %s requires strict run ordering "
1339 "and this is not the first unprocessed run, "
1340 "the SHUTTLE continues"));
86aa42c3 1341 }
be48e3ea 1342 }
1343
2bb7b766 1344 AliShuttleStatus* status = ReadShuttleStatus();
1345 if (!status) {
1346 // first time
1347 Log("SHUTTLE", Form("ContinueProcessing - %s: Processing first time",
1348 fCurrentDetector.Data()));
1349 status = new AliShuttleStatus(AliShuttleStatus::kStarted);
1350 return WriteShuttleStatus(status);
1351 }
1352
1353 // The following two cases shouldn't happen if Shuttle Logbook was correctly updated.
1354 // If it happens it may mean Logbook updating failed... let's do it now!
1355 if (status->GetStatus() == AliShuttleStatus::kDone ||
1356 status->GetStatus() == AliShuttleStatus::kFailed){
1357 Log("SHUTTLE", Form("ContinueProcessing - %s is already %s. Updating Shuttle Logbook",
1358 fCurrentDetector.Data(),
1359 status->GetStatusName(status->GetStatus())));
1360 UpdateShuttleLogbook(fCurrentDetector.Data(),
1361 status->GetStatusName(status->GetStatus()));
1362 return kFALSE;
1363 }
1364
3301427a 1365 if (status->GetStatus() == AliShuttleStatus::kStoreError) {
2bb7b766 1366 Log("SHUTTLE",
c88ad5db 1367 Form("ContinueProcessing - %s: Grid storage of one or more "
1368 "objects failed. Trying again now",
2bb7b766 1369 fCurrentDetector.Data()));
9827400b 1370 UpdateShuttleStatus(AliShuttleStatus::kStoreStarted);
1371 if (StoreOCDB()){
c88ad5db 1372 Log("SHUTTLE", Form("ContinueProcessing - %s: all objects "
1373 "successfully stored into main storage",
3301427a 1374 fCurrentDetector.Data()));
2bb7b766 1375 } else {
1376 Log("SHUTTLE",
1377 Form("ContinueProcessing - %s: Grid storage failed again",
1378 fCurrentDetector.Data()));
9827400b 1379 UpdateShuttleStatus(AliShuttleStatus::kStoreError);
2bb7b766 1380 }
1381 return kFALSE;
1382 }
1383
1384 // if we get here, there is a restart
57c1a579 1385 Bool_t cont = kFALSE;
2bb7b766 1386
1387 // abort conditions
cb343cfd 1388 if (status->GetCount() >= fConfig->GetMaxRetries()) {
57c1a579 1389 Log("SHUTTLE", Form("ContinueProcessing - %s failed %d times in status %s - "
1390 "Updating Shuttle Logbook", fCurrentDetector.Data(),
2bb7b766 1391 status->GetCount(), status->GetStatusName()));
1392 UpdateShuttleLogbook(fCurrentDetector.Data(), "FAILED");
e7f62f16 1393 UpdateShuttleStatus(AliShuttleStatus::kFailed);
3301427a 1394
1395 // there may still be objects in local OCDB and reference storage
1396 // and FXS databases may be not updated: do it now!
9827400b 1397
1398 // TODO Currently disabled, we want to keep files in case of failure!
1399 // CleanLocalStorage(fgkLocalCDB);
1400 // CleanLocalStorage(fgkLocalRefStorage);
1401 // UpdateTableFailCase();
1402
1403 // Send mail to detector expert!
c88ad5db 1404 Log("SHUTTLE", Form("ContinueProcessing - Sending mail to %s expert...",
1405 fCurrentDetector.Data()));
9827400b 1406 if (!SendMail())
1407 Log("SHUTTLE", Form("ContinueProcessing - Could not send mail to %s expert",
1408 fCurrentDetector.Data()));
3301427a 1409
57c1a579 1410 } else {
1411 Log("SHUTTLE", Form("ContinueProcessing - %s: restarting. "
1412 "Aborted before with %s. Retry number %d.", fCurrentDetector.Data(),
1413 status->GetStatusName(), status->GetCount()));
9827400b 1414 Bool_t increaseCount = kTRUE;
c88ad5db 1415 if (status->GetStatus() == AliShuttleStatus::kDCSError ||
1416 status->GetStatus() == AliShuttleStatus::kDCSStarted)
1417 increaseCount = kFALSE;
675f64cd 1418
9827400b 1419 UpdateShuttleStatus(AliShuttleStatus::kStarted, increaseCount);
57c1a579 1420 cont = kTRUE;
2bb7b766 1421 }
1422
57c1a579 1423 return cont;
5164a766 1424}
1425
1426//______________________________________________________________________________________________
2bb7b766 1427Bool_t AliShuttle::Process(AliShuttleLogbookEntry* entry)
58bc3020 1428{
73abe331 1429 //
b948db8d 1430 // Makes data retrieval for all detectors in the configuration.
2bb7b766 1431 // entry: Shuttle logbook entry, contains run paramenters and status of detectors
1432 // (Unprocessed, Inactive, Failed or Done).
d477ad88 1433 // Returns kFALSE in case of error occured and kTRUE otherwise
73abe331 1434 //
1435
9827400b 1436 if (!entry) return kFALSE;
2bb7b766 1437
1438 fLogbookEntry = entry;
1439
c88ad5db 1440 Log("SHUTTLE", Form("\t\t\t^*^*^*^*^*^*^*^*^*^*^*^* run %d: START ^*^*^*^*^*^*^*^*^*^*^*^*",
9827400b 1441 GetCurrentRun()));
2bb7b766 1442
e7f62f16 1443 // Send the information to ML
1444 TMonaLisaText mlStatus("SHUTTLE_status", "Processing");
9827400b 1445 TMonaLisaText mlRunType("SHUTTLE_runtype", Form("%s (%s)", entry->GetRunType(), entry->GetRunParameter("log")));
e7f62f16 1446
1447 TList mlList;
1448 mlList.Add(&mlStatus);
9827400b 1449 mlList.Add(&mlRunType);
e7f62f16 1450
ee6f7523 1451 TString mlID;
1452 mlID.Form("%d", GetCurrentRun());
1453 fMonaLisa->SendParameters(&mlList, mlID);
3301427a 1454
9827400b 1455 if (fLogbookEntry->IsDone())
1456 {
1457 Log("SHUTTLE","Process - Shuttle is already DONE. Updating logbook");
1458 UpdateShuttleLogbook("shuttle_done");
1459 fLogbookEntry = 0;
1460 return kTRUE;
1461 }
1462
1463 // read test mode if flag is set
1464 if (fReadTestMode)
1465 {
3d8bc902 1466 fTestMode = kNone;
9827400b 1467 TString logEntry(entry->GetRunParameter("log"));
1468 //printf("log entry = %s\n", logEntry.Data());
1469 TString searchStr("Testmode: ");
1470 Int_t pos = logEntry.Index(searchStr.Data());
1471 //printf("%d\n", pos);
1472 if (pos >= 0)
1473 {
1474 TSubString subStr = logEntry(pos + searchStr.Length(), logEntry.Length());
1475 //printf("%s\n", subStr.String().Data());
1476 TString newStr(subStr.Data());
1477 TObjArray* token = newStr.Tokenize(' ');
1478 if (token)
1479 {
1480 //token->Print();
1481 TObjString* tmpStr = dynamic_cast<TObjString*> (token->First());
1482 if (tmpStr)
1483 {
1484 Int_t testMode = tmpStr->String().Atoi();
1485 if (testMode > 0)
1486 {
c88ad5db 1487 Log("SHUTTLE", Form("Process - Enabling test mode %d", testMode));
9827400b 1488 SetTestMode((TestMode) testMode);
1489 }
1490 }
1491 delete token;
1492 }
1493 }
1494 }
c88ad5db 1495
eba76848 1496 fLogbookEntry->Print("all");
57f50b3c 1497
1498 // Initialization
d477ad88 1499 Bool_t hasError = kFALSE;
5164a766 1500
675f64cd 1501 // Set the CDB and Reference folders according to the year and LHC period
1502 TString lhcPeriod(GetLHCPeriod());
1503 if (lhcPeriod.Length() == 0)
1504 {
7d4cf768 1505 Log("SHUTTLE","Process - LHCPeriod not found in logbook!");
1506 return 0;
675f64cd 1507 }
1508
1509 if (fgkMainCDB.Length() == 0)
1510 fgkMainCDB = Form("alien://folder=/alice/data/%d/%s/OCDB?user=alidaq?cacheFold=/tmp/OCDBCache",
1511 GetCurrentYear(), lhcPeriod.Data());
1512
1513 if (fgkMainRefStorage.Length() == 0)
1514 fgkMainRefStorage = Form("alien://folder=/alice/data/%d/%s/Reference?user=alidaq?cacheFold=/tmp/OCDBCache",
1515 GetCurrentYear(), lhcPeriod.Data());
1516
57f50b3c 1517 // Loop on detectors in the configuration
b948db8d 1518 TIter iter(fConfig->GetDetectors());
2bb7b766 1519 TObjString* aDetector = 0;
b948db8d 1520
4508ab8b 1521 Bool_t first = kTRUE;
1522
be48e3ea 1523 while ((aDetector = (TObjString*) iter.Next()))
1524 {
7bfb2090 1525 fCurrentDetector = aDetector->String();
5164a766 1526
9e080f92 1527 if (ContinueProcessing() == kFALSE) continue;
4508ab8b 1528
1529 if (first)
1530 {
1531 // only read QueryCDB when needed and only once
1532 AliCDBStorage *mainCDBSto = AliCDBManager::Instance()->GetStorage(fgkMainCDB);
1533 if(mainCDBSto) mainCDBSto->QueryCDB(GetCurrentRun());
1534 AliCDBStorage *mainRefSto = AliCDBManager::Instance()->GetStorage(fgkMainRefStorage);
1535 if(mainRefSto) mainRefSto->QueryCDB(GetCurrentRun());
1536 first = kFALSE;
1537 }
9e080f92 1538
c88ad5db 1539 Log("SHUTTLE", Form("\t\t\t****** run %d - %s: START ******",
2bb7b766 1540 GetCurrentRun(), aDetector->GetName()));
1541
9d733021 1542 for(Int_t iSys=0;iSys<3;iSys++) fFXSCalled[iSys]=kFALSE;
1543
c88ad5db 1544 Log(fCurrentDetector.Data(), "Process - Starting processing");
85a80aa9 1545
be48e3ea 1546 Int_t pid = fork();
1547
1548 if (pid < 0)
1549 {
c88ad5db 1550 Log("SHUTTLE", "Process - ERROR: Forking failed");
be48e3ea 1551 }
1552 else if (pid > 0)
1553 {
1554 // parent
c88ad5db 1555 Log("SHUTTLE", Form("Process - In parent process of %d - %s: Starting monitoring",
be48e3ea 1556 GetCurrentRun(), aDetector->GetName()));
1557
1558 Long_t begin = time(0);
1559
1560 int status; // to be used with waitpid, on purpose an int (not Int_t)!
1561 while (waitpid(pid, &status, WNOHANG) == 0)
1562 {
1563 Long_t expiredTime = time(0) - begin;
1564
1565 if (expiredTime > fConfig->GetPPTimeOut())
1566 {
9827400b 1567 TString tmp;
c88ad5db 1568 tmp.Form("Process - Process of %s time out. "
1569 "Run time: %d seconds. Killing...",
1570 fCurrentDetector.Data(), expiredTime);
9827400b 1571 Log("SHUTTLE", tmp);
1572 Log(fCurrentDetector, tmp);
be48e3ea 1573
1574 kill(pid, 9);
1575
3301427a 1576 UpdateShuttleStatus(AliShuttleStatus::kPPTimeOut);
be48e3ea 1577 hasError = kTRUE;
1578
1579 gSystem->Sleep(1000);
1580 }
1581 else
1582 {
be48e3ea 1583 gSystem->Sleep(1000);
9827400b 1584
1585 TString checkStr;
1586 checkStr.Form("ps -o vsize --pid %d | tail -n 1", pid);
1587 FILE* pipe = gSystem->OpenPipe(checkStr, "r");
1588 if (!pipe)
1589 {
c88ad5db 1590 Log("SHUTTLE", Form("Process - Error: "
1591 "Could not open pipe to %s", checkStr.Data()));
9827400b 1592 continue;
1593 }
1594
1595 char buffer[100];
1596 if (!fgets(buffer, 100, pipe))
1597 {
c88ad5db 1598 Log("SHUTTLE", "Process - Error: ps did not return anything");
9827400b 1599 gSystem->ClosePipe(pipe);
1600 continue;
1601 }
1602 gSystem->ClosePipe(pipe);
1603
1604 //Log("SHUTTLE", Form("ps returned %s", buffer));
1605
1606 Int_t mem = 0;
1607 if ((sscanf(buffer, "%d\n", &mem) != 1) || !mem)
1608 {
c88ad5db 1609 Log("SHUTTLE", "Process - Error: Could not parse output of ps");
9827400b 1610 continue;
1611 }
1612
1613 if (expiredTime % 60 == 0)
ee6f7523 1614 {
c88ad5db 1615 Log("SHUTTLE", Form("Process - %s: Checking process. "
1616 "Run time: %d seconds - Memory consumption: %d KB",
1617 fCurrentDetector.Data(), expiredTime, mem));
ee6f7523 1618 SendAlive();
1619 }
9827400b 1620
1621 if (mem > fConfig->GetPPMaxMem())
1622 {
1623 TString tmp;
c88ad5db 1624 tmp.Form("Process - Process exceeds maximum allowed memory "
1625 "(%d KB > %d KB). Killing...",
9827400b 1626 mem, fConfig->GetPPMaxMem());
1627 Log("SHUTTLE", tmp);
1628 Log(fCurrentDetector, tmp);
1629
1630 kill(pid, 9);
1631
1632 UpdateShuttleStatus(AliShuttleStatus::kPPOutOfMemory);
1633 hasError = kTRUE;
1634
1635 gSystem->Sleep(1000);
1636 }
be48e3ea 1637 }
1638 }
1639
c88ad5db 1640 Log("SHUTTLE", Form("Process - In parent process of %d - %s: Client has terminated.",
be48e3ea 1641 GetCurrentRun(), aDetector->GetName()));
1642
1643 if (WIFEXITED(status))
1644 {
1645 Int_t returnCode = WEXITSTATUS(status);
1646
c88ad5db 1647 Log("SHUTTLE", Form("Process - %s: the return code is %d", fCurrentDetector.Data(),
3301427a 1648 returnCode));
be48e3ea 1649
9827400b 1650 if (returnCode == 0) hasError = kTRUE;
be48e3ea 1651 }
1652 }
1653 else if (pid == 0)
1654 {
1655 // client
c88ad5db 1656 Log("SHUTTLE", Form("Process - In client process of %d - %s", GetCurrentRun(),
1657 aDetector->GetName()));
be48e3ea 1658
c88ad5db 1659 Log("SHUTTLE", Form("Process - Redirecting output to %s log",fCurrentDetector.Data()));
ffa29e93 1660
546242fb 1661 if ((freopen(GetLogFileName(fCurrentDetector), "a", stdout)) == 0)
ffa29e93 1662 {
c88ad5db 1663 Log("SHUTTLE", "Process - Could not freopen stdout");
ffa29e93 1664 }
1665 else
1666 {
1667 fOutputRedirected = kTRUE;
1668 if ((dup2(fileno(stdout), fileno(stderr))) < 0)
c88ad5db 1669 Log("SHUTTLE", "Process - Could not redirect stderr");
ffa29e93 1670
1671 }
1672
5bac2bde 1673 TString wd = gSystem->WorkingDirectory();
675f64cd 1674 TString tmpDir = Form("%s/%s_%d_process", GetShuttleTempDir(),
1675 fCurrentDetector.Data(), GetCurrentRun());
5bac2bde 1676
d524ade6 1677 Int_t result = gSystem->GetPathInfo(tmpDir.Data(), 0, (Long64_t*) 0, 0, 0);
1678 if (!result) // temp dir already exists!
1679 {
1680 Log(fCurrentDetector.Data(),
1681 Form("Process - %s dir already exists! Removing...", tmpDir.Data()));
1682 gSystem->Exec(Form("rm -rf %s",tmpDir.Data()));
675f64cd 1683 }
1684
1685 if (gSystem->mkdir(tmpDir.Data(), 1))
1686 {
1687 Log(fCurrentDetector.Data(), "Process - could not make temp directory!!");
1688 gSystem->Exit(1);
d524ade6 1689 }
1690
1691 if (!gSystem->ChangeDirectory(tmpDir.Data()))
1692 {
1693 Log(fCurrentDetector.Data(), "Process - could not change directory!!");
1694 gSystem->Exit(1);
1695 }
5bac2bde 1696
9827400b 1697 Bool_t success = ProcessCurrentDetector();
5bac2bde 1698
1699 gSystem->ChangeDirectory(wd.Data());
b1d18693 1700
9827400b 1701 if (success) // Preprocessor finished successfully!
1702 {
b1d18693 1703 // remove temporary folder
fbc112e3 1704 // temporary commented (JF)
1705 //gSystem->Exec(Form("rm -rf %s",tmpDir.Data()));
b1d18693 1706
3301427a 1707 // Update time_processed field in FXS DB
1708 if (UpdateTable() == kFALSE)
5bac2bde 1709 Log("SHUTTLE", Form("Process - %s: Could not update FXS databases!",
1710 fCurrentDetector.Data()));
3301427a 1711
1712 // Transfer the data from local storage to main storage (Grid)
1713 UpdateShuttleStatus(AliShuttleStatus::kStoreStarted);
1714 if (StoreOCDB() == kFALSE)
1715 {
c88ad5db 1716 Log("SHUTTLE",
1717 Form("\t\t\t****** run %d - %s: STORAGE ERROR ******",
3301427a 1718 GetCurrentRun(), aDetector->GetName()));
1719 UpdateShuttleStatus(AliShuttleStatus::kStoreError);
9827400b 1720 success = kFALSE;
3301427a 1721 } else {
c88ad5db 1722 Log("SHUTTLE",
1723 Form("\t\t\t****** run %d - %s: DONE ******",
3301427a 1724 GetCurrentRun(), aDetector->GetName()));
1725 UpdateShuttleStatus(AliShuttleStatus::kDone);
9827400b 1726 UpdateShuttleLogbook(fCurrentDetector, "DONE");
3301427a 1727 }
c88ad5db 1728 } else
1729 {
1730 Log("SHUTTLE",
1731 Form("\t\t\t****** run %d - %s: PP ERROR ******",
1732 GetCurrentRun(), aDetector->GetName()));
be48e3ea 1733 }
1734
4b95672b 1735 for (UInt_t iSys=0; iSys<3; iSys++)
1736 {
1737 if (fFXSCalled[iSys]) fFXSlist[iSys].Clear();
1738 }
1739
c88ad5db 1740 Log("SHUTTLE", Form("Process - Client process of %d - %s is exiting now with %d.",
9827400b 1741 GetCurrentRun(), aDetector->GetName(), success));
be48e3ea 1742
1743 // the client exits here
9827400b 1744 gSystem->Exit(success);
be48e3ea 1745
1746 AliError("We should never get here!!!");
1747 }
7bfb2090 1748 }
5164a766 1749
c88ad5db 1750 Log("SHUTTLE", Form("\t\t\t^*^*^*^*^*^*^*^*^*^*^*^* run %d: FINISH ^*^*^*^*^*^*^*^*^*^*^*^*",
2bb7b766 1751 GetCurrentRun()));
1752
1753 //check if shuttle is done for this run, if so update logbook
1754 TObjArray checkEntryArray;
1755 checkEntryArray.SetOwner(1);
9e080f92 1756 TString whereClause = Form("where run=%d", GetCurrentRun());
b0e53b15 1757 if (!QueryShuttleLogbook(whereClause.Data(), checkEntryArray) ||
1758 checkEntryArray.GetEntries() == 0) {
9e080f92 1759 Log("SHUTTLE", Form("Process - Warning: Cannot check status of run %d on Shuttle logbook!",
1760 GetCurrentRun()));
1761 return hasError == kFALSE;
1762 }
b948db8d 1763
9e080f92 1764 AliShuttleLogbookEntry* checkEntry = dynamic_cast<AliShuttleLogbookEntry*>
1765 (checkEntryArray.At(0));
2bb7b766 1766
9e080f92 1767 if (checkEntry)
1768 {
1769 if (checkEntry->IsDone())
be48e3ea 1770 {
9e080f92 1771 Log("SHUTTLE","Process - Shuttle is DONE. Updating logbook");
1772 UpdateShuttleLogbook("shuttle_done");
1773 }
1774 else
1775 {
1776 for (UInt_t iDet=0; iDet<NDetectors(); iDet++)
be48e3ea 1777 {
9e080f92 1778 if (checkEntry->GetDetectorStatus(iDet) == AliShuttleLogbookEntry::kUnprocessed)
be48e3ea 1779 {
9e080f92 1780 AliDebug(2, Form("Run %d: setting %s as \"not first time unprocessed\"",
1781 checkEntry->GetRun(), GetDetName(iDet)));
1782 fFirstUnprocessed[iDet] = kFALSE;
be48e3ea 1783 }
1784 }
2bb7b766 1785 }
1786 }
1787
1788 fLogbookEntry = 0;
85a80aa9 1789
a7160fe9 1790 return hasError == kFALSE;
73abe331 1791}
1792
b948db8d 1793//______________________________________________________________________________________________
9827400b 1794Bool_t AliShuttle::ProcessCurrentDetector()
73abe331 1795{
1796 //
2bb7b766 1797 // Makes data retrieval just for a specific detector (fCurrentDetector).
73abe331 1798 // Threre should be a configuration for this detector.
73abe331 1799
1d172743 1800 Log("SHUTTLE", Form("ProcessCurrentDetector - Retrieving values for %s, run %d",
1801 fCurrentDetector.Data(), GetCurrentRun()));
73abe331 1802
d524ade6 1803 TString wd = gSystem->WorkingDirectory();
1804
2d9019b4 1805 if (!CleanReferenceStorage(fCurrentDetector.Data()))
546242fb 1806 return kFALSE;
d524ade6 1807
1808 gSystem->ChangeDirectory(wd.Data());
1809
1d172743 1810 TMap* dcsMap = new TMap();
3301427a 1811
1812 // call preprocessor
1813 AliPreprocessor* aPreprocessor =
1814 dynamic_cast<AliPreprocessor*> (fPreprocessorMap.GetValue(fCurrentDetector));
1815
1816 aPreprocessor->Initialize(GetCurrentRun(), GetCurrentStartTime(), GetCurrentEndTime());
1817
1818 Bool_t processDCS = aPreprocessor->ProcessDCS();
d477ad88 1819
651fdaab 1820 if (!processDCS)
1821 {
1d172743 1822 Log(fCurrentDetector, "ProcessCurrentDetector -"
1823 " The preprocessor requested to skip the retrieval of DCS values");
651fdaab 1824 }
8b739301 1825 else if (fTestMode & kSkipDCS)
2c15234c 1826 {
1d172743 1827 Log(fCurrentDetector, "ProcessCurrentDetector - In TESTMODE: Skipping DCS processing");
9827400b 1828 }
1829 else if (fTestMode & kErrorDCS)
1830 {
1d172743 1831 Log(fCurrentDetector, "ProcessCurrentDetector - In TESTMODE: Simulating DCS error");
3d8bc902 1832 UpdateShuttleStatus(AliShuttleStatus::kDCSStarted);
9827400b 1833 UpdateShuttleStatus(AliShuttleStatus::kDCSError);
1d172743 1834 delete dcsMap;
9827400b 1835 return kFALSE;
2c15234c 1836 } else {
3301427a 1837
1838 UpdateShuttleStatus(AliShuttleStatus::kDCSStarted);
1839
1d172743 1840 // Query DCS archive
1841 Int_t nServers = fConfig->GetNServers(fCurrentDetector);
a038aa70 1842
1d172743 1843 for (int iServ=0; iServ<nServers; iServ++)
2c15234c 1844 {
1d172743 1845
1846 TString host(fConfig->GetDCSHost(fCurrentDetector, iServ));
1847 Int_t port = fConfig->GetDCSPort(fCurrentDetector, iServ);
542b6cc8 1848 Int_t multiSplit = fConfig->GetMultiSplit(fCurrentDetector, iServ);
1849
1790d4b7 1850 Log(fCurrentDetector, Form("ProcessCurrentDetector -"
1851 " Querying DCS Amanda server %s:%d (%d of %d)",
1852 host.Data(), port, iServ+1, nServers));
1d172743 1853
1854 TMap* aliasMap = 0;
1855 TMap* dpMap = 0;
1856
1857 if (fConfig->GetDCSAliases(fCurrentDetector, iServ)->GetEntries() > 0)
2c15234c 1858 {
1d172743 1859 aliasMap = GetValueSet(host, port,
542b6cc8 1860 fConfig->GetDCSAliases(fCurrentDetector, iServ),
1861 kAlias, multiSplit);
1d172743 1862 if (!aliasMap)
1863 {
1864 Log(fCurrentDetector,
1865 Form("ProcessCurrentDetector -"
675f64cd 1866 " Error retrieving DCS aliases from server %s."
1867 " Sending mail to DCS experts!", host.Data()));
1d172743 1868 UpdateShuttleStatus(AliShuttleStatus::kDCSError);
675f64cd 1869
b5f1c82e 1870 //if (!SendMailToDCS())
1871 // Log("SHUTTLE", Form("ProcessCurrentDetector - Could not send mail to DCS experts!"));
675f64cd 1872
a038aa70 1873 delete dcsMap;
1d172743 1874 return kFALSE;
1875 }
2c15234c 1876 }
a038aa70 1877
1d172743 1878 if (fConfig->GetDCSDataPoints(fCurrentDetector, iServ)->GetEntries() > 0)
a038aa70 1879 {
1d172743 1880 dpMap = GetValueSet(host, port,
542b6cc8 1881 fConfig->GetDCSDataPoints(fCurrentDetector, iServ),
1882 kDP, multiSplit);
1d172743 1883 if (!dpMap)
1884 {
1885 Log(fCurrentDetector,
1886 Form("ProcessCurrentDetector -"
675f64cd 1887 " Error retrieving DCS data points from server %s."
1888 " Sending mail to DCS experts!", host.Data()));
1d172743 1889 UpdateShuttleStatus(AliShuttleStatus::kDCSError);
675f64cd 1890
b5f1c82e 1891 //if (!SendMailToDCS())
1892 // Log("SHUTTLE", Form("ProcessCurrentDetector - Could not send mail to DCS experts!"));
675f64cd 1893
1d172743 1894 if (aliasMap) delete aliasMap;
1895 delete dcsMap;
1896 return kFALSE;
1897 }
a038aa70 1898 }
1d172743 1899
1900 // merge aliasMap and dpMap into dcsMap
1901 if(aliasMap) {
1902 TIter iter(aliasMap);
a038aa70 1903 TObjString* key = 0;
1904 while ((key = (TObjString*) iter.Next()))
1d172743 1905 dcsMap->Add(key, aliasMap->GetValue(key->String()));
1906
1907 aliasMap->SetOwner(kFALSE);
1908 delete aliasMap;
1909 }
1910
1911 if(dpMap) {
1912 TIter iter(dpMap);
1913 TObjString* key = 0;
1914 while ((key = (TObjString*) iter.Next()))
1915 dcsMap->Add(key, dpMap->GetValue(key->String()));
1916
1917 dpMap->SetOwner(kFALSE);
1918 delete dpMap;
a038aa70 1919 }
73abe331 1920 }
1921 }
dc25836b 1922
b1d18693 1923 // save map into file, to help debugging in case of preprocessor error
fbc112e3 1924 TFile* f = TFile::Open("DCSMap.root","recreate");
b1d18693 1925 f->cd();
1926 dcsMap->Write("DCSMap", TObject::kSingleKey);
1927 f->Close();
fbc112e3 1928 delete f;
b1d18693 1929
2bb7b766 1930 // DCS Archive DB processing successful. Call Preprocessor!
85a80aa9 1931 UpdateShuttleStatus(AliShuttleStatus::kPPStarted);
a7160fe9 1932
a038aa70 1933 UInt_t returnValue = aPreprocessor->Process(dcsMap);
b948db8d 1934
3301427a 1935 if (returnValue > 0) // Preprocessor error!
1936 {
c88ad5db 1937 Log(fCurrentDetector, Form("ProcessCurrentDetector - "
1938 "Preprocessor failed. Process returned %d.", returnValue));
cb343cfd 1939 UpdateShuttleStatus(AliShuttleStatus::kPPError);
a038aa70 1940 dcsMap->DeleteAll();
1941 delete dcsMap;
9827400b 1942 return kFALSE;
1943 }
1944
1945 // preprocessor ok!
1946 UpdateShuttleStatus(AliShuttleStatus::kPPDone);
1947 Log(fCurrentDetector, Form("ProcessCurrentDetector - %s preprocessor returned success",
1948 fCurrentDetector.Data()));
b948db8d 1949
a038aa70 1950 dcsMap->DeleteAll();
1951 delete dcsMap;
b948db8d 1952
9827400b 1953 return kTRUE;
2bb7b766 1954}
1955
fbc112e3 1956//______________________________________________________________________________________________
1957void AliShuttle::CountOpenRuns()
1958{
1959 // Query DAQ's Shuttle logbook and sends the number of open runs to ML
1960
1961 // check connection, in case connect
1962 if (!Connect(3))
1963 return;
1964
1965 TString sqlQuery;
1966 sqlQuery = Form("select count(*) from %s where shuttle_done=0", fConfig->GetShuttlelbTable());
1967
1968 TSQLResult* aResult = fServer[3]->Query(sqlQuery);
1969 if (!aResult) {
1970 AliError(Form("Can't execute query <%s>!", sqlQuery.Data()));
1971 return;
1972 }
1973
1974 AliDebug(2,Form("Query = %s", sqlQuery.Data()));
1975
1976 if (aResult->GetRowCount() == 0) {
1977 AliError(Form("No result for query %s received", sqlQuery.Data()));
1978 return;
1979 }
1980
1981 if (aResult->GetFieldCount() != 1) {
1982 AliError(Form("Invalid field count for query %s received", sqlQuery.Data()));
1983 return;
1984 }
1985
1986 TSQLRow* aRow = aResult->Next();
1987 if (!aRow) {
1988 AliError(Form("Could not receive result of query %s", sqlQuery.Data()));
1989 return;
1990 }
1991
1992 TString result(aRow->GetField(0), aRow->GetFieldLength(0));
1993 Int_t count = result.Atoi();
1994
1995 Log("SHUTTLE", Form("%d unprocessed runs", count));
1996
1997 delete aRow;
1998 delete aResult;
1999
2000 TMonaLisaValue mlStatus("SHUTTLE_openruns", count);
2001
2002 TList mlList;
2003 mlList.Add(&mlStatus);
2004
2005 fMonaLisa->SendParameters(&mlList, "__PROCESSINGINFO__");
2006}
2007
2bb7b766 2008//______________________________________________________________________________________________
2009Bool_t AliShuttle::QueryShuttleLogbook(const char* whereClause,
2010 TObjArray& entries)
2011{
9827400b 2012 // Query DAQ's Shuttle logbook and fills detector status object.
2013 // Call QueryRunParameters to query DAQ logbook for run parameters.
2014 //
2bb7b766 2015
fc5a4708 2016 entries.SetOwner(1);
2017
2bb7b766 2018 // check connection, in case connect
fbc112e3 2019 if (!Connect(3)) return kFALSE;
2bb7b766 2020
2021 TString sqlQuery;
441b0e9c 2022 sqlQuery = Form("select * from %s %s order by run", fConfig->GetShuttlelbTable(), whereClause);
2bb7b766 2023
be48e3ea 2024 TSQLResult* aResult = fServer[3]->Query(sqlQuery);
2bb7b766 2025 if (!aResult) {
2026 AliError(Form("Can't execute query <%s>!", sqlQuery.Data()));
2027 return kFALSE;
2028 }
2029
fc5a4708 2030 AliDebug(2,Form("Query = %s", sqlQuery.Data()));
2031
2bb7b766 2032 if(aResult->GetRowCount() == 0) {
c88ad5db 2033 Log("SHUTTLE", "No entries in Shuttle Logbook match request");
9827400b 2034 delete aResult;
2035 return kTRUE;
2bb7b766 2036 }
2037
2038 // TODO Check field count!
db99d43e 2039 const UInt_t nCols = 23;
2bb7b766 2040 if (aResult->GetFieldCount() != (Int_t) nCols) {
c88ad5db 2041 Log("SHUTTLE", "Invalid SQL result field number!");
2bb7b766 2042 delete aResult;
2043 return kFALSE;
2044 }
2045
2bb7b766 2046 TSQLRow* aRow;
2047 while ((aRow = aResult->Next())) {
2048 TString runString(aRow->GetField(0), aRow->GetFieldLength(0));
2049 Int_t run = runString.Atoi();
2050
eba76848 2051 AliShuttleLogbookEntry *entry = QueryRunParameters(run);
2052 if (!entry)
2053 continue;
2bb7b766 2054
2055 // loop on detectors
eba76848 2056 for(UInt_t ii = 0; ii < nCols; ii++)
2057 entry->SetDetectorStatus(aResult->GetFieldName(ii), aRow->GetField(ii));
2bb7b766 2058
eba76848 2059 entries.AddLast(entry);
2bb7b766 2060 delete aRow;
2061 }
2062
2bb7b766 2063 delete aResult;
2064 return kTRUE;
2065}
2066
2067//______________________________________________________________________________________________
eba76848 2068AliShuttleLogbookEntry* AliShuttle::QueryRunParameters(Int_t run)
2bb7b766 2069{
eba76848 2070 //
2071 // Retrieve run parameters written in the DAQ logbook and sets them into AliShuttleLogbookEntry object
2072 //
2bb7b766 2073
2074 // check connection, in case connect
be48e3ea 2075 if (!Connect(3))
eba76848 2076 return 0;
2bb7b766 2077
2078 TString sqlQuery;
2c15234c 2079 sqlQuery.Form("select * from %s where run=%d", fConfig->GetDAQlbTable(), run);
2bb7b766 2080
be48e3ea 2081 TSQLResult* aResult = fServer[3]->Query(sqlQuery);
2bb7b766 2082 if (!aResult) {
c88ad5db 2083 Log("SHUTTLE", Form("Can't execute query <%s>!", sqlQuery.Data()));
eba76848 2084 return 0;
2bb7b766 2085 }
2086
eba76848 2087 if (aResult->GetRowCount() == 0) {
2bb7b766 2088 Log("SHUTTLE", Form("QueryRunParameters - No entry in DAQ Logbook for run %d. Skipping", run));
2089 delete aResult;
eba76848 2090 return 0;
2bb7b766 2091 }
2092
eba76848 2093 if (aResult->GetRowCount() > 1) {
c88ad5db 2094 Log("SHUTTLE", Form("QueryRunParameters - UNEXPECTED: "
2095 "more than one entry in DAQ Logbook for run %d!", run));
2bb7b766 2096 delete aResult;
eba76848 2097 return 0;
2bb7b766 2098 }
2099
eba76848 2100 TSQLRow* aRow = aResult->Next();
2101 if (!aRow)
2102 {
c88ad5db 2103 Log("SHUTTLE", Form("QueryRunParameters - Could not retrieve row for run %d. Skipping", run));
eba76848 2104 delete aResult;
2105 return 0;
2106 }
2bb7b766 2107
eba76848 2108 AliShuttleLogbookEntry* entry = new AliShuttleLogbookEntry(run);
2bb7b766 2109
eba76848 2110 for (Int_t ii = 0; ii < aResult->GetFieldCount(); ii++)
2111 entry->SetRunParameter(aResult->GetFieldName(ii), aRow->GetField(ii));
2bb7b766 2112
eba76848 2113 UInt_t startTime = entry->GetStartTime();
2114 UInt_t endTime = entry->GetEndTime();
2115
1abfbb60 2116// if (!startTime || !endTime || startTime > endTime)
2117// {
2118// Log("SHUTTLE",
2119// Form("QueryRunParameters - Invalid parameters for Run %d: startTime = %d, endTime = %d. Skipping!",
2120// run, startTime, endTime));
2121//
2122// Log("SHUTTLE", Form("Marking SHUTTLE done for run %d", run));
2123// fLogbookEntry = entry;
2124// if (!UpdateShuttleLogbook("shuttle_done"))
2125// {
2126// AliError(Form("Could not update logbook for run %d !", run));
2127// }
2128// fLogbookEntry = 0;
2129//
2130// delete entry;
2131// delete aRow;
2132// delete aResult;
2133// return 0;
2134// }
2135
2136 if (!startTime)
b0e53b15 2137 {
2138 Log("SHUTTLE",
1abfbb60 2139 Form("QueryRunParameters - Invalid parameters for Run %d: "
2140 "startTime = %d, endTime = %d. Skipping!",
2141 run, startTime, endTime));
b0e53b15 2142
2143 Log("SHUTTLE", Form("Marking SHUTTLE done for run %d", run));
2144 fLogbookEntry = entry;
ee6f7523 2145 if (!UpdateShuttleLogbook("shuttle_ignored"))
b0e53b15 2146 {
2147 AliError(Form("Could not update logbook for run %d !", run));
2148 }
2149 fLogbookEntry = 0;
2150
2151 delete entry;
2152 delete aRow;
2153 delete aResult;
2154 return 0;
2155 }
2156
1abfbb60 2157 if (startTime && !endTime)
2158 {
2159 // TODO Here we don't mark SHUTTLE done, because this may mean
2160 //the run is still ongoing!!
2161 Log("SHUTTLE",
2162 Form("QueryRunParameters - Invalid parameters for Run %d: "
2163 "startTime = %d, endTime = %d. Skipping (Shuttle won't be marked as DONE)!",
2164 run, startTime, endTime));
2165
2166 //Log("SHUTTLE", Form("Marking SHUTTLE done for run %d", run));
2167 //fLogbookEntry = entry;
2168 //if (!UpdateShuttleLogbook("shuttle_done"))
2169 //{
2170 // AliError(Form("Could not update logbook for run %d !", run));
2171 //}
2172 //fLogbookEntry = 0;
2173
2174 delete entry;
2175 delete aRow;
2176 delete aResult;
2177 return 0;
2178 }
2179
2180 if (startTime && endTime && (startTime > endTime))
2181 {
2182 Log("SHUTTLE",
2183 Form("QueryRunParameters - Invalid parameters for Run %d: "
2184 "startTime = %d, endTime = %d. Skipping!",
2185 run, startTime, endTime));
2186
2187 Log("SHUTTLE", Form("Marking SHUTTLE done for run %d", run));
2188 fLogbookEntry = entry;
ee6f7523 2189 if (!UpdateShuttleLogbook("shuttle_ignored"))
1abfbb60 2190 {
2191 AliError(Form("Could not update logbook for run %d !", run));
2192 }
2193 fLogbookEntry = 0;
2194
2195 delete entry;
2196 delete aRow;
2197 delete aResult;
2198 return 0;
2199 }
2200
b0e53b15 2201 TString totEventsStr = entry->GetRunParameter("totalEvents");
2202 Int_t totEvents = totEventsStr.Atoi();
2203 if (totEvents < 1)
2204 {
eba76848 2205 Log("SHUTTLE",
b0e53b15 2206 Form("QueryRunParameters - Run %d has 0 events - Skipping!", run));
2207
2208 Log("SHUTTLE", Form("Marking SHUTTLE done for run %d", run));
2209 fLogbookEntry = entry;
97e3c167 2210 if (!UpdateShuttleLogbook("shuttle_ignored"))
b0e53b15 2211 {
2212 AliError(Form("Could not update logbook for run %d !", run));
2213 }
2214 fLogbookEntry = 0;
2215
eba76848 2216 delete entry;
2bb7b766 2217 delete aRow;
eba76848 2218 delete aResult;
2219 return 0;
2bb7b766 2220 }
2221
eba76848 2222 delete aRow;
2bb7b766 2223 delete aResult;
eba76848 2224
2225 return entry;
2bb7b766 2226}
2227
a038aa70 2228//______________________________________________________________________________________________
2229TMap* AliShuttle::GetValueSet(const char* host, Int_t port, const TSeqCollection* entries,
542b6cc8 2230 DCSType type, Int_t multiSplit)
a038aa70 2231{
2232 // Retrieve all "entry" data points from the DCS server
2233 // host, port: TSocket connection parameters
2234 // entries: list of name of the alias or data point
2235 // type: kAlias or kDP
2236 // returns TMap of values, 0 when failure
542b6cc8 2237
2238 AliDCSClient client(host, port, fTimeout, fRetries, multiSplit);
b41b252a 2239
a038aa70 2240 TMap* result = 0;
b41b252a 2241 if (type == kAlias)
a038aa70 2242 {
b41b252a 2243 result = client.GetAliasValues(entries, GetCurrentStartTime(),
2244 GetCurrentEndTime());
2245 }
2246 else if (type == kDP)
2247 {
2248 result = client.GetDPValues(entries, GetCurrentStartTime(),
2249 GetCurrentEndTime());
2250 }
a038aa70 2251
b41b252a 2252 if (result == 0)
2253 {
2254 Log(fCurrentDetector.Data(), Form("GetValueSet - Can't get entries! Reason: %s",
1790d4b7 2255 client.GetErrorString(client.GetResultErrorCode())));
2256 if (client.GetResultErrorCode() == AliDCSClient::fgkServerError)
2257 Log(fCurrentDetector.Data(), Form("GetValueSet - Server error code: %s",
2258 client.GetServerError().Data()));
a038aa70 2259
b41b252a 2260 return 0;
a038aa70 2261 }
b41b252a 2262
a038aa70 2263 return result;
2264}
b41b252a 2265
b948db8d 2266//______________________________________________________________________________________________
57f50b3c 2267const char* AliShuttle::GetFile(Int_t system, const char* detector,
2268 const char* id, const char* source)
b948db8d 2269{
9827400b 2270 // Get calibration file from file exchange servers
2271 // First queris the FXS database for the file name, using the run, detector, id and source info
2272 // then calls RetrieveFile(filename) for actual copy to local disk
2273 // run: current run being processed (given by Logbook entry fLogbookEntry)
2274 // detector: the Preprocessor name
2275 // id: provided as a parameter by the Preprocessor
2276 // source: provided by the Preprocessor through GetFileSources function
2277
2278 // check if test mode should simulate a FXS error
2279 if (fTestMode & kErrorFXSFiles)
2280 {
2281 Log(detector, Form("GetFile - In TESTMODE - Simulating error while connecting to %s FXS", GetSystemName(system)));
2282 return 0;
2283 }
2284
57f50b3c 2285 // check connection, in case connect
9d733021 2286 if (!Connect(system))
eba76848 2287 {
9d733021 2288 Log(detector, Form("GetFile - Couldn't connect to %s FXS database", GetSystemName(system)));
57f50b3c 2289 return 0;
2290 }
2291
2292 // Query preparation
9d733021 2293 TString sourceName(source);
d386d623 2294 Int_t nFields = 3;
2295 TString sqlQueryStart = Form("select filePath,size,fileChecksum from %s where",
2296 fConfig->GetFXSdbTable(system));
2297 TString whereClause = Form("run=%d and detector=\"%s\" and fileId=\"%s\"",
2298 GetCurrentRun(), detector, id);
2299
9d733021 2300 if (system == kDAQ)
2301 {
d386d623 2302 whereClause += Form(" and DAQsource=\"%s\"", source);
57f50b3c 2303 }
9d733021 2304 else if (system == kDCS)
eba76848 2305 {
9d733021 2306 sourceName="none";
57f50b3c 2307 }
9d733021 2308 else if (system == kHLT)
9e080f92 2309 {
d386d623 2310 whereClause += Form(" and DDLnumbers=\"%s\"", source);
9d733021 2311 nFields = 3;
9e080f92 2312 }
2313
9e080f92 2314 TString sqlQuery = Form("%s %s", sqlQueryStart.Data(), whereClause.Data());
2315
2316 AliDebug(2, Form("SQL query: \n%s",sqlQuery.Data()));
2317
2318 // Query execution
2319 TSQLResult* aResult = 0;
9d733021 2320 aResult = dynamic_cast<TSQLResult*> (fServer[system]->Query(sqlQuery));
9e080f92 2321 if (!aResult) {
9d733021 2322 Log(detector, Form("GetFileName - Can't execute SQL query to %s database for: id = %s, source = %s",
2323 GetSystemName(system), id, sourceName.Data()));
9e080f92 2324 return 0;
2325 }
2326
2327 if(aResult->GetRowCount() == 0)
2328 {
2329 Log(detector,
9d733021 2330 Form("GetFileName - No entry in %s FXS db for: id = %s, source = %s",
2331 GetSystemName(system), id, sourceName.Data()));
9e080f92 2332 delete aResult;
2333 return 0;
2334 }
2bb7b766 2335
9e080f92 2336 if (aResult->GetRowCount() > 1) {
2337 Log(detector,
9d733021 2338 Form("GetFileName - More than one entry in %s FXS db for: id = %s, source = %s",
2339 GetSystemName(system), id, sourceName.Data()));
9e080f92 2340 delete aResult;
2341 return 0;
2342 }
2343
9d733021 2344 if (aResult->GetFieldCount() != nFields) {
9e080f92 2345 Log(detector,
9d733021 2346 Form("GetFileName - Wrong field count in %s FXS db for: id = %s, source = %s",
2347 GetSystemName(system), id, sourceName.Data()));
9e080f92 2348 delete aResult;
2349 return 0;
2350 }
2351
2352 TSQLRow* aRow = dynamic_cast<TSQLRow*> (aResult->Next());
2353
2354 if (!aRow){
9d733021 2355 Log(detector, Form("GetFileName - Empty set result in %s FXS db from query: id = %s, source = %s",
2356 GetSystemName(system), id, sourceName.Data()));
9e080f92 2357 delete aResult;
2358 return 0;
2359 }
2360
2361 TString filePath(aRow->GetField(0), aRow->GetFieldLength(0));
2362 TString fileSize(aRow->GetField(1), aRow->GetFieldLength(1));
d386d623 2363 TString fileChecksum(aRow->GetField(2), aRow->GetFieldLength(2));
9e080f92 2364
2365 delete aResult;
2366 delete aRow;
2367
d386d623 2368 AliDebug(2, Form("filePath = %s; size = %s, fileChecksum = %s",
2369 filePath.Data(), fileSize.Data(), fileChecksum.Data()));
9e080f92 2370
9e080f92 2371 // retrieved file is renamed to make it unique
675f64cd 2372 TString localFileName = Form("%s/%s_%d_process/%s_%s_%d_%s_%s.shuttle",
2373 GetShuttleTempDir(), detector, GetCurrentRun(),
d524ade6 2374 GetSystemName(system), detector, GetCurrentRun(),
2375 id, sourceName.Data());
9d733021 2376
9e080f92 2377
9d733021 2378 // file retrieval from FXS
4b95672b 2379 UInt_t nRetries = 0;
2380 UInt_t maxRetries = 3;
2381 Bool_t result = kFALSE;
2382
2383 // copy!! if successful TSystem::Exec returns 0
2384 while(nRetries++ < maxRetries) {
2385 AliDebug(2, Form("Trying to copy file. Retry # %d", nRetries));
2386 result = RetrieveFile(system, filePath.Data(), localFileName.Data());
2387 if(!result)
2388 {
2389 Log(detector, Form("GetFileName - Copy of file %s from %s FXS failed",
9d733021 2390 filePath.Data(), GetSystemName(system)));
4b95672b 2391 continue;
4f0749a8 2392 }
9e080f92 2393
d386d623 2394 if (fileChecksum.Length()>0)
4b95672b 2395 {
2396 // compare md5sum of local file with the one stored in the FXS DB
d524ade6 2397 Int_t md5Comp = gSystem->Exec(Form("md5sum %s |grep %s 2>&1 > /dev/null",
2398 localFileName.Data(), fileChecksum.Data()));
9e080f92 2399
4b95672b 2400 if (md5Comp != 0)
2401 {
2402 Log(detector, Form("GetFileName - md5sum of file %s does not match with local copy!",
2403 filePath.Data()));
2404 result = kFALSE;
2405 continue;
2406 }
d386d623 2407 } else {
2408 Log(fCurrentDetector, Form("GetFile - md5sum of file %s not set in %s database, skipping comparison",
2409 filePath.Data(), GetSystemName(system)));
9d733021 2410 }
4b95672b 2411 if (result) break;
9e080f92 2412 }
2413
4b95672b 2414 if(!result) return 0;
2415
9d733021 2416 fFXSCalled[system]=kTRUE;
2417 TObjString *fileParams = new TObjString(Form("%s#!?!#%s", id, sourceName.Data()));
2418 fFXSlist[system].Add(fileParams);
9e080f92 2419
675f64cd 2420 static TString staticLocalFileName;
2421 staticLocalFileName.Form("%s", localFileName.Data());
2422
c88ad5db 2423 Log(fCurrentDetector, Form("GetFile - Retrieved file with id %s and "
2424 "source %s from %s to %s", id, source,
d524ade6 2425 GetSystemName(system), localFileName.Data()));
675f64cd 2426
d524ade6 2427 return staticLocalFileName.Data();
2bb7b766 2428}
2429
2430//______________________________________________________________________________________________
9d733021 2431Bool_t AliShuttle::RetrieveFile(UInt_t system, const char* fxsFileName, const char* localFileName)
9e080f92 2432{
9827400b 2433 //
2434 // Copies file from FXS to local Shuttle machine
2435 //
2bb7b766 2436
9e080f92 2437 // check temp directory: trying to cd to temp; if it does not exist, create it
d524ade6 2438 AliDebug(2, Form("Copy file %s from %s FXS into %s",
2439 GetSystemName(system), fxsFileName, localFileName));
2440
2441 TString tmpDir(localFileName);
2442
2443 tmpDir = tmpDir(0,tmpDir.Last('/'));
9e080f92 2444
d524ade6 2445 Int_t noDir = gSystem->GetPathInfo(tmpDir.Data(), 0, (Long64_t*) 0, 0, 0);
2446 if (noDir) // temp dir does not exists!
2447 {
2448 if (gSystem->mkdir(tmpDir.Data(), 1))
2449 {
2450 Log(fCurrentDetector.Data(), "RetrieveFile - could not make temp directory!!");
9e080f92 2451 return kFALSE;
2452 }
9e080f92 2453 }
2454
9d733021 2455 TString baseFXSFolder;
2456 if (system == kDAQ)
2457 {
2458 baseFXSFolder = "FES/";
2459 }
2460 else if (system == kDCS)
2461 {
2462 baseFXSFolder = "";
2463 }
2464 else if (system == kHLT)
2465 {
42fde080 2466 baseFXSFolder = "/opt/FXS/";
9d733021 2467 }
2468
2469
d524ade6 2470 TString command = Form("scp -oPort=%d -2 %s@%s:%s%s %s",
9d733021 2471 fConfig->GetFXSPort(system),
2472 fConfig->GetFXSUser(system),
2473 fConfig->GetFXSHost(system),
2474 baseFXSFolder.Data(),
2475 fxsFileName,
9e080f92 2476 localFileName);
2477
2478 AliDebug(2, Form("%s",command.Data()));
2479
4b95672b 2480 Bool_t result = (gSystem->Exec(command.Data()) == 0);
9e080f92 2481
4b95672b 2482 return result;
9e080f92 2483}
2484
2485//______________________________________________________________________________________________
9d733021 2486TList* AliShuttle::GetFileSources(Int_t system, const char* detector, const char* id)
2487{
9827400b 2488 //
2489 // Get sources producing the condition file Id from file exchange servers
4a33bdd9 2490 // if id is NULL all sources are returned (distinct)
9827400b 2491 //
1bcd28db 2492
2493 Log(detector, Form("GetFileSources - Retrieving sources with id %s from %s", id, GetSystemName(system)));
9827400b 2494
2495 // check if test mode should simulate a FXS error
2496 if (fTestMode & kErrorFXSSources)
2497 {
2498 Log(detector, Form("GetFileSources - In TESTMODE - Simulating error while connecting to %s FXS", GetSystemName(system)));
2499 return 0;
2500 }
2501
9d733021 2502 if (system == kDCS)
2503 {
c88ad5db 2504 Log(detector, "GetFileSources - WARNING: DCS system has only one source of data!");
6297b37d 2505 TList *list = new TList();
2506 list->SetOwner(1);
2507 list->Add(new TObjString(" "));
2508 return list;
9d733021 2509 }
9e080f92 2510
2511 // check connection, in case connect
9d733021 2512 if (!Connect(system))
2513 {
4a33bdd9 2514 Log(detector, Form("GetFileSources - Couldn't connect to %s FXS database", GetSystemName(system)));
9d733021 2515 return NULL;
9e080f92 2516 }
2517
9d733021 2518 TString sourceName = 0;
2519 if (system == kDAQ)
2520 {
2521 sourceName = "DAQsource";
2522 } else if (system == kHLT)
2523 {
2524 sourceName = "DDLnumbers";
2525 }
2526
4a33bdd9 2527 TString sqlQueryStart = Form("select distinct %s from %s where", sourceName.Data(), fConfig->GetFXSdbTable(system));
2528 TString whereClause = Form("run=%d and detector=\"%s\"",
2529 GetCurrentRun(), detector);
2530 if (id)
2531 whereClause += Form(" and fileId=\"%s\"", id);
9e080f92 2532 TString sqlQuery = Form("%s %s", sqlQueryStart.Data(), whereClause.Data());
2533
2534 AliDebug(2, Form("SQL query: \n%s",sqlQuery.Data()));
2535
2536 // Query execution
2537 TSQLResult* aResult;
9d733021 2538 aResult = fServer[system]->Query(sqlQuery);
9e080f92 2539 if (!aResult) {
9d733021 2540 Log(detector, Form("GetFileSources - Can't execute SQL query to %s database for id: %s",
2541 GetSystemName(system), id));
9e080f92 2542 return 0;
2543 }
2544
86aa42c3 2545 TList *list = new TList();
2546 list->SetOwner(1);
2547
9d733021 2548 if (aResult->GetRowCount() == 0)
2549 {
9e080f92 2550 Log(detector,
9d733021 2551 Form("GetFileSources - No entry in %s FXS table for id: %s", GetSystemName(system), id));
9e080f92 2552 delete aResult;
86aa42c3 2553 return list;
9e080f92 2554 }
2555
1bcd28db 2556 Log(detector, Form("GetFileSources - Found %d sources", aResult->GetRowCount()));
9e080f92 2557
1bcd28db 2558 TSQLRow* aRow;
9d733021 2559 while ((aRow = aResult->Next()))
2560 {
9e080f92 2561
9d733021 2562 TString source(aRow->GetField(0), aRow->GetFieldLength(0));
2563 AliDebug(2, Form("%s = %s", sourceName.Data(), source.Data()));
2564 list->Add(new TObjString(source));
9e080f92 2565 delete aRow;
2566 }
9d733021 2567
9e080f92 2568 delete aResult;
2569
2570 return list;
2bb7b766 2571}
2572
4a33bdd9 2573//______________________________________________________________________________________________
2574TList* AliShuttle::GetFileIDs(Int_t system, const char* detector, const char* source)
2575{
2576 //
2577 // Get all ids of condition files produced by a given source from file exchange servers
2578 //
2579
1bcd28db 2580 Log(detector, Form("GetFileIDs - Retrieving ids with source %s with %s", source, GetSystemName(system)));
2581
4a33bdd9 2582 // check if test mode should simulate a FXS error
2583 if (fTestMode & kErrorFXSSources)
2584 {
2585 Log(detector, Form("GetFileIDs - In TESTMODE - Simulating error while connecting to %s FXS", GetSystemName(system)));
2586 return 0;
2587 }
2588
2589 // check connection, in case connect
2590 if (!Connect(system))
2591 {
2592 Log(detector, Form("GetFileIDs - Couldn't connect to %s FXS database", GetSystemName(system)));
2593 return NULL;
2594 }
2595
2596 TString sourceName = 0;
2597 if (system == kDAQ)
2598 {
2599 sourceName = "DAQsource";
2600 } else if (system == kHLT)
2601 {
2602 sourceName = "DDLnumbers";
2603 }
2604
2605 TString sqlQueryStart = Form("select fileId from %s where", fConfig->GetFXSdbTable(system));
2606 TString whereClause = Form("run=%d and detector=\"%s\"",
2607 GetCurrentRun(), detector);
2608 if (sourceName.Length() > 0 && source)
2609 whereClause += Form(" and %s=\"%s\"", sourceName.Data(), source);
2610 TString sqlQuery = Form("%s %s", sqlQueryStart.Data(), whereClause.Data());
2611
2612 AliDebug(2, Form("SQL query: \n%s",sqlQuery.Data()));
2613
2614 // Query execution
2615 TSQLResult* aResult;
2616 aResult = fServer[system]->Query(sqlQuery);
2617 if (!aResult) {
2618 Log(detector, Form("GetFileIDs - Can't execute SQL query to %s database for source: %s",
2619 GetSystemName(system), source));
2620 return 0;
2621 }
2622
2623 TList *list = new TList();
2624 list->SetOwner(1);
2625
2626 if (aResult->GetRowCount() == 0)
2627 {
2628 Log(detector,
2629 Form("GetFileIDs - No entry in %s FXS table for source: %s", GetSystemName(system), source));
2630 delete aResult;
2631 return list;
2632 }
2633
1bcd28db 2634 Log(detector, Form("GetFileIDs - Found %d ids", aResult->GetRowCount()));
2635
4a33bdd9 2636 TSQLRow* aRow;
2637
2638 while ((aRow = aResult->Next()))
2639 {
2640
2641 TString id(aRow->GetField(0), aRow->GetFieldLength(0));
2642 AliDebug(2, Form("fileId = %s", id.Data()));
2643 list->Add(new TObjString(id));
2644 delete aRow;
2645 }
2646
2647 delete aResult;
2648
2649 return list;
2650}
2651
2bb7b766 2652//______________________________________________________________________________________________
9d733021 2653Bool_t AliShuttle::Connect(Int_t system)
2bb7b766 2654{
9827400b 2655 // Connect to MySQL Server of the system's FXS MySQL databases
2656 // DAQ Logbook, Shuttle Logbook and DAQ FXS db are on the same host
2657 //
57f50b3c 2658
9d733021 2659 // check connection: if already connected return
2660 if(fServer[system] && fServer[system]->IsConnected()) return kTRUE;
57f50b3c 2661
9d733021 2662 TString dbHost, dbUser, dbPass, dbName;
57f50b3c 2663
9d733021 2664 if (system < 3) // FXS db servers
2665 {
2666 dbHost = Form("mysql://%s:%d", fConfig->GetFXSdbHost(system), fConfig->GetFXSdbPort(system));
2667 dbUser = fConfig->GetFXSdbUser(system);
2668 dbPass = fConfig->GetFXSdbPass(system);
2669 dbName = fConfig->GetFXSdbName(system);
2670 } else { // Run & Shuttle logbook servers
2671 // TODO Will the Shuttle logbook server be the same as the Run logbook server ???
2672 dbHost = Form("mysql://%s:%d", fConfig->GetDAQlbHost(), fConfig->GetDAQlbPort());
2673 dbUser = fConfig->GetDAQlbUser();
2674 dbPass = fConfig->GetDAQlbPass();
2675 dbName = fConfig->GetDAQlbDB();
2676 }
57f50b3c 2677
9d733021 2678 fServer[system] = TSQLServer::Connect(dbHost.Data(), dbUser.Data(), dbPass.Data());
2679 if (!fServer[system] || !fServer[system]->IsConnected()) {
2680 if(system < 3)
2681 {
2682 AliError(Form("Can't establish connection to FXS database for %s",
2683 AliShuttleInterface::GetSystemName(system)));
2684 } else {
2685 AliError("Can't establish connection to Run logbook.");
57f50b3c 2686 }
9d733021 2687 if(fServer[system]) delete fServer[system];
2688 return kFALSE;
2bb7b766 2689 }
57f50b3c 2690
9d733021 2691 // Get tables
2692 TSQLResult* aResult=0;
2693 switch(system){
2694 case kDAQ:
2695 aResult = fServer[kDAQ]->GetTables(dbName.Data());
2696 break;
2697 case kDCS:
2698 aResult = fServer[kDCS]->GetTables(dbName.Data());
2699 break;
2700 case kHLT:
2701 aResult = fServer[kHLT]->GetTables(dbName.Data());
2702 break;
2703 default:
2704 aResult = fServer[3]->GetTables(dbName.Data());
2705 break;
2706 }
2707
2708 delete aResult;
2bb7b766 2709 return kTRUE;
2710}
57f50b3c 2711
9e080f92 2712//______________________________________________________________________________________________
9d733021 2713Bool_t AliShuttle::UpdateTable()
9e080f92 2714{
9827400b 2715 //
2716 // Update FXS table filling time_processed field in all rows corresponding to current run and detector
2717 //
9e080f92 2718
9d733021 2719 Bool_t result = kTRUE;
9e080f92 2720
9d733021 2721 for (UInt_t system=0; system<3; system++)
2722 {
2723 if(!fFXSCalled[system]) continue;
9e080f92 2724
9d733021 2725 // check connection, in case connect
2726 if (!Connect(system))
2727 {
2728 Log(fCurrentDetector, Form("UpdateTable - Couldn't connect to %s FXS database", GetSystemName(system)));
2729 result = kFALSE;
2730 continue;
9e080f92 2731 }
9e080f92 2732
9d733021 2733 TTimeStamp now; // now
2734
2735 // Loop on FXS list entries
2736 TIter iter(&fFXSlist[system]);
2737 TObjString *aFXSentry=0;
2738 while ((aFXSentry = dynamic_cast<TObjString*> (iter.Next())))
2739 {
2740 TString aFXSentrystr = aFXSentry->String();
2741 TObjArray *aFXSarray = aFXSentrystr.Tokenize("#!?!#");
2742 if (!aFXSarray || aFXSarray->GetEntries() != 2 )
2743 {
2744 Log(fCurrentDetector, Form("UpdateTable - error updating %s FXS entry. Check string: <%s>",
2745 GetSystemName(system), aFXSentrystr.Data()));
2746 if(aFXSarray) delete aFXSarray;
2747 result = kFALSE;
2748 continue;
2749 }
2750 const char* fileId = ((TObjString*) aFXSarray->At(0))->GetName();
2751 const char* source = ((TObjString*) aFXSarray->At(1))->GetName();
2752
2753 TString whereClause;
2754 if (system == kDAQ)
2755 {
2756 whereClause = Form("where run=%d and detector=\"%s\" and fileId=\"%s\" and DAQsource=\"%s\";",
2757 GetCurrentRun(), fCurrentDetector.Data(), fileId, source);
2758 }
2759 else if (system == kDCS)
2760 {
2761 whereClause = Form("where run=%d and detector=\"%s\" and fileId=\"%s\";",
2762 GetCurrentRun(), fCurrentDetector.Data(), fileId);
2763 }
2764 else if (system == kHLT)
2765 {
2766 whereClause = Form("where run=%d and detector=\"%s\" and fileId=\"%s\" and DDLnumbers=\"%s\";",
2767 GetCurrentRun(), fCurrentDetector.Data(), fileId, source);
2768 }
2769
2770 delete aFXSarray;
9e080f92 2771
9d733021 2772 TString sqlQuery = Form("update %s set time_processed=%d %s", fConfig->GetFXSdbTable(system),
2773 now.GetSec(), whereClause.Data());
9e080f92 2774
9d733021 2775 AliDebug(2, Form("SQL query: \n%s",sqlQuery.Data()));
9e080f92 2776
9d733021 2777 // Query execution
2778 TSQLResult* aResult;
2779 aResult = dynamic_cast<TSQLResult*> (fServer[system]->Query(sqlQuery));
2780 if (!aResult)
2781 {
2782 Log(fCurrentDetector, Form("UpdateTable - %s db: can't execute SQL query <%s>",
2783 GetSystemName(system), sqlQuery.Data()));
2784 result = kFALSE;
2785 continue;
2786 }
2787 delete aResult;
9e080f92 2788 }
9e080f92 2789 }
2790
9d733021 2791 return result;
9e080f92 2792}
57f50b3c 2793
3301427a 2794//______________________________________________________________________________________________
2795Bool_t AliShuttle::UpdateTableFailCase()
2796{
9827400b 2797 // Update FXS table filling time_processed field in all rows corresponding to current run and detector
2798 // this is called in case the preprocessor is declared failed for the current run, because
2799 // the fields are updated only in case of success
3301427a 2800
2801 Bool_t result = kTRUE;
2802
2803 for (UInt_t system=0; system<3; system++)
2804 {
2805 // check connection, in case connect
2806 if (!Connect(system))
2807 {
2808 Log(fCurrentDetector, Form("UpdateTableFailCase - Couldn't connect to %s FXS database",
2809 GetSystemName(system)));
2810 result = kFALSE;
2811 continue;
2812 }
2813
2814 TTimeStamp now; // now
2815
2816 // Loop on FXS list entries
2817
2818 TString whereClause = Form("where run=%d and detector=\"%s\";",
2819 GetCurrentRun(), fCurrentDetector.Data());
2820
2821
2822 TString sqlQuery = Form("update %s set time_processed=%d %s", fConfig->GetFXSdbTable(system),
2823 now.GetSec(), whereClause.Data());
2824
2825 AliDebug(2, Form("SQL query: \n%s",sqlQuery.Data()));
2826
2827 // Query execution
2828 TSQLResult* aResult;
2829 aResult = dynamic_cast<TSQLResult*> (fServer[system]->Query(sqlQuery));
2830 if (!aResult)
2831 {
2832 Log(fCurrentDetector, Form("UpdateTableFailCase - %s db: can't execute SQL query <%s>",
2833 GetSystemName(system), sqlQuery.Data()));
2834 result = kFALSE;
2835 continue;
2836 }
2837 delete aResult;
2838 }
2839
2840 return result;
2841}
2842
2bb7b766 2843//______________________________________________________________________________________________
2844Bool_t AliShuttle::UpdateShuttleLogbook(const char* detector, const char* status)
2845{
e7f62f16 2846 //
2847 // Update Shuttle logbook filling detector or shuttle_done column
2848 // ex. of usage: UpdateShuttleLogbook("PHOS", "DONE") or UpdateShuttleLogbook("shuttle_done")
2849 //
57f50b3c 2850
2bb7b766 2851 // check connection, in case connect
be48e3ea 2852 if(!Connect(3)){
2bb7b766 2853 Log("SHUTTLE", "UpdateShuttleLogbook - Couldn't connect to DAQ Logbook.");
2854 return kFALSE;
57f50b3c 2855 }
2856
2bb7b766 2857 TString detName(detector);
2858 TString setClause;
ee6f7523 2859 if (detName == "shuttle_done" || detName == "shuttle_ignored")
e7f62f16 2860 {
2bb7b766 2861 setClause = "set shuttle_done=1";
e7f62f16 2862
ee6f7523 2863 if (detName == "shuttle_done")
b0e53b15 2864 {
2865 // Send the information to ML
2866 TMonaLisaText mlStatus("SHUTTLE_status", "Done");
e7f62f16 2867
b0e53b15 2868 TList mlList;
2869 mlList.Add(&mlStatus);
2870
ee6f7523 2871 TString mlID;
2872 mlID.Form("%d", GetCurrentRun());
2873 fMonaLisa->SendParameters(&mlList, mlID);
b0e53b15 2874 }
2bb7b766 2875 } else {
2bb7b766 2876 TString statusStr(status);
2877 if(statusStr.Contains("done", TString::kIgnoreCase) ||
2878 statusStr.Contains("failed", TString::kIgnoreCase)){
eba76848 2879 setClause = Form("set %s=\"%s\"", detector, status);
2bb7b766 2880 } else {
2881 Log("SHUTTLE",
2882 Form("UpdateShuttleLogbook - Invalid status <%s> for detector %s",
2883 status, detector));
2884 return kFALSE;
2885 }
2886 }
57f50b3c 2887
2bb7b766 2888 TString whereClause = Form("where run=%d", GetCurrentRun());
2889
441b0e9c 2890 TString sqlQuery = Form("update %s %s %s",
2891 fConfig->GetShuttlelbTable(), setClause.Data(), whereClause.Data());
57f50b3c 2892
2bb7b766 2893 AliDebug(2, Form("SQL query: \n%s",sqlQuery.Data()));
2894
2895 // Query execution
2896 TSQLResult* aResult;
be48e3ea 2897 aResult = dynamic_cast<TSQLResult*> (fServer[3]->Query(sqlQuery));
2bb7b766 2898 if (!aResult) {
2899 Log("SHUTTLE", Form("UpdateShuttleLogbook - Can't execute query <%s>", sqlQuery.Data()));
2900 return kFALSE;
57f50b3c 2901 }
2bb7b766 2902 delete aResult;
57f50b3c 2903
2904 return kTRUE;
2905}
2906
2907//______________________________________________________________________________________________
2bb7b766 2908Int_t AliShuttle::GetCurrentRun() const
2909{
9827400b 2910 //
2911 // Get current run from logbook entry
2912 //
57f50b3c 2913
2bb7b766 2914 return fLogbookEntry ? fLogbookEntry->GetRun() : -1;
57f50b3c 2915}
2916
2917//______________________________________________________________________________________________
2bb7b766 2918UInt_t AliShuttle::GetCurrentStartTime() const
2919{
9827400b 2920 //
2921 // get current start time
2922 //
57f50b3c 2923
2bb7b766 2924 return fLogbookEntry ? fLogbookEntry->GetStartTime() : 0;
57f50b3c 2925}
2926
2927//______________________________________________________________________________________________
2bb7b766 2928UInt_t AliShuttle::GetCurrentEndTime() const
2929{
9827400b 2930 //
2931 // get current end time from logbook entry
2932 //
57f50b3c 2933
2bb7b766 2934 return fLogbookEntry ? fLogbookEntry->GetEndTime() : 0;
57f50b3c 2935}
2936
675f64cd 2937//______________________________________________________________________________________________
2938UInt_t AliShuttle::GetCurrentYear() const
2939{
2940 //
2941 // Get current year from logbook entry
2942 //
2943
2944 if (!fLogbookEntry) return 0;
2945
2946 TTimeStamp startTime(GetCurrentStartTime());
2947 TString year = Form("%d",startTime.GetDate());
2948 year = year(0,4);
2949
2950 return year.Atoi();
2951}
2952
2953//______________________________________________________________________________________________
2954const char* AliShuttle::GetLHCPeriod() const
2955{
2956 //
2957 // Get current LHC period from logbook entry
2958 //
2959
2960 if (!fLogbookEntry) return 0;
2961
2962 return fLogbookEntry->GetRunParameter("LHCperiod");
2963}
2964
b948db8d 2965//______________________________________________________________________________________________
2966void AliShuttle::Log(const char* detector, const char* message)
2967{
9827400b 2968 //
2969 // Fill log string with a message
2970 //
b948db8d 2971
7d4cf768 2972 TString logRunDir = GetShuttleLogDir();
2973 if (GetCurrentRun() >=0)
2974 logRunDir += Form("/%d", GetCurrentRun());
2975
2976 void* dir = gSystem->OpenDirectory(logRunDir.Data());
84090f85 2977 if (dir == NULL) {
7d4cf768 2978 if (gSystem->mkdir(logRunDir.Data(), kTRUE)) {
36c99a6a 2979 AliError(Form("Can't open directory <%s>", GetShuttleLogDir()));
84090f85 2980 return;
2981 }
b948db8d 2982
84090f85 2983 } else {
2984 gSystem->FreeDirectory(dir);
2985 }
b948db8d 2986
cb343cfd 2987 TString toLog = Form("%s (%d): %s - ", TTimeStamp(time(0)).AsString("s"), getpid(), detector);
e7f62f16 2988 if (GetCurrentRun() >= 0)
2989 toLog += Form("run %d - ", GetCurrentRun());
2bb7b766 2990 toLog += Form("%s", message);
2991
84090f85 2992 AliInfo(toLog.Data());
ffa29e93 2993
2994 // if we redirect the log output already to the file, leave here
2995 if (fOutputRedirected && strcmp(detector, "SHUTTLE") != 0)
2996 return;
b948db8d 2997
ffa29e93 2998 TString fileName = GetLogFileName(detector);
e7f62f16 2999
84090f85 3000 gSystem->ExpandPathName(fileName);
3001
3002 ofstream logFile;
3003 logFile.open(fileName, ofstream::out | ofstream::app);
3004
3005 if (!logFile.is_open()) {
3006 AliError(Form("Could not open file %s", fileName.Data()));
3007 return;
3008 }
7bfb2090 3009
84090f85 3010 logFile << toLog.Data() << "\n";
b948db8d 3011
84090f85 3012 logFile.close();
b948db8d 3013}
2bb7b766 3014
ffa29e93 3015//______________________________________________________________________________________________
3016TString AliShuttle::GetLogFileName(const char* detector) const
3017{
3018 //
3019 // returns the name of the log file for a given sub detector
3020 //
3021
3022 TString fileName;
3023
3024 if (GetCurrentRun() >= 0)
7d4cf768 3025 {
3026 fileName.Form("%s/%d/%s_%d.log", GetShuttleLogDir(), GetCurrentRun(),
3027 detector, GetCurrentRun());
3028 } else {
ffa29e93 3029 fileName.Form("%s/%s.log", GetShuttleLogDir(), detector);
7d4cf768 3030 }
ffa29e93 3031
3032 return fileName;
3033}
3034
ee6f7523 3035//______________________________________________________________________________________________
3036void AliShuttle::SendAlive()
3037{
3038 // sends alive message to ML
3039
3040 TMonaLisaText mlStatus("SHUTTLE_status", "Alive");
3041
3042 TList mlList;
3043 mlList.Add(&mlStatus);
3044
3045 fMonaLisa->SendParameters(&mlList, "__PROCESSINGINFO__");
3046}
3047
2bb7b766 3048//______________________________________________________________________________________________
3049Bool_t AliShuttle::Collect(Int_t run)
3050{
9827400b 3051 //
3052 // Collects conditions data for all UNPROCESSED run written to DAQ LogBook in case of run = -1 (default)
3053 // If a dedicated run is given this run is processed
3054 //
3055 // In operational mode, this is the Shuttle function triggered by the EOR signal.
3056 //
2bb7b766 3057
eba76848 3058 if (run == -1)
3059 Log("SHUTTLE","Collect - Shuttle called. Collecting conditions data for unprocessed runs");
3060 else
3061 Log("SHUTTLE", Form("Collect - Shuttle called. Collecting conditions data for run %d", run));
cb343cfd 3062
3063 SetLastAction("Starting");
2bb7b766 3064
ee6f7523 3065 // create ML instance
3066 if (!fMonaLisa)
3067 fMonaLisa = new TMonaLisaWriter(fConfig->GetMonitorHost(), fConfig->GetMonitorTable());
3068
ee6f7523 3069 SendAlive();
fbc112e3 3070 CountOpenRuns();
ee6f7523 3071
2bb7b766 3072 TString whereClause("where shuttle_done=0");
eba76848 3073 if (run != -1)
3074 whereClause += Form(" and run=%d", run);
2bb7b766 3075
3076 TObjArray shuttleLogbookEntries;
be48e3ea 3077 if (!QueryShuttleLogbook(whereClause, shuttleLogbookEntries))
3078 {
cb343cfd 3079 Log("SHUTTLE", "Collect - Can't retrieve entries from Shuttle logbook");
2bb7b766 3080 return kFALSE;
3081 }
3082
9e080f92 3083 if (shuttleLogbookEntries.GetEntries() == 0)
3084 {
3085 if (run == -1)
3086 Log("SHUTTLE","Collect - Found no UNPROCESSED runs in Shuttle logbook");
3087 else
3088 Log("SHUTTLE", Form("Collect - Run %d is already DONE "
3089 "or it does not exist in Shuttle logbook", run));
3090 return kTRUE;
3091 }
3092
be48e3ea 3093 for (UInt_t iDet=0; iDet<NDetectors(); iDet++)
3094 fFirstUnprocessed[iDet] = kTRUE;
3095
fc5a4708 3096 if (run != -1)
be48e3ea 3097 {
3098 // query Shuttle logbook for earlier runs, check if some detectors are unprocessed,
3099 // flag them into fFirstUnprocessed array
3100 TString whereClause(Form("where shuttle_done=0 and run < %d", run));
3101 TObjArray tmpLogbookEntries;
3102 if (!QueryShuttleLogbook(whereClause, tmpLogbookEntries))
3103 {
3104 Log("SHUTTLE", "Collect - Can't retrieve entries from Shuttle logbook");
3105 return kFALSE;
3106 }
3107
3108 TIter iter(&tmpLogbookEntries);
3109 AliShuttleLogbookEntry* anEntry = 0;
3110 while ((anEntry = dynamic_cast<AliShuttleLogbookEntry*> (iter.Next())))
3111 {
3112 for (UInt_t iDet=0; iDet<NDetectors(); iDet++)
3113 {
3114 if (anEntry->GetDetectorStatus(iDet) == AliShuttleLogbookEntry::kUnprocessed)
3115 {
3116 AliDebug(2, Form("Run %d: setting %s as \"not first time unprocessed\"",
3117 anEntry->GetRun(), GetDetName(iDet)));
3118 fFirstUnprocessed[iDet] = kFALSE;
3119 }
3120 }
3121
3122 }
3123
3124 }
3125
3126 if (!RetrieveConditionsData(shuttleLogbookEntries))
3127 {
cb343cfd 3128 Log("SHUTTLE", "Collect - Process of at least one run failed");
2bb7b766 3129 return kFALSE;
3130 }
3131
36c99a6a 3132 Log("SHUTTLE", "Collect - Requested run(s) successfully processed");
eba76848 3133 return kTRUE;
2bb7b766 3134}
3135
2bb7b766 3136//______________________________________________________________________________________________
3137Bool_t AliShuttle::RetrieveConditionsData(const TObjArray& dateEntries)
3138{
9827400b 3139 //
3140 // Retrieve conditions data for all runs that aren't processed yet
3141 //
2bb7b766 3142
3143 Bool_t hasError = kFALSE;
3144
3145 TIter iter(&dateEntries);
3146 AliShuttleLogbookEntry* anEntry;
3147
3148 while ((anEntry = (AliShuttleLogbookEntry*) iter.Next())){
3149 if (!Process(anEntry)){
3150 hasError = kTRUE;
3151 }
4b95672b 3152
3153 // clean SHUTTLE temp directory
d524ade6 3154 //TString filename = Form("%s/*.shuttle", GetShuttleTempDir());
3155 //RemoveFile(filename.Data());
2bb7b766 3156 }
3157
3158 return hasError == kFALSE;
3159}
cb343cfd 3160
3161//______________________________________________________________________________________________
3162ULong_t AliShuttle::GetTimeOfLastAction() const
3163{
9827400b 3164 //
3165 // Gets time of last action
3166 //
3167
cb343cfd 3168 ULong_t tmp;
36c99a6a 3169
cb343cfd 3170 fMonitoringMutex->Lock();
be48e3ea 3171