Public Member Functions | |
def | __init__ (self, name, user, pattern='.*root', run_range=None, dbsInstance=None) |
def init(self, name, user, pattern='. More... | |
def | buildListOfBadFiles (self) |
def | buildListOfFiles (self, pattern) |
def | extractFileSizes (self) |
def | getPrimaryDatasetEntries (self) |
def | listOfFiles (self) |
def | listOfGoodFiles (self) |
def | listOfGoodFilesWithPrescale (self, prescale) |
def | printFiles (self, abspath=True, info=True) |
def | printInfo (self) |
Public Attributes | |
bad_files | |
dbsInstance | |
MM. More... | |
files | |
filesAndSizes | |
good_files | |
name | |
pattern | |
primaryDatasetEntries | |
MM. More... | |
report | |
run_range | |
user | |
Definition at line 23 of file dataset.py.
def dataset.BaseDataset.__init__ | ( | self, | |
name, | |||
user, | |||
pattern = '.*root' , |
|||
run_range = None , |
|||
dbsInstance = None |
|||
) |
def init(self, name, user, pattern='.
*root', run_range=None):
Definition at line 26 of file dataset.py.
def dataset.BaseDataset.buildListOfBadFiles | ( | self | ) |
Reimplemented in dataset.Dataset.
Definition at line 49 of file dataset.py.
def dataset.BaseDataset.buildListOfFiles | ( | self, | |
pattern | |||
) |
Reimplemented in dataset.PrivateDataset, dataset.Dataset, dataset.EOSDataset, dataset.LocalDataset, and dataset.CMSDataset.
Definition at line 41 of file dataset.py.
Referenced by dataset.BaseDataset.printFiles().
def dataset.BaseDataset.extractFileSizes | ( | self | ) |
Get the file size for each file, from the eos ls -l command.
Reimplemented in dataset.Dataset.
Definition at line 44 of file dataset.py.
def dataset.BaseDataset.getPrimaryDatasetEntries | ( | self | ) |
Reimplemented in dataset.PrivateDataset, dataset.Dataset, and dataset.CMSDataset.
Definition at line 57 of file dataset.py.
References dataset.BaseDataset.primaryDatasetEntries.
def dataset.BaseDataset.listOfFiles | ( | self | ) |
Returns all files, even the bad ones.
Definition at line 83 of file dataset.py.
References readConfig.fileINI.files, dataset.BaseDataset.files, chain.Chain.files, MatrixReader.MatrixReader.files, MatrixUtil.InputInfo.files, and JsonOutputProducer::JsonConfigurationBlock.files.
def dataset.BaseDataset.listOfGoodFiles | ( | self | ) |
Returns all files flagged as good in the integrity check text output, or not present in this file, are considered as good.
Definition at line 87 of file dataset.py.
References mps_setup.append, dataset.BaseDataset.bad_files, readConfig.fileINI.files, dataset.BaseDataset.files, chain.Chain.files, MatrixReader.MatrixReader.files, MatrixUtil.InputInfo.files, JsonOutputProducer::JsonConfigurationBlock.files, and dataset.BaseDataset.good_files.
Referenced by dataset.BaseDataset.listOfGoodFilesWithPrescale().
def dataset.BaseDataset.listOfGoodFilesWithPrescale | ( | self, | |
prescale | |||
) |
Takes the list of good files and selects a random sample from them according to the prescale factor. E.g. a prescale of 10 will select 1 in 10 files.
Definition at line 97 of file dataset.py.
References dataset.BaseDataset.good_files, dataset.int, and dataset.BaseDataset.listOfGoodFiles().
Definition at line 60 of file dataset.py.
References dataset.BaseDataset.bad_files, dataset.BaseDataset.buildListOfFiles(), readConfig.fileINI.files, dataset.BaseDataset.files, chain.Chain.files, MatrixReader.MatrixReader.files, MatrixUtil.InputInfo.files, JsonOutputProducer::JsonConfigurationBlock.files, dataset.BaseDataset.filesAndSizes, dataset.BaseDataset.good_files, CSCALCT.pattern, dataset.BaseDataset.pattern, presentation.PageLayout.pattern, l1t::EMTFRoad.pattern, L1TMuon::TriggerPrimitive::CSCData.pattern, cscdqm::MOFilterItem.pattern, CSCCorrelatedLCTDigi.pattern, l1t::EMTFHit.pattern, dataset.BaseDataset.primaryDatasetEntries, and edm.print().
def dataset.BaseDataset.printInfo | ( | self | ) |
Reimplemented in dataset.Dataset.
Definition at line 53 of file dataset.py.
References ElectronMVAID.ElectronMVAID.name, DigiComparisonTask.name, TestTask.name, TPComparisonTask.name, HcalOfflineHarvesting.name, HcalOnlineHarvesting.name, HFRaddamTask.name, LaserTask.name, NoCQTask.name, PedestalTask.name, QIE10Task.name, RecHitTask.name, QIE11Task.name, UMNioTask.name, ZDCTask.name, AlignableObjectId::entry.name, RawTask.name, average.Average.name, counter.Counter.name, TPTask.name, DigiTask.name, histograms.Histograms.name, LEDTask.name, cond::persistency::TAG::NAME.name, cond::persistency::RUN_INFO::RUN_NUMBER.name, cond::persistency::GTEditorData.name, TmModule.name, cond::persistency::GLOBAL_TAG::NAME.name, cond::persistency::RUN_INFO::START_TIME.name, cond::persistency::TAG::TIME_TYPE.name, cond::persistency::GLOBAL_TAG::VALIDITY.name, cond::persistency::RUN_INFO::END_TIME.name, cond::persistency::TAG::OBJECT_TYPE.name, core.autovars.NTupleVariable.name, cond::persistency::GLOBAL_TAG::DESCRIPTION.name, cond::persistency::O2O_RUN::JOB_NAME.name, DQMRivetClient::NormOption.name, cond::persistency::TAG::SYNCHRONIZATION.name, cond::persistency::GLOBAL_TAG::RELEASE.name, cond::persistency::O2O_RUN::START_TIME.name, cond::persistency::TAG::END_OF_VALIDITY.name, MEPSet.name, cond::persistency::GLOBAL_TAG::SNAPSHOT_TIME.name, cond::persistency::O2O_RUN::END_TIME.name, cond::persistency::O2O_RUN::STATUS_CODE.name, cms::dd::NameValuePair< T >.name, cond::persistency::TAG::DESCRIPTION.name, cond::persistency::GLOBAL_TAG::INSERTION_TIME.name, cond::persistency::O2O_RUN::LOG.name, cond::persistency::TAG::LAST_VALIDATED_TIME.name, cond::persistency::TAG::INSERTION_TIME.name, FWTGeoRecoGeometry::Info.name, cond::persistency::TAG::MODIFICATION_TIME.name, nanoaod::MergeableCounterTable::SingleColumn< T >.name, ParameterSet.name, preexistingValidation.PreexistingValidation.name, OutputMEPSet.name, PixelDCSObject< T >::Item.name, cms::DDSpecPar.name, cms::dd::ValuePair< T, U >.name, dataset.BaseDataset.name, personalPlayback.Applet.name, Types._Untracked.name, MagCylinder.name, ParSet.name, analyzer.Analyzer.name, DQMRivetClient::LumiOption.name, cond::persistency::GTProxyData.name, Barrel.name, edm::PathTimingSummary.name, DQMRivetClient::ScaleFactorOption.name, EgHLTOfflineSummaryClient::SumHistBinData.name, cms::DDAlgoArguments.name, SingleObjectCondition.name, cond::TimeTypeSpecs.name, perftools::EdmEventSize::BranchRecord.name, core.autovars.NTupleObjectType.name, MyWatcher.name, edm::PathSummary.name, EcalLogicID.name, lumi::TriggerInfo.name, alignment.Alignment.name, PixelEndcapLinkMaker::Item.name, XMLProcessor::_loaderBaseConfig.name, MEtoEDM< T >::MEtoEDMObject.name, FWTableViewManager::TableEntry.name, PixelBarrelLinkMaker::Item.name, ExpressionHisto< T >.name, DQMGenericClient::EfficOption.name, Supermodule.name, TreeCrawler.Package.name, cond::persistency::GLOBAL_TAG_MAP::GLOBAL_TAG_NAME.name, genericValidation.GenericValidation.name, cond::persistency::GLOBAL_TAG_MAP::RECORD.name, options.ConnectionHLTMenu.name, cond::persistency::GLOBAL_TAG_MAP::LABEL.name, cond::persistency::GLOBAL_TAG_MAP::TAG_NAME.name, cond::Tag_t.name, FastHFShowerLibrary.name, dqmoffline::l1t::HistDefinition.name, DQMGenericClient::ProfileOption.name, nanoaod::MergeableCounterTable::VectorColumn< T >.name, magneticfield::BaseVolumeHandle.name, cms::DDParsingContext::CompositeMaterial.name, emtf::Node.name, h4DSegm.name, DQMGenericClient::NormOption.name, core.TriggerMatchAnalyzer.TriggerMatchAnalyzer.name, cond::persistency::PAYLOAD::HASH.name, cond::persistency::PAYLOAD::OBJECT_TYPE.name, looper.Looper.name, DQMGenericClient::CDOption.name, PhysicsTools::Calibration::Variable.name, cond::persistency::PAYLOAD::DATA.name, cond::TagInfo_t.name, cond::persistency::PAYLOAD::STREAMER_INFO.name, cond::persistency::PAYLOAD::VERSION.name, TrackerSectorStruct.name, cond::persistency::PAYLOAD::INSERTION_TIME.name, MuonGeometrySanityCheckPoint.name, FCDTask.name, DQMGenericClient::NoFlowOption.name, CounterChecker.name, Mapper::definition< ScannerT >.name, EDMtoMEConverter.name, classes.MonitorData.name, HistogramManager.name, classes.OutputData.name, Crystal.name, cond::persistency::IOV::TAG_NAME.name, cond::persistency::IOV::SINCE.name, cond::persistency::IOV::PAYLOAD_HASH.name, h2DSegm.name, cond::persistency::IOV::INSERTION_TIME.name, options.HLTProcessOptions.name, DQMNet::WaitObject.name, core.TriggerBitAnalyzer.TriggerBitAnalyzer.name, AlpgenParameterName.name, config.Analyzer.name, geometry.Structure.name, core.autovars.NTupleSubObject.name, Capsule.name, core.autovars.NTupleObject.name, Ceramic.name, SiStripMonitorDigi.name, BulkSilicon.name, config.Service.name, APD.name, nanoaod::FlatTable::Column.name, core.autovars.NTupleCollection.name, BPHRecoBuilder::BPHRecoSource.name, cond::payloadInspector::TagReference.name, BPHRecoBuilder::BPHCompSource.name, cond::persistency::TAG_LOG::TAG_NAME.name, cond::persistency::TAG_LOG::EVENT_TIME.name, cond::persistency::TAG_LOG::USER_NAME.name, cond::persistency::TAG_LOG::HOST_NAME.name, cond::persistency::TAG_LOG::COMMAND.name, cond::persistency::TAG_LOG::ACTION.name, cond::persistency::TAG_LOG::USER_TEXT.name, InnerLayerVolume.name, personalPlayback.FrameworkJob.name, Grid.name, Grille.name, BackPipe.name, plotscripts.SawTeethFunction.name, PatchPanel.name, BackCoolTank.name, DryAirTube.name, crabFunctions.CrabTask.name, MBCoolTube.name, MBManif.name, cscdqm::ParHistoDef.name, hTMaxCell.name, BeautifulSoup.Tag.name, SummaryOutputProducer::GenericSummary.name, BeautifulSoup.SoupStrainer.name, edm.print(), EcalTPGParamReaderFromDB.user, dataset.BaseDataset.user, popcon::RpcDataT.user, popcon::RpcDataV.user, popcon::RpcObGasData.user, popcon::RPCObPVSSmapData.user, popcon::RpcDataUXC.user, popcon::RpcDataI.user, popcon::RpcDataFebmap.user, popcon::RpcDataGasMix.user, popcon::RpcDataS.user, EcalDBConnection.user, and MatrixInjector.MatrixInjector.user.
dataset.BaseDataset.bad_files |
Definition at line 51 of file dataset.py.
Referenced by dataset.BaseDataset.listOfGoodFiles(), and dataset.BaseDataset.printFiles().
dataset.BaseDataset.dbsInstance |
MM.
Definition at line 32 of file dataset.py.
Referenced by dataset.PrivateDataset.getPrimaryDatasetEntries().
dataset.BaseDataset.files |
Definition at line 42 of file dataset.py.
Referenced by dataset.BaseDataset.listOfFiles(), dataset.BaseDataset.listOfGoodFiles(), and dataset.BaseDataset.printFiles().
dataset.BaseDataset.filesAndSizes |
Definition at line 47 of file dataset.py.
Referenced by dataset.BaseDataset.printFiles().
dataset.BaseDataset.good_files |
Definition at line 50 of file dataset.py.
Referenced by dataset.BaseDataset.listOfGoodFiles(), dataset.BaseDataset.listOfGoodFilesWithPrescale(), and dataset.BaseDataset.printFiles().
dataset.BaseDataset.name |
Definition at line 27 of file dataset.py.
Referenced by FWLite.ElectronMVAID.__call__(), dirstructure.Directory.__create_pie_image(), DisplayManager.DisplayManager.__del__(), dqm_interfaces.DirID.__eq__(), dirstructure.Directory.__get_full_path(), dirstructure.Comparison.__get_img_name(), dataset.Dataset.__getDataType(), dataset.Dataset.__getFileInfoList(), dirstructure.Comparison.__make_image(), dirstructure.Directory.__repr__(), dqm_interfaces.DirID.__repr__(), dirstructure.Comparison.__repr__(), config.CFG.__str__(), FWLite.WorkingPoints._reformat_cut_definitions(), dataset.CMSDataset.buildListOfFiles(), dataset.LocalDataset.buildListOfFiles(), dataset.CMSDataset.buildListOfFilesDBS(), dirstructure.Directory.calcStats(), validation.Sample.digest(), python.rootplot.utilities.Hist.divide(), python.rootplot.utilities.Hist.divide_wilson(), DisplayManager.DisplayManager.Draw(), utils.StatisticalTest.get_status(), production_tasks.Task.getname(), dataset.CMSDataset.getPrimaryDatasetEntries(), dataset.PrivateDataset.getPrimaryDatasetEntries(), VIDSelectorBase.VIDSelectorBase.initialize(), dirstructure.Directory.print_report(), dataset.BaseDataset.printInfo(), dataset.Dataset.printInfo(), production_tasks.MonitorJobs.run(), python.rootplot.utilities.Hist.TGraph(), and python.rootplot.utilities.Hist.TH1F().
dataset.BaseDataset.pattern |
Definition at line 29 of file dataset.py.
Referenced by dataset.BaseDataset.printFiles().
dataset.BaseDataset.primaryDatasetEntries |
MM.
Definition at line 34 of file dataset.py.
Referenced by dataset.BaseDataset.getPrimaryDatasetEntries(), and dataset.BaseDataset.printFiles().
dataset.BaseDataset.report |
Definition at line 35 of file dataset.py.
Referenced by dataset.Dataset.getPrimaryDatasetEntries(), and addOnTests.testit.run().
dataset.BaseDataset.run_range |
Definition at line 30 of file dataset.py.
Referenced by dataset.CMSDataset.buildListOfFiles(), dataset.CMSDataset.buildListOfFilesDBS(), dataset.CMSDataset.getPrimaryDatasetEntries(), and dataset.PrivateDataset.getPrimaryDatasetEntries().
dataset.BaseDataset.user |
Definition at line 28 of file dataset.py.
Referenced by dataset.BaseDataset.printInfo(), DTWorkflow.DTWorkflow.remote_out_path(), production_tasks.CheckDatasetExists.run(), production_tasks.GenerateMask.run(), production_tasks.SourceCFG.run(), production_tasks.FullCFG.run(), production_tasks.MonitorJobs.run(), and production_tasks.CleanJobFiles.run().