Hands on 4: First physics measurments


Introduction

Note: The code of this tutorial is a modified version of Geant4 example B5. You can review almost all concepts from this tutorial in the example that can be found under: $G4INSTALL/examples/basic/B5.
In this third hands-on you will learn: The code for this hands-on session is here ( for your reference, the complete solution is also availavble here). Copy the the tar ball to your local area.

Follow the instructions of Hands On 1 to configure with cmake the example and build it. Try out the application:

  $ cd <tutorial>
  $ tar xzf HandsOn4.tar.gz
  $ cd HandsOn4
  $ cmake .
  $ make -j 2 -f Makefile
  $ ./SLACtut

Note: Ignore compiler warning messages. They disappear once you complete the exercise.

The geometry is the same obtained at the end of the previous hands on.
We will not modify the geometry anymnore, but several sensitive detectors and hits classes have been added to the setup. Take a moment to look at the classes whose filenames ends with strings "SD" and "Hit". In particular it is important that you understand how the calorimeter hits work. In the exercise number 2 you will calculate a very simple physics quantity (a partial shower shape) from the energy released in calorimeters.
Geant4 does not provide/recommend specific utilities for the analysis opeartions because these are strongly detector-specific. However we provide a light-weight histogramming and ntuple facility. These are compatible with AIDA and ROOT output format and they do not require any library installed on the system. They can also dump output in tabular form in text files (CSV) to import data in (virtually) any analysis system (pylab, R, Octave, Excel, Matlab, Mathematica, ...). If you do not have an AIDA-compliant tool nor ROOT installed, will not be able to display histograms, but you will still be able to read data written in CSV format.

For your interest here are some links:


User Actions II

In Exercise 3 of Hands On 3 you have printed on screen, for each simulated event, the hits collected in the hodosope. In this exercise we will show how to accumualte some information (the energy deposited in the calorimeters) over the entire run. We will also show how to merge (i.e. reduce or combine) the results in a multi-threaded application.
The goal of these two exercises is to calculate the average energy released in the electromagnetic and hadronic calorimeter and the average partial shower shape. A shower shape is a quantity that somehow describes the charactersitics spatial dimensions of particle showers in calorimeters. In this example we will calcualte the fraction of energy released in the electromagnetic calorimeter. An electron or gamma has the em fraction very close to 1; a hadron will have a smaller value of the em fraction.

During Exercise 1 you will modify the application to accumulate the energy released in calorimeters in each event. You will modify 2 files: Run.hh and Run.cc, implementing a user-defined G4Run class.
During Exercise 2 you will modify the file RunAction.cc, that implements the user defined G4UserRunAction class, to retrieve the information collected in the first exercise and dump on screen the results of your data analysis: energy in calorimeters and shower shape.
During the simulation of a run an instance of a G4Run exists and is managed by Geant4 kernel. User should extend this class to accumulate user data.

Exercise 1 Step 1

Create a user-defined run class

Modify the file Run.hh, a class inheriting from G4Run. Extend the class to contain the information to be stored: the total energy deposited in the two calorimeters and the accumulated shower shape (of types double). Since you will need to access hits collections from calorimeters, add two integer data members to keep track of the hits collection ids.
Question: what are the data members of the base class G4Run?

Solution

Run.hh File:

 class Run : public G4Run {
 public:
  Run();
  virtual ~Run() {};
  virtual void RecordEvent(const G4Event*);
  virtual void Merge(const G4Run*);
  G4double GetEmEnergy() const { return em_ene; }
  G4double GetHadEnergy() const { return had_ene; }
  G4double GetShowerShape() const { return shower_shape; }
 private:
  G4double em_ene; //accumulated energy in EM calo
  G4double had_ene;//accumulated energy in HAD calo
  G4double shower_shape;//accumulated shower shape ( f=EM/(EM+HAD) )
  G4int ECHCID; //ID for EM hits collection
  G4int HCHCID; //ID for HAD hits collection
 };

Exercise 1 Step 2

Accumualte physics quantities

Modify in file Run.cc the method RecordEvent. This method is called by Geant4 kernel at the end of each event passing the pointer to the current event. Retrieve here the hits collections of both calorimeters, loop on all hits and calculate the needed physics quantities. In the constructor of Run class initialize the class data members to an initial value (0 for energy and shape and -1 for ids).
Hint: Note that the initial value of -1 for hits id allows you to be efficient in searching the hits by collection: if id==-1 you need to search the collections, if not you already did this opeartion and you can skip the heavy search by string.

Solution

Run.cc

 Run::Run()
 : G4Run() ,
  em_ene(0), had_ene(0), shower_shape(0) , ECHCID(-1), HCHCID(-1)

 { }

 void Run::RecordEvent(const G4Event* evt)
 {
  //Forward call to base class
  //This is important, can you tell why?
  G4Run::RecordEvent(evt);
 
  if ( ECHCID == -1 || HCHCID == -1) {
    G4SDManager* sdManager = G4SDManager::GetSDMpointer();
    ECHCID = sdManager->GetCollectionID("EMcalorimeter/EMcalorimeterColl");
    HCHCID = sdManager->GetCollectionID("HadCalorimeter/HadCalorimeterColl");
  }
  G4HCofThisEvent* hce = evt->GetHCofThisEvent();
  if (!hce) {
    G4ExceptionDescription msg;
    msg << "No hits collection of this event found.\n";
    G4Exception("Run::RecordEvent()","Code001", JustWarning, msg);
    return;
  }
  const EmCalorimeterHitsCollection* emHC = static_cast<const EmCalorimeterHitsCollection*>(hce->GetHC(ECHCID));
  const HadCalorimeterHitsCollection* hadHC = static_cast<const HadCalorimeterHitsCollection*>(hce->GetHC(HCHCID));
  if ( !emHC || !hadHC )
  {
    G4ExceptionDescription msg;
     msg << "Some of hits collections of this event not found.\n";
    G4Exception("Run::RecordEvent()","Code001", JustWarning, msg);
     return;
  }
  G4double em = 0;
  G4double had = 0;
  for (size_t i=0;i<emHC->GetSize();i++)
  {
    EmCalorimeterHit* hit = (*emHC)[i];
    em += hit->GetEdep();
  }
  for (size_t i=0;i<hadHC->GetSize();i++)
  {
    HadCalorimeterHit* hit = (*hadHC)[i];
    had += hit->GetEdep();
  }
  had_ene += had;
  em_ene += em;
  if ( had+em > 0 ) //Protect agains the case had+em=0
    shower_shape += ( em/(had+em) );
 }

Exercise 1 Step 3

Implement reduction for multi-threading.

This step is optional for application without multi-thread support.
Why you need this? Remember in a multi-threaded application each worker thread has its own instance of class G4Run. Events are distributed and you end the simulation with many run objects (one per worker thread). Geant4 provides a way to merge these sub-runs into a single global one. This is done implementing a Merge method in the Run class. Geant4 kernel works in a way that the worker threads will call the Merge method of the master run object passing a pointer to the worker run object. This animation explains what is happening under the hood (Geant4 kernel will take care of synchronizing the threads to avoid race conditions):

Solution

Run.cc File:

 void Run::Merge(const G4Run* aRun)
 {
  const Run* localRun = static_cast<const Run*>(aRun);
  em_ene += localRun->GetEmEnergy();
  had_ene += localRun->GetHadEnergy();
  shower_shape += localRun->GetShowerShape();
  //Forward call to base-class
  //This is important, can you tell why?
  G4Run::Merge(localRun);
 }

Exercise 1 Step 4

Create an instance of user-defined run class at each new run.

Now that you have extended G4Run you need to tell Geant4 kernel to use it instead of the default one. To do so you need to modify the method RunAction::GenerateRun and return an instance of Run instead of the default (this method is called by Geant4 at the beginning of each run). The method is already implemented in RunAction.cc file.

Solution

RunAction.cc File

 G4Run* RunAction::GenerateRun() {
  return new Run;
 }

Exercise 2

Calculate physics quantities and print them on screen.

Now that Run class has been modified to include user data we can print out the summary of our simple data analysis at the end of the run. To do that we modify the method EndOfRunAction of the RunAction class (RunAction.cc file). Retrieve from the run object the information you need and calculate the average energy release in calorimeters and the shower shape.
Hint 1: Note that Geant4 will pass you an object of type G4Run (the base class). You need to make an appropriate cast to access your data.
Hint 2: The total number of events is a data member of base class G4Run. Check in online documentation how to get it.
Hint 3: The quantity have been stored in Geant4 natural units. A useful function G4BestUnit can be used to print on screen a variable with a dimension. For example:

G4double someValue = 0.001*GeV;
G4cout<< G4BestUnit( someValue , "Energy" )<<G4endl; //Will print "1 MeV"

Solution

RunAction.cc File

 void RunAction::EndOfRunAction(const G4Run* run)
 {
  const Run* myrun = dynamic_cast<const Run*>(run);
  if ( myrun )
  {
    G4int nEvents = myrun->GetNumberOfEvent();
    if ( nEvents < 1 )
    {
      G4ExceptionDescription msg;
      msg << "Run consists of 0 events";
      G4Exception("RunAction::EndOfRunAction()","Code001", JustWarning, msg);
    }
    G4double em_ene = myrun->GetEmEnergy();
    G4double had_ene = myrun->GetHadEnergy();
    G4double shower_shape = myrun->GetShowerShape();
    G4int safety = ( nEvents > 0 ? nEvents : 1);//To avoid divisions by zero
    G4cout<<"Run["<<myrun->GetRunID()<<"] With: "<<nEvents<<"Events\n"
      <<" <E_em>="<<G4BestUnit(em_ene/safety,"Energy")<<"\n"
      <<" <E_had>="<<G4BestUnit(had_ene/safety,"Energy")<<"\n"
      <<" <E>="<<G4BestUnit((em_ene+had_ene)/safety,"Energy")<<"\n"
      <<" <ShowerShape>="<<shower_shape/safety<<G4endl;
  } else {
  G4ExceptionDescription msg;
  msg << "Run is not of correct type, skipping analysis";
  G4Exception("RunAction::EndOfRunAction()","Code001", JustWarning, msg);
  }
 }


Using g4analysis to store the results in a file

In these exercises we will use G4AnalysisManager to store in ntuples and histograms the content of hits collections. The goal of the g4analysis module is to provide light-weight support for simple storage of data. You need an AIDA-compliant tool or ROOT to visualize histograms.
Alternatively at this link you can find a simple python script that will display the CSV histograms produced by g4analysis if you have the matplotlib python package.
If you want to use matlab/octave you can plot the 1D histograms from this tutorial with the following commands:

  >> data=readcsv("Tutorial_h1_Chamber1.csv")
  >> x=(1:50)
  >> y=data(9:58,2)
  >> plot(x,y)

With ROOT this is shows how histograms look like:

While this is how they look like in JAS3:

And this is how one histogram looks like in Octave:

And this is how the histograms look like in matplotlib using the provided plot_histofiles.py script:


For this tutorial the VM has installed ROOT system (not matplotlib or octave), so we use this as the default format.
If you do not know ROOT, this is for you. Get the ROOT plot.C script and copy it the working directory. To visualize histograms in ROOT type:
  $ root -l plot.C
  root[0] #Type ".q" to exit root

The plotting script relies on the histogram names and file to be the one produced by the solution of this exercise. Hint: Histograms are automatically merged from all worker threads. With a concept similar to what shown in Exercise 1 Step 3 histograms are summed at the end of the run. A single histograms file "Tutorial.[root|xml]" exists (for CSV each histogram is written out in a separate file, with file name "Tutorial_h{1,2}_{name}.csv"). For ntuples it makes not so much sense the automatic merging, because what you will do for analysis is to process one file after the other. In ROOT terminology you will create a TChain ( ITupleFactory::createChained in AIDA); with CSV format the merging is as simple as: cat *_t*.csv > merged.csv.
For ntuples the file name format is: "Tutorial_t*.[root|xml]", for CSV the format is: "Tutorial_nt_[ntupleName]_t*.csv".

Exercise 3 Step 1

Define content of the output file(s).

Create output file(s) and define their content: four histograms and one nutple.

Solution

RunAction.cc file:
 RunAction::RunAction()
  : G4UserRunAction()
 {
  // Create analysis manager
  // The choice of persistency format is done via selectin of a namespace
  // in Analysis.hh
  G4AnalysisManager* analysisManager = G4AnalysisManager::Instance();
  G4cout << "Using " << analysisManager->GetType() << G4endl;
 
  // Default settings
  analysisManager->SetVerboseLevel(1);
  analysisManager->SetFileName("Tutorial");
 
  // Book histograms, ntuple
  //
 
  // Creating 1D histograms
  analysisManager->CreateH1("Chamber1","Drift Chamber 1 # Hits", 50, 0., 50); // h1 Id = 0
  analysisManager->CreateH1("Chamber2","Drift Chamber 2 # Hits", 50, 0., 50); // h1 Id = 1
 
  // Creating 2D histograms
  analysisManager->CreateH2("Chamber1_XY","Drift Chamber 1 X vs Y",50, -1000., 1000, 50, -300., 300.); // h2 Id = 0
  analysisManager->CreateH2("Chamber2_XY","Drift Chamber 2 X vs Y",50, -1500., 1500, 50, -300., 300.); // h2 Id = 1
 
  // Creating ntuple
  //
  analysisManager->CreateNtuple("Tutorial", "Hits");
  analysisManager->CreateNtupleIColumn("Dc1Hits"); // column Id = 0
  analysisManager->CreateNtupleIColumn("Dc2Hits"); // column Id = 1
  analysisManager->CreateNtupleDColumn("ECEnergy"); // column Id = 2
  analysisManager->CreateNtupleDColumn("HCEnergy"); // column Id = 3
  analysisManager->CreateNtupleDColumn("Time1"); // column Id = 4
  analysisManager->CreateNtupleDColumn("Time2"); // column Id = 5
  analysisManager->FinishNtuple(); //Do not forget this line!
 }

Note: We book histograms and define the nutple in the constructor of the G4UserRunAction class, however you should note that this class is instantiated only once per job, even if multiple runs are simulated. This means that you cannot change the content of the output file between runs. If you think that you will do that, you can book histograms and define columns in the ntuple in the BeginOfRunAction method.

Exercise 3 Step 2

Open the output file at each new run.

Defining an output file and its content, it is not enough, you need to explicitly open it. The best way to do so is to open it at the beginning of a new run. In more complex setups where you will perform more than one run per job, you can change the file name at each new run (e.g. via UI commands or append the run number to the file name), so you can produce one file output for each run.

Solution

RunAction.cc file:
  void RunAction::BeginOfRunAction(const G4Run* /*run*/)
 {
  // Get analysis manager
  G4AnalysisManager* analysisManager = G4AnalysisManager::Instance();
  // Open an output file
  // The default file name is set in RunAction::RunAction(),
  // it can be overwritten in a macro
  analysisManager->OpenFile();
 }

Exercise 3 Step 3

Write out the file.

Output files must be explicitly written to disk and closed. It is a good idea to do that at the end of the run.

Solution

RunAction.cc file:
 void RunAction::EndOfRunAction(const G4Run* run)
 {
  const Run* myrun = dynamic_cast<const Run*>(run);
  if ( myrun )
  {
    G4int nEvents = myrun->GetNumberOfEvent();
    if ( nEvents < 1 )
    {
      G4ExceptionDescription msg;
      msg << "Run consists of 0 events";
      G4Exception("RunAction::EndOfRunAction()",
      "Code001", JustWarning, msg);
      nEvents=1;
    }
    G4double em_ene = myrun->GetEmEnergy();
    G4double had_ene = myrun->GetHadEnergy();
    G4double shower_shape = myrun->GetShowerShape();
    G4int safety = ( nEvents > 0 ? nEvents : 1);//To avoid divisions by zero
    G4cout<<"Run["<<myrun->GetRunID()<<"] With: "<<nEvents<<"Events\n"
      <<" <E_em>="<<G4BestUnit(em_ene/safety,"Energy")<<"\n"
      <<" <E_had>="<<G4BestUnit(had_ene/safety,"Energy")<<"\n"
      <<" <E>="<<G4BestUnit((em_ene+had_ene)/safety,"Energy")<<"\n"
      <<" <ShowerShape>="<<shower_shape/safety<<G4endl;
  } else {
    G4ExceptionDescription msg;
    msg << "Run is not of correct type, skypping analysis via RunAction";
    G4Exception("RunAction::EndOfRunAction()","Code001", JustWarning, msg);
  }
 
  //=================================
  // Exercise 3 Step 3:
  // Write and close output file
  // save histograms & ntuple
  //
  G4AnalysisManager* analysisManager = G4AnalysisManager::Instance();
  analysisManager->Write();
  analysisManager->CloseFile();
 }

Exercise 3 Step 4

Fill histograms and ntuple with data.

At the end of each event you should retrieve informaiton from hits collection and fill the histograms and ntuple objects.

You can access filled hits at the end of each event in EventAction class.

Solution

EventAction.cc file:
 void EventAction::EndOfEventAction(const G4Event* event)
 {
  G4HCofThisEvent* hce = event->GetHCofThisEvent();
  if (!hce)
  {
    G4ExceptionDescription msg;
    msg << "No hits collection of this event found.\n";
    G4Exception("EventAction::EndOfEventAction()","Code001", JustWarning, msg);
    return;
  }
 
  // Get hits collections
  HodoscopeHitsCollection* hHC1 = static_cast<HodoscopeHitsCollection*>(hce->GetHC(fHHC1ID));
 
  HodoscopeHitsCollection* hHC2 = static_cast<HodoscopeHitsCollection*>(hce->GetHC(fHHC2ID));
 
  DriftChamberHitsCollection* dHC1 = static_cast<DriftChamberHitsCollection*>(hce->GetHC(fDHC1ID));
 
  DriftChamberHitsCollection* dHC2 = static_cast<DriftChamberHitsCollection*>(hce->GetHC(fDHC2ID));
 
  EmCalorimeterHitsCollection* ecHC = static_cast<EmCalorimeterHitsCollection*>(hce->GetHC(fECHCID));
 
  HadCalorimeterHitsCollection* hcHC = static_cast<HadCalorimeterHitsCollection*>(hce->GetHC(fHCHCID));
 
  if ( (!hHC1) || (!hHC2) || (!dHC1) || (!dHC2) || (!ecHC) || (!hcHC) )
  {
    G4ExceptionDescription msg;
    msg << "Some of hits collections of this event not found.\n";
    G4Exception("EventAction::EndOfEventAction()","Code001", JustWarning, msg);
    return;
  }
 
  //
  // Fill histograms & ntuple
  //
  //=================================
  // Exercise 3 Step 4:
  // Fill histograms & ntuple
 
  // Get analysis manager
  G4AnalysisManager* analysisManager = G4AnalysisManager::Instance();
 
  // Fill histograms
 
  G4int n_hit = dHC1->entries();
  analysisManager->FillH1(0, n_hit);
 
  for (G4int i=0;i<n_hit;i++)
  {
    DriftChamberHit* hit = (*dHC1)[i];
    G4ThreeVector localPos = hit->GetLocalPos();
    analysisManager->FillH2(0, localPos.x(), localPos.y());
  }
 
  n_hit = dHC2->entries();
  analysisManager->FillH1(1, n_hit);
 
  for (G4int i=0;i<n_hit;i++)
  {
    DriftChamberHit* hit = (*dHC2)[i];
    G4ThreeVector localPos = hit->GetLocalPos();
    analysisManager->FillH2(1, localPos.x(), localPos.y());
  }
 
 
  // Fill ntuple
 
  // Dc1Hits
  analysisManager->FillNtupleIColumn(0, dHC1->entries());
  // Dc2Hits
  analysisManager->FillNtupleIColumn(1, dHC1->entries());
 
  // ECEnergy
  G4int totalEmHit = 0;
  G4double totalEmE = 0.;
  for (G4int i=0;i<80;i++)
  {
    EmCalorimeterHit* hit = (*ecHC)[i];
    G4double eDep = hit->GetEdep();
    if (eDep>0.)
    {
      totalEmHit++;
      totalEmE += eDep;
    }
  }
  analysisManager->FillNtupleDColumn(2, totalEmE);
 
  // HCEnergy
  G4int totalHadHit = 0;
  G4double totalHadE = 0.;
  for (G4int i=0;i<20;i++)
  {
    HadCalorimeterHit* hit = (*hcHC)[i];
    G4double eDep = hit->GetEdep();
    if (eDep>0.)
    {
      totalHadHit++;
      totalHadE += eDep;
    }
  }
  analysisManager->FillNtupleDColumn(3, totalHadE);
 
  // Time 1
  for (G4int i=0;i<hHC1->entries();i++)
  {
    analysisManager->FillNtupleDColumn(4,(*hHC1)[i]->GetTime());
  }
 
  // Time 2
  for (G4int i=0;i<hHC2->entries();i++)
  {
    analysisManager->FillNtupleDColumn(5,(*hHC2)[i]->GetTime());
  }
 
  analysisManager->AddNtupleRow();
 
  //
  // Print diagnostics: UI command /run/printProgress can be used
  // to set frequency of how often info should be dumpled
  //
 
  G4int printModulo = G4RunManager::GetRunManager()->GetPrintProgress();
  if ( printModulo==0 || event->GetEventID() % printModulo != 0) return;
 
  G4PrimaryParticle* primary = event->GetPrimaryVertex(0)->GetPrimary(0);
  G4cout << G4endl
      << ">>> Event " << event->GetEventID() << " >>> Simulation truth : "
      << primary->GetG4code()->GetParticleName()
       << " " << primary->GetMomentum() << G4endl;
 
  // Hodoscope 1
  n_hit = hHC1->entries();
  G4cout << "Hodoscope 1 has " << n_hit << " hits." << G4endl;
  for (G4int i=0;i<n_hit;i++)
  {
    HodoscopeHit* hit = (*hHC1)[i];
    hit->Print();
  }
 
  // Hodoscope 2
  n_hit = hHC2->entries();
  G4cout << "Hodoscope 2 has " << n_hit << " hits." << G4endl;
  for (G4int i=0;i<n_hit;i++)
  {
    HodoscopeHit* hit = (*hHC2)[i];
    hit->Print();
  }
 
  // Drift chamber 1
  n_hit = dHC1->entries();
  G4cout << "Drift Chamber 1 has " << n_hit << " hits." << G4endl;
  for (G4int i2=0;i2<5;i2++)
  {
    for (G4int i=0;i<n_hit;i++)
    {
      DriftChamberHit* hit = (*dHC1)[i];
      if (hit->GetLayerID()==i2) hit->Print();
    }
  }
 
  // Drift chamber 2
  n_hit = dHC2->entries();
  G4cout << "Drift Chamber 2 has " << n_hit << " hits." << G4endl;
  for (G4int i2=0;i2<5;i2++)
  {
    for (G4int i=0;i<n_hit;i++)
    {
      DriftChamberHit* hit = (*dHC2)[i];
      if (hit->GetLayerID()==i2) hit->Print();
    }
  }
 
  // EM calorimeter
  G4cout << "EM Calorimeter has " << totalEmHit << " hits. Total Edep is "
  << totalEmE/MeV << " (MeV)" << G4endl;
 
  // Had calorimeter
  G4cout << "Hadron Calorimeter has " << totalHadHit << " hits. Total Edep is "
  << totalHadE/MeV << " (MeV)" << G4endl;
 }

Exercise 3 Step 5

Select output file format.

You can select the output file format including one of g4root.hh, g4xml.hh or g4csv.hh headers file. To keep code clean a file Analysis.hh is used to define the persistency format.

Solution

Analysis.hh File:
  //Uncomment one of the three
 //#include "g4root.hh"
 //#include "g4xml.hh"
 #include "g4csv.hh"


Created by: Andrea Dotti (adotti AT slac DOT stanford DOT edu) May 2018
Updated by: Makoto Asai (asai AT slac DOT stanford DOT edu) August 2019