At the Large Hadron Collider, the identification of jets originating from b quarks is important for searches for new physics and for measurements of standard model processes. A variety of algorithms has been developed by CMS to select b-quark jets based on variables such as the impact parameters of charged-particle tracks, the properties of reconstructed decay vertices, and the presence or absence of a lepton, or combinations thereof. The performance of these algorithms has been measured using data from proton-proton collisions at the LHC and compared with expectations based on simulation. The data used in this study were recorded in 2011 at √ s = 7 TeV for a total integrated luminosity of 5.0 fb −1 . The efficiency for tagging b-quark jets has been measured in events from multijet and t-quark pair production. CMS has achieved a bjet tagging efficiency of 85% for a light-parton misidentification probability of 10% in multijet events. For analyses requiring higher purity, a misidentification probability of only 1.5% has been achieved, for a 70% b-jet tagging efficiency. Jets that arise from bottom-quark hadronization (b jets) are present in many physics processes, such as the decay of top quarks, the Higgs boson, and various new particles predicted by supersymmetric models. The ability to accurately identify b jets is crucial in reducing the otherwise overwhelming background to these channels from processes involving jets from gluons (g) and light-flavour quarks (u, d, s), and from c-quark fragmentation.
Submitted to the Journal of InstrumentationThe properties of the bottom and, to a lesser extent, the charm hadrons can be used to identify the hadronic jets into which the b and c quarks fragment. These hadrons have relatively large masses, long lifetimes and daughter particles with hard momentum spectra. Their semileptonic decays can be exploited as well. The Compact Muon Solenoid (CMS) detector, with its precise charged-particle tracking and robust lepton identification systems, is well matched to the task of b-jet identification (b-jet tagging). The first physics results using b-jet tagging have been published [1][2][3] from the first data samples collected at the Large Hadron Collider (LHC).This paper describes the b-jet tagging algorithms used by the CMS experiment and measurements of their performance. A description of the apparatus is given in Section 2. The event samples and simulation are discussed in Section 3. The algorithms for b-jet tagging are defined in Section 4. The distributions of the relevant observables are compared between simulation and proton-proton collision data collected in 2011 at a centre-of-mass energy of 7 TeV. The robustness of the algorithms with respect to running conditions, such as the alignment of the detector elements and the presence of additional collisions in the same bunch crossing (pileup), is also discussed.Physics analyses using b-jet identification require the values of the efficiency and misidentification probability of the chosen algorithm, and, in general, ...