1 Introduction

The non-Abelian nature of the standard model (SM) electroweak theory predicts the self-interactions of the weak gauge bosons. These triple and quartic gauge-boson couplings provide a unique means to test for new fundamental interactions. The fusion of electroweak (EW) bosons is a particularly important process for measuring particle properties, such as the couplings of the Higgs boson, and for searching for new particles beyond the Standard Model [1,2,3,4,5,6,7,8,9,10,11]. In proton–proton (pp) collisions, a characteristic signature of these processes is the production of two high-momentum jets of hadrons at small angles with respect to the incoming proton beams [12]. Measurements of this vector-boson-fusion (VBF) topology have been performed in W [13], Z [14, 15] and Higgs [16] boson production, though the observation of purely electroweak processes in this topology has only been achieved in individual measurements of Z-boson production. This paper presents a precise measurement of electroweak W-boson production in the VBF topology, with a significance well above the standard for claiming observation, as well as differential cross section measurements and constraints on anomalous triple-gauge-boson couplings (aTGCs).

The production of a W boson in association with two or more jets (\(Wjj\)) is dominated by processes involving strong interactions (strong \(Wjj\) or QCD \(Wjj\)). These processes have been extensively studied by experiments at the Large Hadron Collider (LHC) [17, 18] and the Tevatron collider [19, 20], motivating the development of precise perturbative predictions [21,22,23,24,25,26,27,28,29,30,31,32,33]. The large cross section for W-boson production provides greater sensitivity to the VBF topology and to the electroweak production of \(Wjj\) (electroweak \(Wjj\) or EW \(Wjj\)) than corresponding measurements of Z- or Higgs-boson production.

The VBF process is inseparable from other electroweak \(Wjj\)  processes, so it is not measured directly; sensitivity to the VBF production mechanism is quantified by determining constraints on operator coefficients in an effective Lagrangian approach [34]. The classes of electroweak diagrams constituting the signal are shown in Figure 1 [35] and contain at least three vertices where an electroweak gauge boson connects to a pair of fermions. Diboson production, where the final-state quarks result from the decay of an s-channel gauge boson, is not shown and is considered as a background; it is small for the VBF topology defined in the analysis. The large background from a W boson associated with strongly produced jets is shown in Fig. 2 and has only two electroweak vertices. This background has \(\mathcal{{O}}\)(10) times the yield of the signal process, and can interfere with the signal. This interference is suppressed because only a small subset of the background diagrams have the same initial and final state as the signal.

Fig. 1
figure 1

Representative leading-order diagrams for electroweak Wjj production at the LHC. In addition to a the vector boson fusion process, there are four b W bremsstrahlung diagrams, corresponding to \(W^\pm \) boson radiation by any incoming or outgoing quark, and two c non-resonant diagrams, corresponding to \(W^\pm \) boson radiation by either incoming quark

Fig. 2
figure 2

Examples of leading-order diagrams for strong Wjj production at the LHC. The left-hand diagram interferes with the electroweak diagrams of Fig. 1 when the final-state quarks have the same colours as the initial-state quarks

The analysis signature consists of a neutrino and either an electron or a muon, two jets with a high dijet invariant mass, and no additional jets at a wide angle from the beam. This signature discriminates signal events from the copious background events consisting of strongly produced jets associated with a W (or Z) boson, top-quark production, or multijet production. The purity of electroweak \(Wjj\)  production increases with increasing dijet invariant mass, increasing the sensitivity to anomalous triple-gauge-boson couplings.

Measurements of the inclusive and fiducial cross sections of electroweak \(Wjj\)  production in proton–proton collisions at centre-of-mass energies \(\sqrt{s}=7\) and 8 \(\text {TeV}\) are performed in a fiducial region with a signal-to-background ratio of approximately 1:8. The electroweak signal is extracted with a binned likelihood fit to the dijet invariant mass distribution. The fit determines the ratio \(\mu _{\text {EW}} \) of the measured signal cross section to that of a Standard Model calculation [36]; this ratio is then multiplied by the prediction to provide the measured cross section. To reduce the uncertainties in the modelling of the strong \(Wjj\)  events, data are used to constrain their dijet mass distribution, resulting in a precise measurement of the electroweak \(Wjj\) fiducial cross section. The quantum-mechanical interference between electroweak and strong \(Wjj\) processes is not modelled and its impact on the measurement is estimated using a Monte Carlo simulation and taken as an uncertainty.

In order to explore the kinematics of the \(Wjj\)  topology, and the interplay between strong and electroweak production, the 8 \(\text {TeV}\) data are unfolded differentially to particle level in many variables and phase-space regions, and compared to theoretical predictions. Electroweak \(Wjj\)  production is measured in regions where the signal purity is relatively high (\(\gtrsim 10\%\)); combined strong and electroweak \(Wjj\)  production is measured in the other regions. These measurements are then integrated to obtain fiducial cross sections in the different phase-space regions, albeit with larger uncertainties than the measurement with the constrained background.

Sensitivity to the VBF diagram is determined by modifying the triple-gauge-boson couplings. Anomalous couplings arising from new processes at a high energy scale would cause increasing deviations from the SM prediction for increasing momentum transfer between the incoming partons. Hence, a region of high momentum transfer is defined, and constraints on anomalous gauge couplings are set in the context of an effective field theory (EFT), including limits on interactions that violate charge-parity (CP) conservation.

The paper is organized as follows. The ATLAS detector and reconstruction of the final-state particles are described in Sect. 2. The definitions of the measurement phase-space regions and the event selection are given in Sect. 3. The modelling of signal and background processes is discussed in Sect. 4. Section 5 is dedicated to the precise extraction of the inclusive and fiducial cross sections, while Sect. 6 presents differential cross sections unfolded for detector effects. Section 7 describes limits on aTGCs and parameters of an effective field theory. Section 8 summarizes the results and the Appendix provides a comprehensive set of differential cross-section measurements.

2 ATLAS detector and data reconstruction

The data set corresponds to LHC pp collisions at \(\sqrt{s}=7\) \(\text {TeV}\) in 2011 and at \(\sqrt{s}=8\) \(\text {TeV}\) in 2012, with final-state particles measured by the ATLAS detector. This section describes the detector and the reconstruction of the data to produce the final-state physics objects used in the measurements.

2.1 ATLAS detector

ATLAS is a multi-purpose detector used to measure LHC particle collisions. A detailed description of the detector can be found in Ref. [37]. A tracking system comprises the inner detector (ID) surrounding the collision point, with silicon pixel and microstrip detectors most centrally located, followed by a transition radiation tracker at higher radii [38, 39]. These tracking detectors are used to measure the trajectories and momenta of charged particles up to pseudorapidities of \(|\eta | = 2.5\).Footnote 1 The ID is surrounded by a superconducting solenoid, providing a 2 T magnetic field for the tracking detectors.

A calorimeter system surrounds the solenoid magnet and consists of electromagnetic and hadronic sections. The electromagnetic section is segmented along the z-axis into a barrel region covering \(|\eta | < 1.475\), two end-cap components spanning \(1.375< |\eta | < 3.2\), and two forward components (\(3.1< |\eta | < 4.9\)). Similarly, the hadronic section comprises a barrel region (\(|\eta | < 1.7\)), two end-cap regions (\(1.5< |\eta | < 3.2\)), and two forward regions (\(3.1< |\eta | < 4.9\)). The barrel region of the hadronic section uses scintillator tiles as the active medium, while the remaining regions use liquid argon.

A muon spectrometer surrounds the calorimeter system and contains superconducting coils, drift tubes and cathode strip chambers to provide precise measurements of muon momenta within \(|\eta | < 2.7\). The spectrometer also includes resistive-plate and thin-gap chambers to trigger on muons in the region \(|\eta | < 2.4\).

The ATLAS trigger system uses three consecutive stages to select events for permanent storage. The first level uses custom electronics and the second level uses fast software algorithms to inspect regions of interest flagged by the first trigger level. At the third level, the full event is reconstructed using software algorithms similar to those used offline.

2.2 Object reconstruction

Electrons, muons, and hadronic jets are reconstructed in the ATLAS detector. Each type of object has a distinctive signature and is identified using the criteria described below. The object identification includes track and vertex positions relative to the primary event vertex, defined as the reconstructed vertex with the highest summed \(p_{\text {T}} ^2\) of all associated tracks. Each object is calibrated and modelled in Monte Carlo simulation, corrected to match data measurements of the trigger, reconstruction, and identification efficiencies, and of the energy and momentum scales and resolutions [40,41,42,43,44].

3 Electrons

Electron candidates are reconstructed from energy clusters in the electromagnetic section of the calorimeter which are matched to tracks reconstructed in the ID. Candidates for signal events are required to satisfy ‘tight’ selection criteria [41, 42], which include requirements on calorimeter shower shape, track hit multiplicity, the ratio of reconstructed energy to track momentum, E / p, and the matching of the energy clusters to the track. In order to build templates to model the multijet background (see Sect. 4.2), a set of criteria is employed based on ‘loose’ or ‘medium’ selection, which drops the E / p requirement and uses less restrictive selection criteria for the other discriminating variables.

Electron candidates are required to be isolated to reject possible misidentified jets or heavy-flavour hadron decays. Isolation is calculated as the ratio of energy in an isolation cone around the primary track or calorimeter deposit to the energy of the candidate. Different isolation requirements are made in the 7 and 8 \(\text {TeV}\) data sets, due to the different LHC and detector operating conditions. For 7 \(\text {TeV}\) data taking, the requirements on track and calorimeter isolation variables associated with the electron candidate achieve a constant identification efficiency as a function of the candidate transverse energy (\(E_{\text {T}} \)) and pseudorapidity. The 8 \(\text {TeV}\) trigger includes a requirement on track isolation, so the selection is more restrictive and requires the summed \(p_{\text {T}} \) of surrounding tracks to be \(<5\%\) of the electron candidate \(E_{\text {T}} \), excluding the electron track and using a cone of size \(R \equiv \sqrt{(\Delta \phi )^2 + (\Delta \eta )^2} = 0.2\) around the shower centroid.

4 Muons

Muon candidates are identified as reconstructed tracks in the muon spectrometer which are matched to and combined with ID tracks to form a ‘combined’ muon candidate [43]. Quality requirements on the ID track include a minimum number of hits in each subdetector to ensure good track reconstruction. Candidates in 7 \(\text {TeV}\) data are selected using a track-based fractional isolation requiring the scalar sum of the \(p_{\text {T}} \) values of tracks within a cone of size \(R = 0.2\) of the muon track to be less than 10% of the candidate \(p_{\text {T}} \). For 8 \(\text {TeV}\) data taking, requirements are applied to track and calorimeter fractional isolation using a cone of size \(R = 0.3\). The upper bound on each type of isolation increases with increasing muon \(p_{\text {T}} \), and is 15% for \(p_{\text {T}} >30\) \(\text {GeV}\).

Additional transverse (\(d_0\)) and longitudinal (\(z_0\)) impact parameter requirements of \(|d_0/\sigma _{d_0}|<3\) (where \(\sigma _{d_0}\) is the \(d_0\) uncertainty) and \(|z_0\sin \theta |<0.5\) mm are imposed on all muon and electron candidates to suppress contributions from hadron decays to leptons.

5 Jets

Jets are reconstructed using the anti-\(k_t\) algorithm [45] with a jet-radius parameter of 0.4, from three-dimensional clustered energy deposits in the calorimeters [46]. Jets are required to have \(p_{\text {T}} >30\) \(\text {GeV}\) and \(|\eta |<4.4\), and must be separated from the lepton in \(\eta \)\(\phi \) space, \(\Delta R (\ell , j) \ge 0.3\). Quality requirements are imposed to remove events where jets are associated with noisy calorimeter cells. Jet energies are corrected for the presence of low-energy contributions from additional in-time or out-of-time collisions (pile-up), the non-compensating response of the calorimeter, detector material variations, and energy losses in uninstrumented regions. This calibration is performed in bins of \(p_{\text {T}} \) and \(\eta \), using correction factors determined using a combination of Monte Carlo simulations and in-situ calibrations with data [44, 47]. The systematic uncertainties in these correction factors are determined from the same control samples in data. A significant source of uncertainty in this analysis arises from the modelling of the \(\eta \) dependence of the jet energy response.

To suppress the contribution of jets from additional coincident pp collisions, the jet vertex fraction (JVF) [48] is used to reject central jets (\(|\eta |<2.4\)) that are not compatible with originating from the primary vertex. The JVF is defined as the scalar sum of the \(p_{\text {T}} \) values of tracks associated with both the primary vertex and the jet, divided by the summed \(p_{\text {T}} \) of all tracks associated with the jet. For the 7 \(\text {TeV}\) data taking, the requirement is \(|\mathrm {JVF}|\ge 0.75\); this requirement is loosened in 8 \(\text {TeV}\) data taking to \(|\mathrm {JVF}|\ge 0.5\) if the jet has \(p_{\text {T}} <50\) \(\text {GeV}\). The relaxed requirement in 8 \(\text {TeV}\) data is due to the larger pile-up rate causing signal events to be rejected when using the 7 \(\text {TeV}\) selection, and the requirement of \(|\eta |<2.4\) is to ensure the jets are within the ID tracking acceptance.

Jets that are consistent with originating from heavy-flavour quarks are identified using a neural network algorithm trained on input variables related to the impact parameter significance of tracks in the jet and the secondary vertices reconstructed from these tracks [49]. Jets are identified as b-jets with a selection on the output of the neural network corresponding to an identification efficiency of 80%.

6 Missing transverse momentum

In events with a leptonically decaying W boson, one expects large missing momentum in the transverse plane due to the escaping neutrino. The magnitude of this missing transverse momentum (\(E_{\text {T}}^{\text {miss}}\)) is constructed from the vector sum of muon momenta and three-dimensional energy clusters in the calorimeter [50, 51]. The clusters are corrected to account for the different response to hadrons compared to electrons or photons, as well as dead material and out-of-cluster energy losses. Additional tracking information is used to extrapolate low-momentum particles to the primary vertex to reduce the contribution from pile-up.

7 Event selection

The proton–proton collision data samples correspond to a total integrated luminosity of 4.7 fb\({}^{-1}\) for the 7 \(\text {TeV}\) data and 20.2 fb\({}^{-1}\) for the 8 \(\text {TeV}\) data with uncertainties of 1.8% [52] and 1.9% [53], respectively.

The measurements use data collected with single-electron and single-muon triggers. The triggers identify candidate muons by combining an ID track with a muon-spectrometer track, and candidate electrons by matching an inner detector track to an energy cluster in the calorimeter consistent with an electromagnetic shower. The triggers in the 7 \(\text {TeV}\) data require \(p_{\text {T}} > 18\) \(\text {GeV}\) for muons and either \(E_{\text {T}} > 20\) \(\text {GeV}\) or \(E_{\text {T}} > 22\) \(\text {GeV}\) for electrons, depending on the data-taking period. The 8 \(\text {TeV}\) data events are selected by two triggers in each channel. The electron-channel triggers have \(E_{\text {T}}\) thresholds of 24 and 60 \(\text {GeV}\), where the lower-threshold trigger includes a calorimeter isolation criterion: the measured \(E_{\text {T}}\) within a cone of radius \(R=0.2\) around the electron candidate, excluding the electron candidate’s \(E_{\text {T}}\), must be less than 10% of the \(E_{\text {T}}\) of the electron. The muon-channel triggers have \(p_{\text {T}}\) thresholds of 24 and 36 \(\text {GeV}\). The lower-threshold trigger has a track-isolation requirement, where the scalar summed \(p_{\text {T}} \) of tracks within a cone of radius \(R = 0.2\) around the muon is required to be less than 12% of the \(p_{\text {T}} \) of the muon.

The analysis defines many measurement regions varying in electroweak \(Wjj\) purity. Table 1 shows the regions at the generated particle level based on the variables defined below. Particle-level objects are reconstructed as follows: jets are reconstructed using the anti-\(k_t\) algorithm with a radius parameter of 0.4 using final-state particles with a proper lifetime longer than 10 ps; and leptons are reconstructed by combining the final-state lepton with photons within a cone of \(R=0.1\) around the lepton. The requirements in Table 1 are also used to select data events, except for the following differences: (1) electrons must have \(|\eta |<2.47\) and cannot be in the crack region of the calorimeter (\(1.37< |\eta | < 1.52\)); (2) muons must have \(|\eta |<2.4\); and (3) jets are selected using pseudorapidity (\(|\eta |<4.4\)) rather than rapidity. Also, a b-jet veto is applied to the validation region in data when performing the measurement of the fiducial electroweak \(Wjj\) cross section described in Sect. 5.

Table 1 Phase-space definitions at the generated particle level. Each phase-space region includes the preselection and the additional requirements listed for that region. The variables are defined in Sects. 3.1 and 3.2

7.1 Event preselection

Signal candidate events are initially defined by the presence of missing transverse momentum (\(E_{\text {T}}^{\text {miss}} >20\) \(\text {GeV}\)), exactly one charged lepton (electron or muon) candidate with \(p_{\text {T}} > 25\) \(\text {GeV}\), and at least two jets. The highest-\(p_{\text {T}} \) jet is required to have \(p_{\text {T}} ^{j_1} >80\) \(\text {GeV}\) and the second jet must have \(p_{\text {T}} ^{j_2} >60\) \(\text {GeV}\). To isolate events with a W boson, a veto is imposed on events with a second same-flavour lepton with \(p_{\text {T}} > 20\) \(\text {GeV}\); these leptons are identified in data using relaxed isolation and impact parameter criteria. A minimum cut on the transverse mass, \(m_{\text{ T }} >40\) \(\text {GeV}\), of the W-boson candidate is additionally imposed, where \(m_{\text{ T }} \) is defined by:

$$\begin{aligned} m_{\text{ T }} = \sqrt{\,2p_{\text {T}} \cdot E_{\text {T}}^{\text {miss}} \left[ 1-\cos \Delta \phi (\ell , E_{\text {T}}^{\text {miss}})\right] }. \end{aligned}$$

Jets are selected in data if they have \(|\eta |<4.4\) and \(\Delta R(j,\ell )>0.3\). A VBF topology is selected by requiring the invariant mass of the dijet system defined by the two highest-\(p_{\text {T}} \) jets to satisfy \(M_{jj} >500\) \(\text {GeV}\), and the absolute value of the rapidity separation of the jets to satisfy \(\Delta y(j_1,j_2) > 2\).

7.2 Definitions of the measurement regions

The above preselection defines an inclusive fiducial region, which is then split into four orthogonal fiducial regions defined by the presence or absence of the lepton or an additional jet in a “central” rapidity range between the two highest-\(p_{\text {T}} \) jets. The signal EW \(Wjj\) process is characterized by a lepton and no jets in the central rapidity range. This range is determined by the centrality variable \(C_{\ell }\) or \(C_j\) for the lepton or jets respectively:

$$\begin{aligned} C_{\ell ~(j)} \equiv \left| \frac{y_{\ell ~(j)} - \frac{y_1+y_2}{2}}{y_1-y_2}\right| , \end{aligned}$$
(1)

where \(y_{\ell ~(j)}\) is the rapidity of the candidate lepton (jet), and \(y_1\) and \(y_2\) are the rapidities of the highest-\(p_{\text {T}}\) (leading) and next-highest-\(p_{\text {T}}\) (subleading) jets. Requiring the centrality to be below a value \(C_{\text {max}}\) defines the selection of a rapidity range centred on the mean rapidity of the leading jets, i.e.,

$$\begin{aligned} \left[ \frac{y_1+y_2}{2}-C_{\text {max}}\times |y_1-y_2|,\quad \frac{y_1+y_2}{2}+C_{\text {max}}\times |y_1-y_2|\right] , \end{aligned}$$
(2)

as illustrated in Fig. 3. For \(C_{\text {max}}=0.5\), the interval spans the entire rapidity region between the two jets; the number of jets within this interval is denoted \(N_{\text {jets}}^{\text {gap}}\). In defining the electroweak \(Wjj\) signal region, \(C_{\text {max}}=0.4\) is used to count the number of leptons (\(N_{\text {lepton}}^{\text {cen}}\)) or jets (\(N_{\text {jets}}^{\text {cen}}\)) within the range. A value of \(C_{\text {max}}=0.4\) permits an event with the emission of an additional jet close to one of the two highest-\(p_{\text {T}}\) jets to be retained as a candidate signal event.

Fig. 3
figure 3

Illustration of the central region used to count leptons and jets in the definition of the signal, control, and validation regions. The rapidity range of the region corresponds to \(C_{\text {max}}=0.4\) in Eq. (2). An object in the direction of the dashed line has \(C = 0\)

Fig. 4
figure 4

Illustration of the relationship between the signal, control, and validation fiducial regions. The signal region is defined by both a veto on additional jets (beyond the two highest-\(p_{\text {T}} \) jets) and the presence of a lepton in the rapidity region defined in Eq. (2). The signal region is studied with either \(M_{jj} >0.5\) \(\text {TeV}\) or 1 \(\text {TeV}\). A forward-lepton/central-jet fiducial region is also defined, for which the centrality requirements on the jets and the lepton are inverted with respect to the signal region. The inclusive region corresponds to the union of all four regions, and is studied with \(M_{jj} >0.5,~1.0,~1.5,\) or 2.0 TeV. The quantities \(N_{\text {jets}}^{\text {cen}} \) and \(N_{\text {lepton}}^{\text {cen}} \) refer to the number of reconstructed leptons and additional jets reconstructed in the rapidity interval defined by Eq. (2) and illustrated in Fig. 3, with \(C_{\text {max}} = 0.4\)

The fiducial regions are illustrated in Fig. 4. The signal process is characterized by a W boson in the rapidity range spanned by the two jets (Fig. 1), with no jets in this range due to the absence of colour flow between the interacting partons. An event is therefore defined as being in the electroweak-enhanced signal region if the identified lepton is reconstructed in the rapidity region defined by Eq. (2) and no additional jets are reconstructed in this interval. A QCD-enhanced forward-lepton control fiducial region is defined by the requirement that neither the identified lepton nor any additional jets be present in the central rapidity interval. A second QCD-enhanced central-jet validation region is defined by events having both the identified lepton and at least one additional jet reconstructed in the central rapidity interval. These three orthogonal fiducial regions are used in Sect. 5 to extract the EW\(Wjj\) production cross section, constrain the modelling of QCD \(Wjj\)production from data, and validate the QCD \(Wjj\) modelling, respectively.

For the determination of unfolded differential cross sections presented in Sect. 6, four additional fiducial regions are studied: the inclusive region for the progressively more restrictive dijet invariant mass thresholds of 1.0, 1.5, and 2.0 \(\text {TeV}\), and an orthogonal forward-lepton/central-jet region defined by events with the lepton outside the central region, but at least one additional jet reconstructed in the interval. For the study of EW \(Wjj\) differential cross sections, the signal fiducial region with an increased dijet invariant mass requirement of \(M_{jj} >1\) \(\text {TeV}\) (high-mass signal region) is also analyzed; a further requirement that the leading-jet \(p_{\text {T}} \) be greater than 600 \(\text {GeV}\) defines a high- \(q^2\) region used for constraints on aTGCs (discussed in Sect. 7).

8 Modelling of signal and background processes

Simulated Monte Carlo (MC) samples are used to model \(Wjj\) production, with small data-derived corrections applied to reduce systematic uncertainties. Other processes producing a prompt charged lepton are also modelled with MC samples. The multijet background, where a photon or hadronic jet is misreconstructed as a prompt lepton, or where a lepton is produced in a hadron decay, is modelled using data.

8.1 Monte Carlo simulation

The measurements described in this paper focus on the electroweak production of \(Wjj\). This process has different kinematic properties to strong \(Wjj\) production, but there is nonetheless some small interference between the processes. The other significant background processes are top-quark, Z-boson, and diboson production, which are modelled with MC simulation. All MC samples used to model the data are passed through a detector simulation [54] based on geant4  [55]. Pile-up interactions are modelled with Pythia8 (v. 8.165) [56]. Table 2 lists the MC samples and the cross sections used in the MC normalization.

Table 2 Monte Carlo samples used to model the signal and background processes. The cross sections times branching fractions, \(\sigma \cdot \mathcal {B}\), are quoted for \(\sqrt{s} = 7\) and 8 \(\text {TeV}\). The branching fraction corresponds to the decay to a single lepton flavour, and here \(\ell \) refers to e, \(\mu \), or \(\tau \). The neutral current \(Z/\gamma ^*\) process is denoted by Z. To remove overlap between \(W(\rightarrow \tau \nu )\) + 2 jets and WW/WZ in 7 \(\text {TeV}\) samples, events with a generated \(\tau \) lepton are removed from the 7 \(\text {TeV}\) WW/WZ samples. Jets refer to a quark or gluon in the final state of the matrix-element calculation

9 \(Wjj\)

The primary model of the signal and background \(Wjj\) processes in the analysis is the next-to-leading-order (NLO) Powheg Monte Carlo generator [29, 36, 57, 58], interfaced with Pythia8 using the AU2 parameter values [59] for the simulation of parton showering, underlying event, and hadronization. Two final-state partons with \(p_{\text {T}} >20\) \(\text {GeV}\) are required for the signal. A generator-level suppression is applied in the background generation to enhance events with one parton with \(p_{\text {T}} > 80\) \(\text {GeV}\) and a second parton with \(p_{\text {T}} > 60\) \(\text {GeV}\), and the mass of the pair larger than 500 \(\text {GeV}\). Parton momentum distributions are modelled using the CT10 [60] set of parton distribution functions (PDFs). The QCD factorization and renormalization scales are set to the W-boson mass for the sample with jets produced via the electroweak interaction. For the sample with strongly produced jets, the hard-process scale is also the W-boson mass while the QCD emission scales are set with the multiscale-improved NLO (MiNLO) procedure [61] to improve the modelling and reduce the scale dependence. Uncertainties due to missing higher-order contributions are estimated by doubling and halving the factorization and renormalization scales independently, but keeping their ratio within the range 0.5–2.0. Uncertainties due to parton distribution functions are estimated using CT10 eigenvector variations rescaled to 68% confidence level, and an uncertainty due to the parton shower and hadronization model is taken from the difference between predictions using the Pythia8 and Herwig++  [62, 63] generators.

Measured particle-level differential distributions are also compared to the Sherpa (v. 1.4) [64] generation of QCD+EW \(Wjj\) production at leading-order accuracy, including interference. An uncertainty due to the neglect of interference in the EW \(Wjj\) measurement is estimated using this sample and individual Sherpa QCD and EW \(Wjj\) samples. The individual samples are also used to model the small contribution from \(W\rightarrow \tau \nu \) decays. Measured distributions of QCD+EW \(Wjj\) production are compared to the combined QCD+EW and to the QCD \(Wjj\) samples, the latter to demonstrate the effect of the EW \(Wjj\) process. The QCD \(Wjj\) sample is a \(W+(n)\)-parton prediction with \(n\le 4\) partons with \(p_{\text {T}} > 15\) \(\text {GeV}\) produced via QCD interactions. The EW \(Wjj\) sample has two partons produced via electroweak vertices, and up to one additional parton produced by QCD interactions. The CKKW matching scheme [65] is used to remove the overlap between different parton multiplicities at the matrix-element level. The predictions use the CT10 PDFs and the default parameter values for simulating the underlying event. Renormalization and factorization scales are set using the standard dynamical scale scheme in Sherpa. The interference uncertainty is cross-checked with the Madgraph  [28] generator interfaced to Pythia8.

For unfolded distributions with a low purity of electroweak \(Wjj\) production, an additional comparison is made to the all-order resummation calculation of hej (High Energy Jets) [33] for strong \(Wjj\) production. The calculation improves the accuracy of predictions in wide-angle or high-invariant-mass dijet configurations, where logarithmic corrections are significant. To allow a comparison to unfolded data and to other generators, the small electroweak \(Wjj\) contribution is added using Powheg interfaced to Pythia8 and the sum is labelled hej (qcd) + pow+py (ew).

Both the Powheg and Sherpa predictions for electroweak \(Wjj\) production omit the small contribution from diboson production processes, assuming negligible interference with these processes. Higher-order electroweak corrections to the background \(Wjj\) process are studied with OpenLoops [66, 67] and found to affect the measured fiducial cross section by \(<1\%\).

10 Other processes

Background contributions from top-quark, \(Z + 2\text { jets}\), and diboson processes are estimated using MC simulation.

The top-quark background consists of pair-production and single-production processes, with the latter including s-channel production and production in association with a b quark or W boson. Top-quark pair production is normalized using the cross section calculated at next-to-next-to-leading order (NNLO) in \(\alpha _{\text {S}} \), with resummation to next-to-next-to-leading logarithm (NNLL) using TOP++2.0 [68]. Kinematic distributions are modelled at NLO using the mc@nlo  [69] generator and the Herwig  [63, 70] parton shower model for 7 \(\text {TeV}\) data, and with Powheg and Pythia6 (v. 6.427) [71] for 8 \(\text {TeV}\) data; both use the CT10 PDF set. An uncertainty due to the parton shower model, and its interface to the matrix-element generator, is estimated by comparing the Powheg sample to an mc@nlo sample interfaced to Herwig. Single-top-quark production in the t-channel, \(t\bar{b}q \rightarrow \ell \nu b\bar{b}q\), is modelled using the leading-order generator AcerMC (v. 3.8) [72] interfaced with Pythia6 and the CTEQ6L1 [73] PDF set, and the sample is normalized using the cross sections calculated by the generator. Modelling of the s-channel production of a single top quark, \(t\bar{b} \rightarrow \ell \nu b\bar{b}\), and of the associated production of a top quark and a W boson are performed using AcerMC with Pythia6 in 7 \(\text {TeV}\) data and mc@nlo with Herwig in 8 \(\text {TeV}\) data. These samples are also normalized using the generator cross-section values.

Background from the \(Z + 2\text { jets}\) (Zjj) process, which contributes when one of the leptons is not reconstructed and the \(E_{\text {T}}^{\text {miss}}\) is large, is modelled using Sherpa and the CT10 PDF set. For the background with jets from QCD radiation, an inclusive Drell–Yan sample is produced at NLO [74] and merged with the leading-order (LO) production of additional partons (up to five). The background with jets produced purely through the electroweak interaction is modelled at leading order. This combination of samples is also used to model the \(W(\rightarrow \tau \nu )\) + 2 jets background; the 7 \(\text {TeV}\) sample includes WW and WZ production. The interference between the electroweak and QCD production of jets for these small backgrounds has a negligible impact on the measurements and is not modelled.

The diboson background processes \(WW/WZ\rightarrow \ell \nu q\bar{q}^{(')}\) and \(ZZ\rightarrow \ell \ell q\bar{q}\) provide only a small contribution at high dijet mass since the distribution peaks at the mass of the W or Z boson. The interference between the single and pair production of electroweak bosons is negligible for the mass range selected by the analysis. The diboson processes are modelled at leading order with Herwig++ and normalized to the NLO cross section [75]. The generation uses the CTEQ6L1 PDF set. In 7 \(\text {TeV}\) samples, \(W\rightarrow \tau \nu \) decays are removed since they are included in the \(Wjj\) samples.

10.1 Multijet background

Multijet production constitutes a background to the \(Wjj\) process when one of the jets is misidentified as a lepton and significant \(E_{\text {T}}^{\text {miss}}\) arises from either a momentum mismeasurement or the loss of particles outside the detector acceptance. Due to the very small fraction of multijet events with both of these properties, and their relatively poor modelling in simulation, a purely data-driven method is used to estimate this background. The method inverts certain lepton identification criteria (described below) to obtain a multijet-dominated sample for modelling kinematic distributions. The \(E_{\text {T}}^{\text {miss}}\) distribution is then fit to obtain a multijet normalization factor; this fit is performed separately in the signal, control, and validation regions. Systematic uncertainties are estimated by modifying the fit distribution and the identification criteria, and by propagating detector and theoretical uncertainties.

Modifications to the lepton identification criteria which enhance the multijet contribution are based on isolation and either the impact parameter with respect to the primary vertex (for muons) or the shower and track properties (for electrons). For the 7 TeV analysis, the impact parameter significance requirement is inverted in the muon channel (\(|d_0|/\sigma _{d_0} > 3\)). This preferentially selects muons from heavy-flavour hadron decays, a dominant source of muons in multijet events. For the 8 TeV analysis, no requirement on impact parameter significance is made and instead a track isolation requirement is applied orthogonal to the requirement for selected muons (\(0.15< \sum p_{\text {T}} ^{R=0.3}/ p_{\text {T}} < 0.35\)).

For the electron channel in \(\sqrt{s}=7\) TeV data, triggers requiring a loose electron candidate are used to obtain a multijet modelling sample. The electron candidate must satisfy medium criteria on track hit multiplicity and track–shower matching in \(\eta \), but must fail to satisfy at least one of the tight shower-based criteria. It also must not be isolated in the calorimeter: \(\sum E_{\text {T}} ^{R=0.3}/ E_{\text {T}} > 0.2\). In \(\sqrt{s}=8\) TeV data, electron candidates must satisfy medium selection criteria consistent with the trigger used in the analysis. As in the muon channel, a track isolation window is applied orthogonal to the requirement for selected electrons (\(0.05< \sum p_{\text {T}} ^{R=0.2}/ p_{\text {T}} < 0.1\)).

To normalize the multijet-dominated samples to the expected contribution with nominal lepton criteria, a fit to the \(E_{\text {T}}^{\text {miss}}\) distribution is performed. The fit simultaneously determines the multijet and strong \(Wjj\) normalizations in the region where the nominal lepton criteria are applied, taking the multijet distribution from the sample with inverted lepton identification criteria. Other contributions are fixed to their SM predictions, and the data are consistent with the post-fit distribution within uncertainties. The strong \(Wjj\) normalization is consistent with that found in the fit to the dijet mass distribution described in Sect. 5.

Systematic uncertainties in the multijet normalization arise from uncertainties in the kinematic modelling and in jet, lepton, and \(E_{\text {T}}^{\text {miss}}\) reconstruction. The modelling uncertainties dominate and are estimated using three methods: (1) modifying the lepton candidate selection for the kinematic distributions; (2) using \(m_{\text{ T }}\) as an alternative fit distribution; and (3) varying the kinematic range of the fit. For each method, the largest change in the normalization is taken as a systematic uncertainty and added in quadrature with reconstruction and modelling uncertainties for processes modelled with Monte Carlo simulation. The leading uncertainty arises from the change in multijet normalization when fitting the \(m_{\text{ T }}\) distribution instead of the \(E_{\text {T}}^{\text {miss}}\) distribution. The next largest uncertainty results from variations of the isolation and impact parameter requirements in the lepton selection used for the kinematic distributions. The total relative systematic uncertainty of the multijet normalization in the muon (electron) channel is 28% (67%) for the \(\sqrt{s}=7\) TeV analysis, and 36% (38%) for the \(\sqrt{s}=8\) TeV analysis. The relatively large uncertainty in the \(\sqrt{s}=7\) TeV electron channel results from a larger dependence on the fit distribution and range than in the other multijet fits.

10.2 Distributions and yields

The distributions of lepton centrality and the minimum centrality of additional jets, which are used to separate signal, control, and validation regions, are shown in Fig. 5 for the 7 and 8 TeV data and the corresponding SM predictions after the preselection. The comparisons of the SM predictions to data show general agreement within the estimated uncertainties. The predictions include correction factors for lepton identification and triggering, and the bands correspond to the combination of statistical and experimental uncertainties. The signal-region dijet mass distributions, used to fit for the signal yield in the fiducial and total cross-section measurements, are shown in Fig. 6 for both data sets. The figure also shows the dijet rapidity difference, which is correlated with dijet mass and demonstrates an enhancement in signal at high values. Table 3 details the data and SM predictions for the individual processes in the signal region, and Table 4 shows the total predictions and the observed data in each of the fiducial regions defined in Sect. 3.

Fig. 5
figure 5

Predicted and observed distributions of the lepton centrality (top) and the minimum centrality of additional jets (bottom) for events in the inclusive fiducial region (i.e. after preselection) in 7 \(\text {TeV}\) (left) and 8 \(\text {TeV}\) (right) data. The arrows in the lepton-centrality distributions separate the signal-region selection (to the left) from the control-region selection (to the right). The arrows in the jet-centrality distributions separate the signal-region selection (to the right) from the validation-region selection (to the left). The bottom panel in each distribution shows the ratio of data to the prediction. The shaded band represents the statistical and experimental uncertainties summed in quadrature

Fig. 6
figure 6

Predicted and observed distributions of the dijet invariant mass (top) and \(\Delta y(j_1,j_2) \) (bottom) for events in the signal region in 7 \(\text {TeV}\) (left) and 8 \(\text {TeV}\) (right) data. The bottom panel in each distribution shows the ratio of data to the prediction. The shaded band represents the statistical and experimental uncertainties summed in quadrature

Table 3 Observed data and predicted SM event yields in the signal region. The MC predictions are normalized to the theoretical cross sections in Table 2. The relative uncertainty of the total SM prediction is \(\mathcal{{O}}\)(10%)
Table 4 Observed data and total predicted SM event yields in each measurement region. The MC predictions are normalized to the theoretical cross sections times branching ratios in Table 2. The relative uncertainty of the total SM prediction is \(\mathcal{{O}}\)(10%)

11 Fiducial and total electroweak \(Wjj\) cross sections

The measurement of the fiducial EW \(Wjj\) cross section in the signal region uses a control-region constraint to provide a precise determination of the electroweak production cross section for W bosons produced in association with dijets at high invariant mass. The measurement is performed with an extended joint binned likelihood fit [76] of the \(M_{jj}\) distribution for the normalization factors of the QCD \(Wjj\) and EW \(Wjj\) Powheg + Pythia8 predictions, \(\mu _{\text {QCD}}\) and \(\mu _{\text {EW}}\) respectively, defined as follows:

$$\begin{aligned} (\sigma _i^{\ell \nu jj} \times \mathcal{{A}}_i)^\mathrm {meas}= & {} \mu _i \cdot (\sigma _i^{\ell \nu jj} \times \mathcal{{A}}_i)^\mathrm {theo} \\= & {} \frac{N_i}{\mathcal{{C}}_i \mathcal{{L}}}, \end{aligned}$$

where \(\sigma ^{\ell \nu jj}_i\) is the cross section of process i (QCD \(Wjj\) or EW \(Wjj\) production in a single lepton channel), \(\mathcal{{A}}_i\) is the acceptance for events to pass the signal selection at the particle level (see Table 1), \(N_i\) is the number of measured events, \(\mathcal{{L}}\) is the integrated luminosity, and \(\mathcal{{C}}_i\) is the ratio of reconstructed to generated events passing the selection and accounts for experimental efficiencies and resolutions. The fit includes a Gaussian constraint for all non-\(Wjj\) backgrounds, and accounts only for statistical uncertainties in the expected yield. The fit result for \(\mu _{\text {EW}}\) is translated into a fiducial cross section by multiplying \(\mu _{\text {EW}}\) by the predicted fiducial cross section from Powheg + Pythia8. In addition, the total cross section for jets with \(p_{\text {T}} > 20\) \(\text {GeV}\) is calculated by dividing the fiducial cross section by \(\mathcal{{A}}\) for the EW \(Wjj\) process.

The dijet mass provides the discriminating fit distribution. The region at relatively low invariant mass (\({\approx }\)500–1000 \(\text {GeV}\)) has low signal purity and primarily determines \(\mu _{\text {QCD}}\), while events with higher invariant mass have higher signal purity and mainly determine \(\mu _{\text {EW}}\). The interference between the processes is not included in the fit, and is instead taken as an uncertainty based on SM predictions.

The uncertainty in the shape of the QCD \(Wjj\) distribution dominates the measurement, but is reduced by using the forward-lepton control region to correct the modelling of the \(M_{jj}\) shape. This control region is defined in Table 1 and uses the same selection as the signal region, except for the inversion of the central-lepton requirement. This section describes the application of the control-region constraint, the uncertainties in the measurement, and the results of the fit.

11.1 Control-region constraint

The SM prediction of the dijet mass distribution receives significant uncertainties from the experimental jet energy scale and resolution. These uncertainties are constrained with a correction to the predicted distribution derived using data in a control region where the signal contribution is suppressed. This forward-lepton control region is selected using the lepton centrality distribution. Residual uncertainties arise primarily from differences in the dijet mass spectrum between the control region and the signal region.

To derive the \(M_{jj}\) correction, all processes other than strong \(Wjj\) production are subtracted from the data and the result is compared to the prediction (Fig. 7). The correction is then determined with a linear fit to the ratio of the subtracted data to the \(Wjj\) prediction. The slopes of the fits in 7 and 8 \(\text {TeV}\) data are consistent with zero; they are \((0.2 \pm 1.1)\)%/\(\text {TeV}\) and \((0.28 \pm 0.43)\)%/\(\text {TeV}\), respectively, where the uncertainties are statistical only. The effect of a slope correction of 1%/\(\text {TeV}\) is approximately 0.1 in the measured \(\mu _{\text {EW}} \).

Fig. 7
figure 7

Comparison of the predicted QCD \(Wjj\) dijet mass distribution to data with background processes subtracted, for events in the forward-lepton control region in 7 \(\text {TeV}\) (left) and 8 \(\text {TeV}\) (right) data. The bottom panel in each distribution shows the ratio of data to the QCD \(Wjj\) prediction, and the result of a linear fit to the ratio. The error bars represent statistical and experimental uncertainties summed in quadrature

Systematic uncertainties in the corrected dijet mass distribution in the signal and validation regions are estimated by varying each source of uncertainty up or down by \(1\sigma \) and calculating the corresponding slope correction in the control region in the simulation. This correction is applied to the prediction in the signal region and the fit performed on pseudodata derived from the nominal prediction. The resulting change in \(\mu _{\text {EW}} \) is taken as the corresponding systematic uncertainty. The method is illustrated in the central-jet validation region in Fig. 8, where the background-subtracted and corrected \(Wjj\) dijet mass distribution is compared to data. The ratio of subtracted data to the corrected \(Wjj\) prediction is consistent with a line of zero slope when considering statistical and experimental uncertainties (the dotted lines in the figure).

Fig. 8
figure 8

Comparison of the corrected QCD \(Wjj\) background dijet mass distribution to data with background processes subtracted, for events in the central-jet validation region in 7 \(\text {TeV}\) (left) and 8 \(\text {TeV}\) (right) data. The bottom panel in each subfigure shows the ratio of data to prediction, and the result of a linear fit to the ratio (solid line). The error bars represent statistical and experimental uncertainties summed in quadrature. The dotted lines show the fit with slope adjusted up and down by statistical and experimental uncertainties

11.2 Uncertainties in \(\mu _{\text {EW}} \)

Uncertainties in \(\mu _{\text {EW}} \) consist of: statistical uncertainties in the fit to the normalizations of the signal and background \(Wjj\) processes in the signal region; the statistical uncertainty of the correction from the control region; and experimental and theoretical uncertainties affecting the signal and background predictions. Table 5 summarizes the uncertainties in the measurement of \(\mu _{\text {EW}}\).

Table 5 The statistical and systematic uncertainty contributions to the measurements of \(\mu _{\text {EW}} \) in 7 and 8 TeV data

The total statistical uncertainty in \(\mu _{\text {EW}} \) of the joint likelihood fit is 0.16 (0.052) in 7 (8) \(\text {TeV}\) data, where the leading uncertainty is the statistical uncertainty of the data in the control region rather than in the signal region.

Systematic uncertainties affecting the MC prediction are estimated by varying each uncertainty source up and down by \(1\sigma \) in all MC processes, fitting the ratio of the varied QCD \(Wjj \) prediction to the nominal prediction in the control region, and performing the signal region fit using the varied samples as pseudodata and the nominal samples as the templates. The largest change in \(\mu \) from the up and down variations is taken as a symmetric uncertainty. The dominant experimental uncertainty in \(\mu _{\text {EW}} \) is due to the calibration of the \(\eta \) dependence of the jet energy scale, and is 0.124 (0.053) in 7 (8) \(\text {TeV}\) data. Other uncertainties in the jet energy scale (JES) and resolution (JER) are of similar size when combined, with the largest contribution coming from the uncertainty in modelling the ratio of responses to quarks and gluons. Uncertainties due to multijet modelling are estimated by separately varying the normalization and distribution of the multijet background in each phase-space region and combining the effects in quadrature.

Theoretical uncertainties arise from the statistical uncertainty on the MC predictions; the lack of interference between signal and background \(Wjj\) processes in the MC modelling; \(Wjj\) renormalization and factorization scale variations and parton-shower modelling, which affect the acceptance of the jet centrality requirement; parton distribution functions; and cross-section uncertainties. The uncertainty due to MC statistics is 0.040 (0.032) in 7 (8) \(\text {TeV}\) data. The interference uncertainty is estimated by including the Sherpa leading-order interference model as part of the background \(Wjj\) process and affects the measurement of \(\mu _{\text {EW}} \) by 0.037 (0.032) in 7 (8) \(\text {TeV}\) data. Uncertainties due to PDFs are 0.053 (0.052) for 7 (8) \(\text {TeV}\) data. Scale and parton-shower uncertainties are \(\approx 0.04\) in both the 7 and 8 \(\text {TeV}\) measurements. The scale uncertainty in EW \(Wjj\) production is larger at \(\sqrt{s}=8\) TeV than at 7 \(\text {TeV}\) because of the increasing uncertainty with dijet mass and the higher mean dijet mass at 8 \(\text {TeV}\). The scale uncertainty in QCD \(Wjj\) production is larger at \(\sqrt{s}=7\) \(\text {TeV}\) because the data constraint has less statistical power than at 8 \(\text {TeV}\).

Finally, a 0.076 (0.061) uncertainty in the signal cross section at 7 (8) \(\text {TeV}\) due to higher-order QCD corrections and non-perturbative modelling is estimated using scale and parton-shower variations, affecting the measurement of \(\mu _{\text {EW}} \) but not the extracted cross sections.

11.3 Electroweak \(Wjj\) cross-section results

The dijet mass distributions in 7 and 8 \(\text {TeV}\) data after fitting for \(\mu _{\text {EW}}\) and \(\mu _{\text {QCD}}\) are shown in Fig. 9. There is good overall agreement between the normalized distributions and the data. The fit results for \(\mu _{\text {QCD}}\) are \(1.16 \pm 0.07\) for 7 \(\text {TeV}\) data, and \(1.09 \pm 0.05\) for 8 \(\text {TeV}\) data. The measured values of \(\mu _{\text {EW}}\) are consistent between electron and muon channels, with the following combined results:

$$\begin{aligned} \mu _{\text {EW}} ~(7~\mathrm {{\text {TeV}}})= & {} 1.00 \pm 0.16~\mathrm {(stat)}~\pm 0.17~\mathrm {(exp)}~\pm 0.12~\mathrm {(th)}, \\ \mu _{\text {EW}} ~(8~\mathrm {{\text {TeV}}})= & {} 0.81 \pm 0.05~\mathrm {(stat)}~\pm 0.09~\mathrm {(exp)}~\pm 0.10~\mathrm {(th)}. \end{aligned}$$
Fig. 9
figure 9

Distributions of the dijet invariant mass for events in the signal region in 7 \(\text {TeV}\) (left) and 8 \(\text {TeV}\) (right) data, after fitting for the yields of the individual \(Wjj\) processes. The bottom panel in each distribution shows the ratio of data to predicted signal-plus-background yields. The shaded band centred at unity represents the statistical and experimental uncertainties summed in quadrature

The measured value of \(\mu _{\text {EW}} \) has a total uncertainty of 0.26 (0.14) in 7 (8) \(\text {TeV}\) data, and differs from the SM prediction of unity by \(<0.1\sigma \) (\(1.4\sigma \)). In the absence of a control region, the uncertainty would increase to 0.37 (0.18) in 7 (8) \(\text {TeV}\) data.

The fiducial signal region is defined by the selection in Table 1 using particle-level quantities after parton showering. The measured and predicted cross sections times branching ratios in this region are shown in Table 6. The acceptance is calculated using Powheg + Pythia8 with a dominant uncertainty due to the parton-shower modelling which is estimated by taking the difference between Powheg + Pythia8 and Powheg + Herwig++. The uncertainty in the predicted fiducial cross section at \(\sqrt{s} = 8\) \(\text {TeV}\) includes a 4 fb contribution from scale variations and an 11 fb contribution from parton-shower modelling.

A summary of this measurement and other measurements of boson production at high dijet invariant mass is shown in Fig. 10, normalized to SM predictions. The measurement with the smallest relative uncertainty is the 8 TeV \(Wjj \) measurement presented here.

Table 6 Measured fiducial cross sections of electroweak \(Wjj\) production in a single lepton channel, compared to predictions from Powheg + Pythia8. The acceptances and the inclusive measured production cross sections with \(p_{\text {T}} > 20\) GeV jets are also shown
Fig. 10
figure 10

Measurements of the cross section times branching fractions of electroweak production of a single W, Z, or Higgs boson at high dijet invariant mass, divided by the SM predictions (Powheg +Pythia8 for ATLAS, Madgraph +Pythia8 for CMS, and Powheg +Pythia8 for the LHC combination). The lighter shaded band (where shown) represents the statistical uncertainty of the measurement, the outer darker band represents the total measurement uncertainty. Theoretical uncertainties in the SM prediction are represented by the shaded region centred at unity

12 Differential cross sections

Differential cross section measurements provide valuable information on the observed kinematic properties of a process, testing the theoretical predictions and providing model-independent results to probe for new physics. This section presents differential measurements in the \(\sqrt{s}=8\) \(\text {TeV}\) data that discriminate EW \(Wjj\) from QCD \(Wjj\) production, after first introducing the unfolding procedure, uncertainties, and the fiducial measurement regions. The large event yields allow more precise tests of these distributions than other VBF measurements and provide the most comprehensive tests of predictions in VBF-fiducial regions. Distributions sensitive to anomalous triple gauge couplings are also presented and extend to values of momentum transfer approaching 1 \(\text {TeV}\), directly probing these energies for the presence of new interactions. Additional distributions are provided in Appendix A, and the complete set of measurements is available in hepdata [77].

All differential production cross sections are measured both as absolute cross sections and as distributions normalized by the cross section of the measured fiducial region (\(\sigma ^\mathrm {fid}_W\)). The normalizations are performed self-consistently, i.e. data measurements are normalized by the total fiducial data cross section and MC predictions are normalized by the corresponding MC cross section. Many sources of uncertainty are reduced for normalized distributions, allowing higher-precision tests of the modelling of the shape of the measured observables.

Unfolded differential cross-section measurements are performed for both QCD+EW \(Wjj\) and EW \(Wjj\) production and compared to theoretical predictions from the Powheg + Pythia8, Sherpa, and hej event generators, which are described in Sect. 4.1. The reported cross sections are for a single lepton flavour and are normalized by the width of the measured bin interval.

12.1 Unfolding and uncertainties

The MC simulations are used to correct the cross sections for detector and event selection inefficiencies, and for the effect of detector resolutions. An implementation [78] of a Bayesian iterative unfolding technique [79] is used to perform these corrections. The unfolding is based on a response matrix from the simulated events which encodes bin-to-bin migrations between a particle-level differential distribution and the equivalent reconstruction-level distribution. The matrix gives transition probabilities from particle level to reconstruction level, and Bayes’ theorem is employed to calculate the inverse probabilities. These probabilities are used in conjunction with a prior particle-level signal distribution, which is taken from the Powheg + Pythia8 simulations, to unfold the background-subtracted reconstruction-level data distributions. After this first unfolding iteration the unfolded data distribution is used as the new prior and the process repeated for another iteration. The unfolding procedure is validated by unfolding the Sherpa simulation using the Powheg + Pythia8 response matrix. For all distributions the unfolded and initial particle-level Sherpa predictions agree within the unfolding uncertainty assigned. Bin boundaries in unfolded distributions are chosen to ensure that \({>}66\%\) of particle-level events remain within the same interval at reconstruction level.

The sources of uncertainty discussed in Sect. 5 are assessed for the unfolded differential production cross sections. Figures are shown with statistical uncertainties as inner bars and total uncertainties as the outer bars. Statistical uncertainties are estimated using pseudoexperiments, with correlations between bins determined using a bootstrap method [80]. The \(W\rightarrow e\nu \) and \(W\rightarrow \mu \nu \) channels are found to be statistically compatible, and are combined. Theoretical uncertainties include the effects of scale and PDF variations on the prior distribution and on the response matrix. For unfolding EW \(Wjj\) production, additional theoretical uncertainties arise from modelling the QCD \(Wjj\) contribution subtracted from the data, and from the neglect of interference between the strong and electroweak \(Wjj\) processes. The interference uncertainty is estimated using the same procedure as for the fiducial measurement (Sect. 5), i.e. by adding the Sherpa interference model to the background prediction. The interference uncertainty is shown explicitly as a shaded area in each bin of the measured distributions. An uncertainty in the unfolding procedure is estimated by reweighting the simulation such that the distributions match the unfolded data, and then unfolding the data with the reweighted simulation; the change in the unfolded measurement is symmetrized and taken as an uncertainty. Experimental uncertainties are assessed by unfolding the data distributions using a modified response matrix and prior incorporating the change in detector response.

Fig. 11
figure 11

Relative uncertainties in example unfolded differential cross sections for the combined QCD+EW \(Wjj\) processes. The examples are: the number of jets in the rapidity gap between the two highest-\(p_{\text {T}} \) jets in the inclusive region (top left); the lepton centrality distribution in the inclusive \(M_{jj} >1\) TeV region (top right); \(M_{jj} \) in the inclusive region (bottom left); and the dijet \(p_{\text {T}} \) in the signal region (bottom right). Dominant contributions to the total systematic uncertainty are highlighted separately

Fig. 12
figure 12

Relative uncertainties in example unfolded differential cross sections for the EW \(Wjj\) processes. The examples are \(M_{jj} \) (top left) and \(\Delta y(j_1,j_2) \) (top right) in the high-mass signal region; \(M_{jj} \) in the \(M_{jj} >1\) TeV inclusive region (bottom left); and leading-jet \(p_{\text {T}} \) in the high-mass signal region (bottom right). Dominant contributions to the total systematic uncertainty are highlighted separately

Figures 11 and 12 summarize the uncertainty contributions to example unfolded data distributions for QCD+EW \(Wjj\) and EW \(Wjj\) production, respectively. For measurements of combined QCD+EW \(Wjj\) production, the jet energy scale and resolution uncertainties dominate the total uncertainty except in regions where statistical uncertainties are significant. The unfolding uncertainty is typically relevant in these regions and in regions dominated by QCD \(Wjj\) production where the statistical uncertainties are small. In measurements of EW \(Wjj\) production, uncertainties in the modelling of strong \(Wjj\) production are particularly important at low dijet invariant mass, where the EW \(Wjj\) signal purity is lowest. Interference uncertainties become dominant at low dijet rapidity separation but are otherwise not the leading contribution to the total uncertainty. A recent study [81] of interference in Z+jets VBF topologies, incorporating NLO electroweak corrections, predicted similar behaviour. For the bulk of the EW \(Wjj\) distributions, the leading sources of uncertainty are statistical, QCD \(Wjj\) modelling, and jet energy scale and resolution, and contribute roughly equally.

12.2 Fiducial regions and integrated cross sections

The differential cross sections of the combined \(Wjj\) processes are measured in the following nine fiducial regions:

  • the four mutually orthogonal fiducial regions defined in Fig. 4, three of which are electroweak-suppressed (\({<}\)5% contribution) and one electroweak-enhanced (15–20% contribution);

  • an additional electroweak-enhanced signal region with \(M_{jj} >1.0\) \(\text {TeV}\) (35–40% electroweak \(Wjj\) contribution); and

  • four inclusive fiducial regions defined by the preselection requirements in Table 1 with \(M_{jj} > 0.5,~1.0,~1.5\) and 2.0 \(\text {TeV}\).

The inclusive fiducial regions probe the observables used to distinguish EW and QCD \(Wjj\) production, namely lepton and jet centrality, and the number of jets radiated in the rapidity gap between the two leading jets. The four successively higher invariant mass thresholds increasingly enhance the EW \(Wjj\) purity of the differential distributions, without lepton and jet topology requirements.

The combined QCD+EW \(Wjj\) production is measured in all regions to test the modelling of QCD \(Wjj\) production in a VBF topology. In regions sensitive to EW \(Wjj\) contributions, the prediction for QCD \(Wjj\) only is shown along with the combined QCD+EW \(Wjj\) prediction in order to indicate the effect of the EW \(Wjj\) process. Differential measurements of EW \(Wjj\) production are performed in regions with \(M_{jj} >1.0\) \(\text {TeV}\), where the expected EW \(Wjj\) fraction is \({>}20\%\). The QCD \(Wjj\) background is subtracted using the multiplicative normalization factor of \(\mu _{\text {QCD}} = 1.09 \pm 0.02\) (stat) determined from the fits in Sect. 5. This substantially reduces the normalization uncertainty, confining theoretical uncertainties to the shapes of the background distributions.

Performing a complete unfolding of the EW \(Wjj\) signal process leads to better precision on the unfolded data, particularly in the case of normalized distributions, than could be achieved by subtracting the particle-level QCD \(Wjj\) production background from unfolded QCD+EW \(Wjj\) production data. All EW \(Wjj\) differential measurements are nonetheless also performed as combined QCD+EW \(Wjj\) production measurements so that such a subtraction could be performed with other QCD \(Wjj\) predictions.

Integrated cross sections for \(Wjj\) production are determined in each fiducial region. Table 7 and Fig. 13 show the measured integrated production cross sections for a single lepton flavour (\(\sigma ^\mathrm {fid}_{W}\)) for QCD+EW \(Wjj\) production and, in high dijet invariant-mass regions, for EW \(Wjj\) production. Also shown is the value of the EW \(Wjj\) cross section extracted from the constrained fit described in Sect. 5.3. All measurements are broadly compatible with predictions from Powheg + Pythia8. In fiducial regions dominated by QCD \(Wjj\) production the measured cross sections are approximately 15–20% higher than predictions. The integrated EW \(Wjj\) production cross sections have larger relative uncertainties than the precisely constrained fiducial EW \(Wjj\) cross-section measurement.

Table 7 Integrated fiducial cross sections for QCD+EW and EW \(Wjj\) production and the equivalent predictions from Powheg + Pythia8. The uncertainties displayed are the values of the statistical and systematic uncertainties added in quadrature
Fig. 13
figure 13

Integrated production cross sections for QCD+EW \(Wjj\) (solid data points) and EW \(Wjj\) (open data points) production in each measured particle-level fiducial region in a single lepton channel; EW \(Wjj\) production is only measured in fiducial regions where there is sufficient purity. For each measurement the error bar represents the statistical and systematic uncertainties summed in quadrature. Comparisons are made to predictions from Powheg + Pythia8 and the bottom pane shows the ratio of data to these predictions

The measurements of electroweak \(Wjj \) fiducial cross sections are compared to measurements of electroweak Zjj production and VBF Higgs boson production in Fig. 14. These other measurements are extrapolated to lower dijet mass (for Zjj production) or to inclusive production (for Higgs boson production) so their apparent cross sections are generally increased relative to the \(Wjj\) fiducial cross sections.

Fig. 14
figure 14

Measurements of the cross sections times branching fractions of electroweak production of a single W, Z, or Higgs boson with two jets at high dijet invariant mass and in fiducial measurement regions. For each measurement the error bar represents the statistical and systematic uncertainties summed in quadrature. Shaded bands represent the theory predictions. The \(M_{jj}\) threshold defining the fiducial Zjj region differs between ATLAS and CMS, leading to different inclusive cross sections

12.3 Observables distinguishing QCD \(Wjj\) and EW \(Wjj\)

Differential measurements are performed in the following distributions that provide discrimination between strong and electroweak \(Wjj\) production:

  • \(M_{jj} \), the invariant mass of the two highest-\(p_{\text {T}} \) jets;

  • \(\Delta y(j_1,j_2) \), the absolute rapidity separation between the two highest-\(p_{\text {T}} \) jets;

  • \(C_\ell \), lepton centrality, the location in rapidity of the lepton relative to the average rapidity of the two highest-\(p_{\text {T}}\) jets, defined in Eq. (1);

  • \(C_j\), jet centrality, the location in rapidity of any additional jet relative to the average rapidity of the two highest-\(p_{\text {T}}\) jets, defined in Eq. (1); and

  • \(N_{\text {jets}}^{\text {gap}} \), the number of additional jets in the rapidity gap bounded by the two highest-\(p_{\text {T}} \) jets (i.e., jets with \(C_j < 0.5\)).

The first two observables use the dijet system to distinguish the t-channel VBF topology from the background. The remaining observables use the rapidity of other objects relative to the dijet rapidity gap, exploiting the colourless gauge boson exchange to distinguish the EW \(Wjj\) signal from the QCD \(Wjj\) background. Figure 15 shows the Powheg + Pythia8 and Sherpa predictions of the fraction of \(Wjj\) events produced via electroweak processes, as a function of the dijet invariant mass in the signal fiducial region and the number of jets emitted in the dijet rapidity gap for the inclusive fiducial region with \(M_{jj} >0.5\) \(\text {TeV}\).

Fig. 15
figure 15

Fraction of EW \(Wjj\) signal relative to the combined QCD+EW \(Wjj\) production, predicted by Powheg + Pythia8 and Sherpa simulations for observables in the signal (left) and inclusive (right) fiducial regions

12.3.1 Dijet observables

The best discrimination between QCD and EW \(Wjj\) production is provided by the dijet mass distribution, as demonstrated in the top plots of Fig. 16. The distribution of dijet rapidity separation is correlated with this distribution but is purely topological. The discrimination provided by \(\Delta y(j_1,j_2) \) is shown in the bottom plots of the figure for \(M_{jj} >0.5\) and 1 \(\text {TeV}\).

Fig. 16
figure 16

Top Unfolded absolute (left) and normalized (right) differential \(Wjj\) production cross sections as a function of dijet mass for the signal fiducial region. Bottom Unfolded normalized production cross sections as a function of \(\Delta y(j_1,j_2) \) for the signal regions with \(M_{jj} > 0.5\) \(\text {TeV}\) (left) and \(M_{jj} > 1.0\) \(\text {TeV}\) (right). Both statistical (inner bar) and total (outer bar) measurement uncertainties are shown, as well as ratios of the theoretical predictions to the data (the bottom panel in each distribution)

The QCD \(Wjj\) modelling of the dijet distributions is important for extracting the cross section for EW \(Wjj\) production. The modelling of the \(M_{jj}\) distribution in regions dominated by QCD \(Wjj\) production is shown in Fig. 17. Predictions from hej, which are expected to provide a good description at high dijet invariant mass where large logarithms contribute, are similar to the NLO predictions from Powheg + Pythia8. Sherpa predicts more events at high dijet invariant mass than observed in data in these fiducial regions, whereas Powheg + Pythia8 and hej are in better agreement with data. The dijet rapidity separation (Fig. 18) shows similar behavior, with Sherpa overestimating the rate at large separation. The hej distributions have larger deviations from the data due to the reduced accuracy of resummation at small \(\Delta y(j_1,j_2) \).

Fig. 17
figure 17

Unfolded normalized differential \(Wjj\) production cross sections as a function of dijet invariant mass in the inclusive, forward-lepton/central-jet, forward-lepton, and central-jet fiducial regions. Both statistical (inner bar) and total (outer bar) measurement uncertainties are shown, as well as ratios of the theoretical predictions to the data (the bottom panel in each distribution)

Fig. 18
figure 18

Unfolded normalized differential \(Wjj\) production cross sections as a function of \(\Delta y(j_1,j_2) \) in the inclusive, forward-lepton/central-jet, forward-lepton, and central-jet fiducial regions. Both statistical (inner bar) and total (outer bar) measurement uncertainties are shown, as well as ratios of the theoretical predictions to the data (the bottom panel in each distribution)

The dijet distributions are generally well modelled for the EW \(Wjj\) process, as shown in Fig. 19 for the inclusive and signal regions with \(M_{jj} >1.0\) \(\text {TeV}\). The reduced purity in the inclusive region causes larger measurement uncertainties, and the measurements have larger absolute discrepancies with respect to predictions. The interference uncertainty is largest at low \(\Delta y(j_1,j_2) \), where the topology is less VBF-like.

Fig. 19
figure 19

Unfolded normalized differential EW \(Wjj\) production cross sections as a function of the dijet invariant mass (top) and \(\Delta y(j_1,j_2) \) (bottom) for the inclusive (left) and signal (right) fiducial regions with \(M_{jj} > 1.0\) \(\text {TeV}\). Both statistical (inner bar) and total (outer bar) measurement uncertainties are shown, as well as ratios of the theoretical predictions to the data (the bottom panel in each distribution)

12.3.2 Object topology relative to the rapidity gap

The event topology distinguishes electroweak VBF production from other processes, in particular the lack of hadronic activity in the rapidity gap between the leading two jets and the tendency for the boson to be emitted within this gap. These topological features are studied using the distributions of the jet multiplicity in the gap, the fraction of events with no jets with the gap, and the rapidity of the lepton and jets relative to the gap.

Figure 20 shows the normalized differential cross section as a function of the number of \(p_{\text {T}} >30\) \(\text {GeV}\) jets emitted into the rapidity gap for progressively increasing \(M_{jj}\) thresholds. In the lowest invariant-mass fiducial region, strong \(Wjj\) production dominates and predictions from Powheg + Pythia8, Sherpa, and hej all describe the data well. As the dijet invariant mass threshold is increased, the differences in shape between predictions with and without the EW \(Wjj\) contribution become apparent. The corresponding differential measurements for EW \(Wjj\) production are shown in Fig. 21 for the inclusive regions with \(M_{jj} > 1.0\) and 2.0 \(\text {TeV}\). The measured fraction of EW \(Wjj\) events with no additional central jets is higher than that of QCD+EW \(Wjj\) events, as also demonstrated in Table 8. The table shows that the measured zero-jet fraction, frequently referred to as the jet-veto efficiency, is consistent with the Powheg + Pythia8 QCD+EW \(Wjj\) prediction for progressively increasing \(M_{jj} \). As \(M_{jj} \) increases the relative contribution of the EW \(Wjj\) process increases substantially.

Fig. 20
figure 20

Unfolded normalized distribution of the number of jets with \(p_{\text {T}} >30\) \(\text {GeV}\) in the rapidity interval bounded by the two highest-\(p_{\text {T}} \) jets in the inclusive fiducial region with \(M_{jj} \) thresholds of 0.5 \(\text {TeV}\) (top left), 1.0 \(\text {TeV}\) (top right), 1.5 \(\text {TeV}\) (bottom left), and 2.0 \(\text {TeV}\) (bottom right). Both statistical (inner bar) and total (outer bar) measurement uncertainties are shown, as well as ratios of the theoretical predictions to the data (the bottom panel in each distribution)

Fig. 21
figure 21

Unfolded normalized differential EW \(Wjj\) production cross sections as a function of the number of jets with \(p_{\text {T}} >30\) \(\text {GeV}\) in the rapidity interval bounded by the two highest-\(p_{\text {T}} \) jets in the inclusive fiducial region, with \(M_{jj} > 1.0\) \(\text {TeV}\) (left) and \(M_{jj} > 2.0\) \(\text {TeV}\) (right). Both statistical (inner bar) and total (outer bar) measurement uncertainties are shown, as well as ratios of the theoretical predictions to the data (the bottom panel in each distribution)

Table 8 Jet-veto efficiency for each \(M_{jj} \) threshold compared to Powheg + Pythia8 QCD+EW and QCD \(Wjj\) simulations. The uncertainties comprise statistical and systematic components added in quadrature

Jet centrality is related to the number of jets in the rapidity gap, as events with \(C_j < 0.5\) have a jet within the gap. Figure 22 shows good agreement between the predictions and data in the QCD+EW \(Wjj\) differential cross section weighted by the mean number of gap jets. Since the rate for additional jet production is low in EW \(Wjj\) production, there are too few events to perform a measurement of the jet centrality distribution for this process.

Fig. 22
figure 22

Unfolded normalized differential QCD+EW \(Wjj\) production cross sections as a function of jet centrality for the inclusive fiducial region with \(M_{jj} > 0.5\) \(\text {TeV}\) (left) and 1.0 \(\text {TeV}\) (right). Both statistical (inner bar) and total (outer bar) measurement uncertainties are shown, as well as ratios of the theoretical predictions to the data (the bottom panel in each distribution)

The lepton centrality distribution indirectly probes the rapidity of the W boson relative to the dijet rapidity interval. The differential cross section in the inclusive region as a function of lepton centrality is shown in Fig. 23 for three \(M_{jj} \) thresholds. All QCD+EW \(Wjj\) predictions adequately describe the lepton centrality in the region with the lowest dijet mass threshold, which is dominated by QCD \(Wjj\) production. As the \(M_{jj} \) threshold is increased the differences between QCD and QCD+EW \(Wjj\) production become more apparent, particularly at low lepton centrality where EW \(Wjj\) production is enhanced. The measurement of this distribution for EW \(Wjj\) production shows good agreement with the predictions.

Fig. 23
figure 23

Unfolded normalized differential QCD+EW \(Wjj\) (top) and EW (bottom) production cross sections as a function of lepton centrality for the inclusive fiducial region with \(M_{jj} > 0.5\) \(\text {TeV}\) (top left), 1.0 \(\text {TeV}\) (top right and bottom left), and 1.5 \(\text {TeV}\) (bottom right). Both statistical (inner bar) and total (outer bar) measurement uncertainties are shown, as well as ratios of the theoretical predictions to the data (the bottom panel in each distribution)

12.4 Observables sensitive to anomalous gauge couplings

Differential measurements are performed in distributions that provide enhanced sensitivity to anomalous gauge couplings:

  • \(p_{\text {T}} ^{j_1} \), the \(p_{\text {T}} \) of the highest-\(p_{\text {T}} \) jet;

  • \(p_{\text {T}} ^{jj}\), the \(p_{\text {T}} \) of the dijet system (vector sum of the \(p_{\text {T}} \) of the two highest-\(p_{\text {T}}\) jets); and

  • \(\Delta \phi (j_1,j_2)\), the magnitude of the azimuthal angle between the two highest-\(p_{\text {T}} \) jets,

where the last observable is sensitive to anomalous CP-violating couplings [82].

The transverse momentum distribution of the leading jet, shown in Fig. 24, has a substantial correlation with the momentum transfer in t-channel events. The QCD+EW \(Wjj\) measurements are globally well described by Powheg + Pythia8, while predictions from Sherpa and hej both show a harder spectrum than observed in data. For EW \(Wjj\) production the Powheg + Pythia8 and Sherpa predictions give a harder spectrum than observed in the data, particularly in the higher purity regions (Fig. 25). The overestimation of rates at high jet \(p_{\text {T}}\) may be reduced by the inclusion of NLO electroweak corrections [66].

Fig. 24
figure 24

Unfolded normalized differential \(Wjj\) production cross sections as a function of the leading-jet \(p_{\text {T}}\) in the signal, high-mass signal, forward-lepton/central-jet, forward-lepton, and central-jet fiducial regions. Both statistical (inner bar) and total (outer bar) measurement uncertainties are shown, as well as ratios of the theoretical predictions to the data (the bottom panel in each distribution)

Fig. 25
figure 25

Unfolded normalized differential EW \(Wjj\) production cross sections as a function of the leading-jet \(p_{\text {T}} \) for the inclusive fiducial region with three thresholds on the dijet invariant mass (1.0, 1.5, and 2.0 \(\text {TeV}\)), and for the signal-enriched fiducial region with a minimum dijet invariant mass of 1.0 \(\text {TeV}\). Both statistical (inner bar) and total (outer bar) measurement uncertainties are shown, as well as ratios of the theoretical predictions to the data (the bottom panel in each distribution)

The transverse momentum of the dijet system is also correlated with the momentum transfer in t-channel events. Figure 26 shows the measured normalized \(p_{\text {T}}\) distribution of the dijet system compared to the various predictions. There is a trend for all predictions to overestimate the relative rate at high dijet \(p_{\text {T}} \) in the inclusive and signal-enhanced regions, both for QCD+EW \(Wjj\) and EW \(Wjj\) production. As in the case of the jet \(p_{\text {T}}\) distribution, the discrepancy could be due to missing NLO electroweak corrections, which reduce the predictions at high W-boson \(p_{\text {T}}\)  [66].

Fig. 26
figure 26

Unfolded normalized differential \(Wjj\) production cross sections as a function of dijet \(p_{\text {T}} \) for the inclusive (top) and signal (bottom) regions with \(M_{jj} >0.5\) \(\text {TeV}\) (left) and \(M_{jj} > 1.0\) \(\text {TeV}\) (right). The bottom right distribution shows EW \(Wjj\) production and the other distributions show QCD+EW \(Wjj\) production. Both statistical (inner bar) and total (outer bar) measurement uncertainties are shown, as well as ratios of the theoretical predictions to the data (the bottom panel in each distribution)

The azimuthal angle between the two leading jets can be used to probe for new CP-odd operators in VBF production. The normalized differential cross sections for QCD+EW \(Wjj\) production as a function of this angle are shown in the inclusive, forward-lepton control, central-jet validation, and signal fiducial regions in Fig. 27. Good agreement between the data and all predictions is seen, with a slight tendency for predictions to overestimate the relative rate at small angles in all fiducial regions. Figure 28 shows the normalized EW \(Wjj\) cross section as a function of the azimuthal angle between the two leading jets for the inclusive and signal fiducial regions with \(M_{jj} >1.0\) \(\text {TeV}\).

Fig. 27
figure 27

Unfolded normalized differential \(Wjj\) production cross sections as a function of \(\Delta \phi (j_1,j_2)\) for the inclusive, forward-lepton control, central-jet validation, and signal fiducial regions. Both statistical (inner bar) and total (outer bar) measurement uncertainties are shown, as well as ratios of the theoretical predictions to the data (the bottom panel in each distribution)

Fig. 28
figure 28

Unfolded normalized differential EW \(Wjj\) production cross sections as a function of the azimuthal angle between the two leading jets, for the inclusive and signal fiducial regions with \(M_{jj} > 1.0\) \(\text {TeV}\). Both statistical (inner bar) and total (outer bar) measurement uncertainties are shown, as well as ratios of the theoretical predictions to the data (the bottom panel in each distribution)

13 Anomalous triple-gauge-boson couplings

The triple-gauge-boson vertex is directly probed by the vector-boson-fusion process. Non-SM couplings at this vertex would affect the production rates and distributions. The couplings are constrained in the context of an aTGC or EFT framework, using the yield in the anomalous coupling signal region (Table 1) to constrain the parameters. The results are complementary [83] to those obtained in diboson production [84], which corresponds to the exchange of one off-shell boson in the s-channel rather than two in the t-channel.

13.1 Theoretical overview

The signal-region measurements are sensitive to the WWV (V = Z or \(\gamma \)) couplings present in the t-channel production mode shown in Fig. 1a. These couplings can be characterized by an effective Lagrangian \(\mathcal {L}^{WWV}_\mathrm {eff}\) including operators up to mass-dimension six [34]:

$$\begin{aligned} i\mathcal {L}^{WWV}_\mathrm {eff}&=g_{WWV}\, \biggl \{ \Bigl [g_1^V V^{\mu }(W^{-}_{\mu \nu }W^{+\nu } -W^{+}_{\mu \nu }W^{-\nu }) + \kappa _{V} W^{+}_{\mu }W^{-}_{\nu }V^{\mu \nu } + \frac{\lambda _{V}}{m_{W}^{2}}V^{\mu \nu }W^{+ \rho }_{\nu }W^{-}_{\rho \mu } \Bigr ]\\&\qquad - \Bigl [\frac{\tilde{\kappa }_{V}}{2} W^{-}_{\mu }W^{+}_{\nu } \epsilon ^{\mu \nu \rho \sigma } V_{\rho \sigma } + \frac{\tilde{\lambda }_{V}}{2 m_{W}^{2}} W^{-}_{\rho \mu }W^{+ \mu }_{\nu } \epsilon ^{ \nu \rho \alpha \beta } V_{\alpha \beta } \Bigr ]\biggr \}, \end{aligned}$$

where \(W^{\pm }_{\mu \nu } = \partial _\mu W^{\pm }_\nu - \partial _\nu W^{\pm }_{\mu }\), with \(W^{\pm }_\mu \) the \(W^{\pm }\) field; \(V_{\mu \nu } = \partial _\mu V_\nu - \partial _\nu V_\mu \), with \(V_\mu \) the Z or \(\gamma \) field; \(m_W\) is the W-boson mass; and the individual couplings have SM values \(g_1^V = 1\), \(\kappa _{V} = 1\), \(\lambda _{V} = 0\), \(\tilde{\kappa }_{V} = 0\), and \(\tilde{\lambda }_{V} = 0\). The overall coupling constants \(g_{WWV}\) are given by \(g_{WW\gamma } = -e\) and \(g_{WWZ} = -e\cdot \cot (\theta _W)\), where e is the electromagnetic coupling and \(\theta _W\) is the weak mixing angle. The terms in the first row of the Lagrangian conserve C, P, and CP, while those in the second violate CP. Deviations of the \(g_1^V\) and \(\kappa _{V}\) parameters from the SM are denoted by \(\Delta g_1^Z= g_1^Z-1\) and \(\Delta \kappa _{V}=\kappa _{V}-1\), respectively. The requirement of gauge invariance at the level of dimension-six operators leads to the following relations [85]:

$$\begin{aligned} \Delta g_1^Z = \Delta \kappa _Z + \Delta \kappa _{\gamma } \tan ^2\theta _W, \quad \lambda _{\gamma } = \lambda _Z \equiv \lambda _V, \quad g_1^{\gamma }=1, \quad \tilde{\kappa }_{\gamma } = - \tilde{\kappa }_Z \cot ^2\theta _W, \quad \mathrm { and } \quad \tilde{\lambda }_{\gamma } = \tilde{\lambda }_Z \equiv \tilde{\lambda }_V. \end{aligned}$$

The presence of anomalous couplings leads to unphysically large cross sections when the square of the momentum transfer \((q^2)\) between the incoming partons is large. To preserve unitarity, a form factor is introduced with a new-physics scale \(\Lambda \) that suppresses the anomalous coupling at high energies:

$$\begin{aligned} \alpha (q^2) = \frac{\alpha }{(1+ q^2 / \Lambda ^2)^2}, \end{aligned}$$

where \(\alpha \) is the anomalous coupling of interest. In the following, 95% confidence-level intervals are set for a unitarization scale of \(\Lambda = 4 \, \text {TeV}\) and for a scale that effectively removes the form factor (shown as \(\Lambda = \infty \)). The scale \(\Lambda = 4 \, \text {TeV}\) is chosen because it does not violate unitarity for any parameter in the expected range of sensitivity.

An alternative to the use of a form factor is to employ an effective field theory, which is an expansion in inverse powers of the energy scale of new interactions assuming perturbative coupling coefficients. An EFT allows the comprehensive investigation of a complete set of dimension-six operators in a Lagrangian with SM fields. The dimension-six terms introduced in the EFT can be expressed as

$$\begin{aligned} \mathcal {L}_{\text {EFT}} = \sum _i \frac{c_i}{\Lambda ^2}O_i, \end{aligned}$$

where \(O_i\) are field operators with dimension 6, the scale of new physics is \(\Lambda \), and \(c_i\) are dimensionless coefficients. The operators relevant to triple-gauge-boson couplings in the HISZ basis [85] are

$$\begin{aligned} O_B= & {} (D_\mu H)^{\dagger } B^{\mu \nu }D_\nu H, \\ O_W= & {} (D_\mu H)^{\dagger } W^{\mu \nu }D_\nu H, \\ O_{WWW}= & {} \mathrm {Tr}[W_{\mu \nu } W^{\nu }_{\rho } W^{\rho \mu }], \\ O_{\tilde{W}}= & {} (D_\mu H)^{\dagger } \tilde{W}^{\mu \nu } D_\nu H, \\ O_{\tilde{W}WW}= & {} \mathrm {Tr}[W_{\mu \nu }W^{\nu }_{\rho } \tilde{W}^{\rho \mu }], \end{aligned}$$

where H is the Higgs-boson field, \(B_{\mu \nu } = \partial _\mu B_\nu - \partial _\nu B_{\mu }\), \(B^{\mu }\) is the U(1)\(_\mathrm {Y}\) gauge field, and \(\tilde{W}^{\mu \nu } = \frac{1}{2}\epsilon _{\mu \nu \rho \sigma }W^{\rho \sigma }\). The coefficients of these operators are related to the aTGC parameters via the following equations:

$$\begin{aligned} \frac{c_W}{\Lambda ^2}&= \frac{2}{m^2_Z} (g_1^Z -1), \\ \frac{c_B}{\Lambda ^2}&= \frac{2}{\tan ^2\theta _W m^2_Z}(g_1^Z -1)-\frac{2}{\sin ^2\theta _W m^2_Z} (\kappa _{Z} -1), \\ \frac{c_{WWW}}{\Lambda ^2}&= \frac{2}{3 g^2 m^2_W} \lambda _V, \\ \frac{c_{\tilde{W}}}{\Lambda ^2}&= -\frac{2}{\tan ^2\theta _W m^2_W} \tilde{\kappa }_Z, \\ \frac{c_{\tilde{W}WW}}{\Lambda ^2}&= \frac{2}{3 g^2 m^2_W} \tilde{\lambda }_V, \end{aligned}$$

where g is the weak coupling, \(m_Z\) is the Z-boson mass, and the aTGC parameters do not have any form-factor suppression.

13.2 Experimental method

The signal region defined to increase the sensitivity to anomalous triple-gauge-boson couplings requires \(M_{jj} > 1\) \(\text {TeV}\) and leading-jet \(p_{\text {T}} > 600\) \(\text {GeV}\) (Table 1). The leading-jet \(p_{\text {T}} \) is chosen because it is highly correlated with the \(q^2\) of the signal t-channel process. The \(p_{\text {T}} \) threshold is optimized to maximize sensitivity to anomalous couplings, considering both the statistical and systematic uncertainties. The event yields in the reconstructed signal region used for setting the constraints are given in Table 4. The SM prediction is negligible for \(p_{\text {T}} > 1\) \(\text {TeV}\), yielding an approximate lower bound for the validity of the EFT constraints.

The effects of anomalous couplings are modelled with Sherpa. Each sample is normalized by a factor \(k = \text {NLO}/\text {LO}\) given by the ratio of Powheg + Pythia8 to Sherpa SM predictions of electroweak \(Wjj\) production. The number of events expected for a given parameter value is calculated as:

$$\begin{aligned} N_{\text {reco}}=\mathcal {L}\times \sigma ^{\ell \nu jj} \times \mathcal{{A}} \times \mathcal{{C}} \times k, \end{aligned}$$

where \(\mathcal {L}\) is the integrated luminosity of the 8 \(\text {TeV}\) data, \(\sigma ^{\ell \nu jj}\) is the cross section for the corresponding anomalous-coupling variation, \(\mathcal{{A}}\) is the selection acceptance at particle level, and \(\mathcal{C}\) is the ratio of selected reconstruction-level events to the particle-level events in the fiducial phase-space region. The factor containing the cross section and acceptance (\(\sigma ^{\ell \nu jj} \times \mathcal{{A}}\)) is parameterized as a quadratic function of each aTGC parameter, with a 10% statistical uncertainty in the parameterization.

Theoretical uncertainties due to missing higher orders, estimated with factors of 2 and 1/2 variations of the renormalization and factorization scales, are estimated to be 8% of the strong \(Wjj\) yield and 14% of the electroweak \(Wjj\) yield in the region with leading-jet \(p_{\text {T}} >600\) \(\text {GeV}\). Detector uncertainties are correlated between strong and electroweak production and are estimated to be 11% of the combined yield.

13.3 Confidence-level intervals for aTGC parameters

Confidence-level (C.L.) intervals are calculated using a frequentist approach [86]. A negative log-likelihood function is constructed based on the expected numbers of background and signal events, and the number of observed data events. The likelihood is calculated as a function of individual aTGC parameter variations, with the other parameters set to their SM values. To obtain 95% confidence-level intervals, pseudoexperiments are produced with the number of pseudodata events drawn from a Poisson distribution, where the mean is given by the total SM prediction Gaussian-fluctuated according to theoretical and experimental uncertainties.

Table 9 Expected and observed 95% C.L. allowed ranges for all aTGC parameters considered with the other parameters set to their SM values. A form factor with unitarization scale equal to 4 TeV enforces unitarity for all aTGC parameters. The results are derived from the high-\(q^2\) region yields given in Table 4
Table 10 Expected and observed 95% C.L. intervals for individual EFT coefficients divided by the square of the new physics scale \(\Lambda \), with other coefficients set to zero. Intervals are calculated using the high-\(q^2\) region yields (Table 4)

Tables 9 and 10 give the expected and observed 95% C.L. interval for each parameter probed, with the other parameters set to their SM values. All observed intervals are narrower than the expected intervals due to a slight deficit of data events compared with the SM prediction (Table 4). The \(\lambda _V\) intervals are competitive with those derived from WW production [84]. The 95% C.L. regions in planes with two parameters deviating from their SM values are shown in Fig. 29. Since the regions are determined using a single measured yield, only the size of the region is constrained and not its shape. Thus, along an axis where one parameter is equal to zero, the corresponding one-parameter C.L. interval is recovered. The constraints on \(\tilde{\lambda }_V\) are similar to \(\lambda _V\) since the sensitivity is dominated by the square of the anomalous-coupling amplitude rather than its interference with the SM amplitude.

Fig. 29
figure 29

The observed (solid blue) and expected (open dashed) 95% C.L. allowed regions in two-parameter planes for \(\Lambda = 4\) \(\text {TeV}\). The regions are derived using a single measured yield and therefore reduce to the corresponding one-parameter interval when the other parameter is set to zero. Constraints on \(\tilde{\lambda }_V\) are similar to those on \(\lambda _V\)

14 Summary

Measurements of the fiducial and differential cross sections of electroweak production of W bosons in association with two jets have been performed using the lepton decay channel and events with high dijet invariant mass. The measurements use data collected by the ATLAS detector from proton–proton collisions at the LHC at centre-of-mass energies of \(\sqrt{s}=7\) and 8 \(\text {TeV}\), corresponding to 4.7 and 20.2 fb\(^{-1}\) of integrated luminosity, respectively. The cross sections in a fiducial region with a signal purity of \(\mathcal{{O}}\)(15%) are

$$\begin{aligned} \sigma ^\mathrm {fid}_{\mathrm {EW}~\ell \nu jj}~(7~\mathrm {TeV})= & {} 144 \pm 23~\mathrm {(stat)}~\pm 23~\mathrm {(exp)}~\pm 13~\mathrm {(th)~fb}, \\ \sigma ^\mathrm {fid}_{\mathrm {EW}~\ell \nu jj}~(8~\mathrm {TeV})= & {} 159 \pm 10~\mathrm {(stat)}~\pm 17~\mathrm {(exp)}~\pm 15~\mathrm {(th)~fb}, \end{aligned}$$

corresponding to a deviation of \(<0.1\sigma ~(1.4\sigma )\) from the SM prediction of \(144 \pm 11\) (\(198 \pm 12\)) fb at \(\sqrt{s}=7~(8)\) \(\text {TeV}\). The large sample size of the 8 \(\text {TeV}\) measurement yields the smallest relative uncertainty of existing fiducial cross-section measurements of electroweak boson production in a VBF topology.

Differential cross sections of the \(\sqrt{s}=8\) \(\text {TeV}\) electroweak \(Wjj\) production process are measured in a high-purity region with \(M_{jj} >1\) \(\text {TeV}\). The cross sections are measured as a function of dijet mass, dijet rapidity separation, dijet azimuthal angular separation, dijet \(p_{\text {T}} \), leading-jet \(p_{\text {T}} \), the number of jets within the dijet rapidity gap, and lepton and jet centralities. Additionally, differential cross sections are measured in various fiducial regions for the combined electroweak and strong \(Wjj\) production with high dijet invariant mass. The differential measurements are integrated in each fiducial region to obtain additional fiducial cross-section measurements. The most inclusive region, where \(M_{jj} >0.5\) \(\text {TeV}\), \(\Delta y(j_1,j_2) > 2\), \(p_{\text {T}} ^{j_1} > 80\) \(\text {GeV}\), and \(p_{\text {T}} ^{j_2} > 60\) \(\text {GeV}\), has a measured QCD+EW fiducial cross section at \(\sqrt{s}=8\) \(\text {TeV}\) of \(\sigma ^\mathrm {fid}_{\mathrm {QCD+EW}~\ell \nu jj} =1700 \pm 110\) fb.

The region of increased purity for electroweak production of \(Wjj\) (\(M_{jj} >1\) \(\text {TeV}\)) is used to constrain dimension-six triple-gauge-boson operators motivated by an effective field theory. To improve the sensitivity to high-scale physics affecting the triple-gauge-boson vertex, events with leading-jet \(p_{\text {T}} >600\) \(\text {GeV}\) are also used to constrain CP-conserving and CP-violating operators in the HISZ scenario, both with and without a form-factor suppression. A 95% C.L. range of \([-0.13,0.09]\) is determined for \(\lambda _V\) with a suppression scale of 4 \(\text {TeV}\) and the other parameters set to their SM values. Limits are also set on the parameters of an effective field theory. The operator coefficient \(c_{WWW}/\Lambda ^2\) is proportional to \(\lambda _V\) and is constrained to \([-13, 9]/{\mathrm {\text {TeV}}}^2\) at 95% C.L. Constraints on CP-violating operators are similar to those on the CP-conserving operators.