<?xml version="1.0" encoding="UTF-8"?>
<resource xmlns="http://datacite.org/schema/kernel-4" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.5/metadata.xsd">
  <identifier identifierType="DOI">10.5072/FK2/62G4ZZ</identifier>
  <creators>
    <creator>
      <creatorName nameType="Personal">Fernando Abreu de Souza</creatorName>
      <givenName>Fernando de</givenName>
      <familyName>eu de Souza</familyName>
      <affiliation>LIP</affiliation>
    </creator>
    <creator>
      <creatorName nameType="Personal">Maura Barros</creatorName>
      <givenName>Maura</givenName>
      <familyName>Barros</familyName>
      <affiliation>LIP</affiliation>
    </creator>
    <creator>
      <creatorName nameType="Personal">Nuno Castro</creatorName>
      <givenName>Nuno</givenName>
      <familyName>Castro</familyName>
      <affiliation>LIP e Universidade do Minho</affiliation>
    </creator>
    <creator>
      <creatorName nameType="Personal">Miguel Crispim Romão</creatorName>
      <givenName>Miguel</givenName>
      <familyName>Crispim Romão</familyName>
      <affiliation>LIP e Durham University</affiliation>
    </creator>
    <creator>
      <creatorName nameType="Personal">Rute Pedro</creatorName>
      <givenName>Rute</givenName>
      <familyName>Pedro</familyName>
      <affiliation>LIP</affiliation>
    </creator>
  </creators>
  <titles>
    <title>Simulated pp collisions at 13 TeV for Standard Model background and beyond Standard Model signals with 2 leptons, 1-bjet and high HT</title>
  </titles>
  <publisher>Repositório ACNCA</publisher>
  <publicationYear>2025</publicationYear>
  <subjects>
    <subject>Physics</subject>
    <subject>LIP-Machine Learning</subject>
  </subjects>
  <contributors>
    <contributor contributorType="Producer">
      <contributorName nameType="Personal">Laboratório de Instrumentação e Física Experimental de Partículas</contributorName>
      <givenName>de de</givenName>
      <familyName>tório de Instrumentação e Física Experimental de Partículas</familyName>
    </contributor>
    <contributor contributorType="ContactPerson">
      <contributorName nameType="Personal">LIP</contributorName>
    </contributor>
  </contributors>
  <dates>
    <date dateType="Submitted">2025-12-29</date>
    <date dateType="Available">2025-12-30</date>
  </dates>
  <resourceType resourceTypeGeneral="Dataset"/>
  <alternateIdentifiers>
    <alternateIdentifier alternateIdentifierType=":unav">10.5281/zenodo.15423466</alternateIdentifier>
  </alternateIdentifiers>
  <sizes>
    <size>17373144</size>
    <size>17426104</size>
    <size>16031448</size>
    <size>42886136</size>
    <size>17039840</size>
    <size>19464800</size>
    <size>459996792</size>
  </sizes>
  <formats>
    <format>application/x-hdf</format>
    <format>application/x-hdf</format>
    <format>application/x-hdf</format>
    <format>application/x-hdf</format>
    <format>application/x-hdf</format>
    <format>application/x-hdf</format>
    <format>application/x-hdf</format>
  </formats>
  <version>1.0</version>
  <rightsList>
    <rights rightsURI="info:eu-repo/semantics/openAccess"/>
    <rights rightsURI="http://creativecommons.org/licenses/by/4.0" rightsIdentifier="CC-BY-4.0" rightsIdentifierScheme="SPDX" schemeURI="https://spdx.org/licenses/" xml:lang="en">Creative Commons Attribution 4.0 International License.</rights>
  </rightsList>
  <descriptions>
    <description descriptionType="Abstract">The dataset comprises simulated events from pp collisions at 13 TeV. Standard Model (SM) background events and a diverse set of beyond Standard Model (BSM) signals with 2 leptons, 1 b-jet and  H_T &amp;gt; 500 GeV in the final state were simulated.

The SM background was generated at leading order, and it includes Z+jets, ttbar, WW, WZ and ZZ subsamples. The processes were generated in kinematic regions to ensure good statistics across the whole phase space. The sampling was carried out using event generation filters at parton level as follows:

ttbar: pT &amp;lt;100 GeV; pT in [100, 250] GeV; pT &amp;gt; 250 GeV
The scalar sum of the pT of outgoing particles (ST) for Z+Jet: ST &amp;lt; 250 Gev; ST in [250, 500] GeV; ST &amp;gt; 500 GeV
W/Z pT for dibosons: pT &amp;lt; 250 GeV; pT in [250, 500] GeV; pT &amp;gt; 500 GeV
The BSM signals include:

Pair production of heavy vector-like T quarks, with T masses mT = {1.0, 1.4} TeV;
tZ production through a flavour-changing neutral current vertex;
Production of a Randall-Sundrum  radion R  with mR = 4 TeV;
Top quark pair production in association with a heavy Higgs boson H&amp;apos; (2HDM), with mH&amp;apos; = 400 GeV;
Production of  W_R, decaying N_R, and a charged lepton, with masses mW_R = 6.5 TeV and mN_R = 1.5 TeV (Left-Right Symmetric Model).
All samples were generated using MadGraph5 2.6.5, and the detector was simulated using Delphes 3 with the default CMS card. Pythia 8.2 was used for the hadronisation of the events. The features are in Cartesian coordinates, and the accumulation at zeros from non-reconstructed objects (i.e. missing values) is removed. Each file provides a train:validation:test split with the ratio 1:1:1 to ensure equal statistical description of the events. 

The events were generated with the support of the Nacional de Computação Avançada (CNCA) under the 2023.10635.CPCA.A1 project.</description>
    <description descriptionType="Other">Abreu de Souza, F., Barros, M., Castro, N., Crispim Romão, M., &amp; Pedro, R. (2025). Simulated pp collisions at 13 TeV for Standard Model background and beyond Standard Model signals with 2 leptons, 1-bjet and high HT [Data set]. Zenodo. https://doi.org/10.5281/zenodo.15423467</description>
  </descriptions>
</resource>
