Encrypted Web Traffic Dataset: Event Logs and Packet Traces
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14610%2F22%3A00125749" target="_blank" >RIV/00216224:14610/22:00125749 - isvavai.cz</a>
Result on the web
<a href="https://doi.org/10.1016/j.dib.2022.108188" target="_blank" >https://doi.org/10.1016/j.dib.2022.108188</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1016/j.dib.2022.108188" target="_blank" >10.1016/j.dib.2022.108188</a>
Alternative languages
Result language
angličtina
Original language name
Encrypted Web Traffic Dataset: Event Logs and Packet Traces
Original language description
We present a dataset that captures seven days of monitoring data from eight servers hosting more than 800 sites across a large campus network. The dataset contains data from network monitoring and host-based monitoring. The first set of data are packet traces collected by a probe situated on the network link in front of the web servers. The traces contain encrypted HTTP over TLS 1.2 communication between clients and web servers. The second set of data is an event log captured directly on the web servers. The events are generated by the Internet Information Services (IIS) logging and include both the IIS default features and custom features, such as client port and transferred data volume. Anonymization of all features in the dataset has been carefully carried out to prevent private information leakage while preserving the information value of the dataset. The dataset is suitable mainly for training machine learning techniques for anomaly detection and the identification of relationships between network traffic and events on web servers. We also add tools, settings, and a guide to convert the packet traces to IP flows that are often preferred for network traffic analysis.
Czech name
—
Czech description
—
Classification
Type
J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database
CEP classification
—
OECD FORD branch
10200 - Computer and information sciences
Result continuities
Project
—
Continuities
S - Specificky vyzkum na vysokych skolach
Others
Publication year
2022
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
Data in Brief
ISSN
2352-3409
e-ISSN
—
Volume of the periodical
42
Issue of the periodical within the volume
June
Country of publishing house
NL - THE KINGDOM OF THE NETHERLANDS
Number of pages
10
Pages from-to
1-10
UT code for WoS article
000795935500014
EID of the result in the Scopus database
2-s2.0-85129507189