Skip to main content
Version: 1.2.0

Normalization

Normalization is a critical stage connecting ingestion from sources and forwarding to targets used to coalesce log data from diverse sources into consistent formats, enabling unified handling across different logging systems.

Log Formats

The processor supports several widely-used log formats:

Generic

FormatNotationKey IdentifierLayout CharacteristicsExample Fields
Elastic Common Schema (ECS)Dot notation with lowercase@timestampHierarchical structuresource.ip, network.direction
Splunk Common Information Model (CIM)Underscore with lowercase_timeFlat structuresrc_ip, network_direction
Advanced Security Information Model (ASIM)PascalCaseTimeGeneratedExplicit namesSourceIp, NetworkDirection

Security-specific

FormatDescriptionKey IdentifierExample Fields
Common Event Format (CEF)ArcSight's standard formatrt (receiptTime)networkUser, sourceAddress
Log Event Extended Format (LEEF)IBM QRadar's formatdevTimenetworkUser, srcAddr
Common Security Log (CSL)Microsoft Sentinel's formatTimeGeneratedNetworkUser, SourceAddress

Format Detection

Source formats can be automatically detected using certain characteristic fields, e.g.

ContextFieldFormat
Timestamp@timestampECS
_timeCIM
TimeGeneratedASIM/CSL
SecurityrtCEF
devTimeLEEF
CSL detectionTimeGenerated + LogSeverityCSL
TimeGenerated onlyASIM

Conversion

Casing and Delimiters

Each format follows specific naming conventions:

ECS
source.ip, event.severity
CIM
src_ip, event_severity
ASIM
SourceIp, EventSeverity
CEF
sourceAddress, eventSeverity
LEEF
srcAddr, evtSev
CSL
SourceIP, EventSeverity
caution

Complex format conversions may impact performance.

Field Mapping

There are identifiable common network fields based on context across various formats:

Context
FormatSource IPDestination IPDirection
ecssource.ipdestination.ipnetwork.direction
cimsrc_ipdest_ipnetwork_direction
asimSourceIpDstIpNetworkDirection
cefsrcdstnetworkDirection
leefsrcAddrdstAddrnetDir
cslSourceIpDestinationIpNetworkDirection

Configuration

Basic

Convert from ECS to ASIM format:

normalize:
source_format: ecs
target_format: asim

Field-specific

Convert a specific network field:

normalize:
field: network_data
source_format: cef
target_format: ecs

Auto-detection

Let the processor detect the source format:

normalize:
target_format: cim

Preprocessing

Fields are standardized with normalize for conversion between the ECS, CIM, ASIM, CEF, LEEF and CSL formats (see the Log Formats and Conversion sections above). Values are formatted for uniform casing with uppercase and lowercase processors when required by the target format's naming conventions.

Postprocessing

Fields are optimized for storage and queries using format conversion with the normalize processor (see the Conversion and Field Mapping sections above). For Microsoft Sentinel integration, data is prepared by converting to the ASIM format with normalize (see Log Formats table).

warning

Complex format conversions may impact processing performance and delivery latency.