Copyright © 2023 World Wide Web Consortium. W3C® liability, trademark and permissive document license rules apply.
This document defines a mechanism that enables developers to declare a network error reporting policy for a web application. A user agent can use this policy to report encountered network errors that prevented it from successfully fetching requested resources.
This section describes the status of this document at the time of its publication. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at https://www.w3.org/TR/.
This document was published by the Web Performance Working Group as a Working Draft using the Recommendation track.
Publication as a Working Draft does not imply endorsement by W3C and its Members.
This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress.
This document was produced by a group operating under the W3C Patent Policy. W3C maintains a public list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes contains Essential Claim(s) must disclose the information in accordance with section 6 of the W3C Patent Policy.
This document is governed by the 12 June 2023 W3C Process Document.
Accurately measuring performance characteristics of web applications is an important aspect in helping site developers understand how to improve their web applications. The worst case scenario is the failure to load the application, or a particular resource, due to a network error, and to address such failures the developer requires assistance from the user agent to identify when, where, and why such failures are occurring.
Today, application developers do not have real-time web application availability data from their end users. For example, if the user fails to load the page due to a network error, such as a failed DNS lookup, a connection timeout, a reset connection, or other reasons, the site developer is unable to detect and address this issue. Note that these kinds of network errors cannot be detected purely server-side, since by definition the client might not have been able to successfully establish a connection with the server.
Existing methods (such as synthetic monitoring) provide a partial solution by placing monitoring nodes in predetermined geographic locations, but require additional infrastructure investments, and cannot provide truly global and near real-time availability data for real end users.
Network Error Logging (NEL) addresses this need by defining a mechanism enabling web applications to declare a reporting policy that can be used by the user agent to report network errors for a given origin. A web application opts into using NEL by supplying a NEL
HTTP response header field that describes the desired NEL policy. This policy instructs the user agent to log information about requests to that origin, and to attempt to deliver that information to a group of endpoints previously configured using the [[REPORTING]. As the name implies, NEL reports are primarily used to describe errors. However, in order to determine rates of errors across different client populations, we must also know how many successful requests are occurring; these successful requests can also be reported via the NEL mechanism.
For example, if the user agent fails to fetch a resource from https://www.example.com
due to an aborted TCP connection, the user agent would queue the following report via the Reporting API:
"network-error"
report_to
field{
"referrer": "https://referrer.com/",
"sampling_fraction": 1.0,
"server_ip": "192.0.2.42",
"protocol": "http/1.1",
"elapsed_time": 321,
"phase": "connection",
"type": "tcp.aborted"
}
See 5.4 Generate a network error report for an explanation of the communicated fields and format of the report, and 7. Examples for more hands-on examples of NEL registration and reporting process.
As well as sections marked as non-normative, all authoring guidelines, diagrams, examples, and notes in this specification are non-normative. Everything else in this specification is normative.
The key words MAY, MUST, MUST NOT, OPTIONAL, REQUIRED, and SHOULD in this document are to be interpreted as described in BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all capitals, as shown here.
Requirements phrased in the imperative as part of algorithms (such as "strip any leading space characters" or "return false and abort these steps") are to be interpreted with the meaning of the key word ("must", "should", "may", etc) used in introducing the algorithm.
Some conformance requirements are phrased as requirements on attributes, methods or objects. Such requirements are to be interpreted as requirements on the user agent.
Conformance requirements phrased as algorithms or specific steps may be implemented in any manner, so long as the end result is equivalent. (In particular, the algorithms defined in this specification are intended to be easy to follow, and not intended to be performant.)
The following terms are defined in the DNS specification: [RFC1034]
The following terms are defined in the Fetch specification: [FETCH]
The following terms are defined in the High Resolution Time specification: [HR-TIME]
The following terms are defined in the HSTS specification: [RFC6797]
The following terms are defined in the HTML specification: [HTML]
navigator.onLine
The following terms are defined in the HTTP specification: [RFC7230], [RFC7231], [RFC7232], [RFC7234]
The following terms are defined in the HTTP-JFV specification: [HTTP-JFV]
The following terms are defined in the JSON specification: [RFC7159]
The following terms are defined in the Network Reporting API specification: [NETWORK-REPORTING]
The following terms are defined in the Referrer Policy specification: [REFERRER-POLICY]
The following terms are defined in the Reporting API specification: [REPORTING]
The following terms are defined in the Resource Timing specification: [RESOURCE-TIMING-2]
The following terms are defined in the Secure Contexts specification: [SECURE-CONTEXTS]
The following terms are defined in the URL specification: [URL]
A network request occurs when the user agent must use the network to service a single request.
If the user agent can service a request out of a local cache, that request MUST NOT result in a network request.
If the user agent follows redirects as part of a navigation, there MUST be separate network requests for each request in the redirect chain.
A request MUST NOT result in a network request if the user
agent is known to be offline (i.e., when navigator.onLine
returns
false
).
A request MUST NOT result in a network request if it is blocked due to mixed content or CORS failures. Any CORS-preflight request MUST result in its own network request.
For user agents that service requests according to the [FETCH] standard, a network request corresponds to one execution of the HTTP-network fetch algorithm.
Regardless of which fetch algorithm and which underlying application and transport protocols are used, servicing a network request consists of the following phases:
The only mandatory phase is the transmission of request and response; the other phases might not be needed for every network request. For instance, DNS results can be cached locally in the user agent, eliminating DNS resolution for future requests to the same domain. Similarly, HTTP persistent connections allow open connections to be shared for multiple requests to the same origin. However, if multiple phases occur, they will occur in the above order.
We would like to move the definition of these phases into [FETCH] so that they are more reusable.
A network request is successful if the user agent is able to receive a valid HTTP response from the server, and that response does not have a 4xx or 5xx status code.
A network request is failed if it is not successful.
Note that HTTP error responses (i.e., those with a 4xx or 5xx status code) are considered failures, so that they are subject to a NEL policy's failure sampling rate instead of its successful sampling rate.
A network error is the error condition that caused a network request to fail.
Each network error has a type, which is a string.
Each network error has a phase, which describes which phase the error occurred in:
dns
connection
application
There are several predefined network error types defined in 6. Predefined network error types.
A network error report is a Reporting API report that describes a network error.
Network error reports have a report type of
network-error
.
Network error reports are NOT visible to
ReportingObserver
s.
Network error reports are not visible to
ReportingObserver
s because they are only intended to be
visible to the administrator or owner of the server receiving the
requests. If they were visible to ReportingObserver
s,
then the reports would also be visible to the originator of the
request. For cross-origin requests, this could leak information about the
server's network configuration to parties outside of its control.
A NEL policy instructs a user agent whether to collect reports about network requests to an origin, and if so, where to send them. NEL policies are delivered to the user agent via HTTP response headers.
Each NEL policy has a received IP address, which is the IP address of the server that the user agent received this NEL policy from.
Each NEL policy has an origin.
Each NEL policy has a subdomains flag, which is either
include
or exclude
.
Each NEL policy has a list of request headers and a list of response headers, each of which is a list of header names.
Each NEL policy has a reporting group, which is the name of the Reporting endpoint group that reports for this policy will be sent to.
Each NEL policy has a ttl representing the number of seconds the policy remains valid.
Each NEL policy has a creation which is the timestamp when the user agent received the policy.
A NEL policy is stale if the duration from its creation to the current wall time is greater than 172800 seconds (48 hours).
A NEL policy is expired if the duration from its creation to the current wall time is greater than its ttl (in seconds).
An origin that expects to serve a large volume of traffic might not be equipped to ingest NEL reports for every network request made to the origin. The origin can define sampling rates to limit the number of NEL reports that each user agent submits. Since successful requests should typically greatly outnumber failed requests, the origin can specify different sampling rates for each.
Each NEL policy has a successful sampling rate, which is a number between 0.0 and 1.0 inclusive.
Each NEL policy has a failure sampling rate, which is a number between 0.0 and 1.0 inclusive.
A conformant user agent MUST provide a policy cache, which is a storage mechanism that maintains a set of NEL policies, keyed by (network partition key, origin) tuples.
This storage mechanism is opaque, vendor-specific, and not exposed to the web, but it MUST provide the following methods which will be used in the algorithms this document defines:
A server MAY define a NEL policy for an origin it controls via
the NEL
HTTP response header.
The NEL
response header is used to
communicate an origin's NEL policy to the user agent. The
ABNF (Augmented Backus-Naur Form) syntax for the NEL
header is as
follows:
NEL = json-field-value
The header's value is interpreted as an array of JSON objects, as defined by json-field-value. Each object in the array defines an NEL policy for the origin. The user agent MUST process the first valid policy in the array and ignore any additional policies in the array.
User agents MUST ignore any unknown or invalid field(s) or value(s) that do not conform to the syntax defined in this specification. A valid NEL
header field MUST, at a minimum, contain one object with all of the "REQUIRED" fields defined in this specification.
The user agent MUST ignore the NEL
header specified via a meta
element to mitigate hijacking of error reporting via scripting attacks. The NEL policy MUST be delivered via the NEL
response header.
The restriction on meta
element is consistent with the [CSP] specification, which restricts reporting registration to HTTP header fields only for the same reasons.
The report_to
member specifies the endpoint
group that reports for this NEL policy will be sent to.
The report_to
member is REQUIRED to register a NEL policy,
and OPTIONAL if the intent is to remove a previous registration – see
max_age
. If present, its value MUST be a string; any other type
will result in a parse error.
To improve delivery of NEL reports, the server should set
report_to
to an endpoint group containing at least
one endpoint in an alternative origin whose infrastructure is not
coupled with the origin from which the resource is being fetched —
otherwise network errors cannot be reported until the problem is solved,
if ever — and provide multiple endpoints to provide alternatives
if some endpoints are unreachable.
The REQUIRED max_age
member specifies the
lifetime of this NEL policy, as a non-negative integer number of
seconds. Its value MUST be an non-negative integer; any other type will
result in a parse error.
A value of 0
will cause any NEL policy for this
origin to be removed from the policy cache.
To ensure delivery of NEL reports, the server should ensure that
the Reporting endpoint group is also configured with a
sufficiently high max_age
. If the Reporting policy
expires, NEL reports will not be delivered, even if the NEL policy has
not expired.
The OPTIONAL include_subdomains
member is a
boolean that enables this NEL policy for all subdomains of this
origin (to an unlimited subdomain depth). If no member named
include_subdomains
is present in the object, or its value
is not true
, the NEL policy will not be enabled
for subdomains.
To ensure delivery of NEL reports for subdomains, the application should
ensure that the Reporting endpoint group is also configured with
include_subdomains
enabled. If the Reporting policy is
not, and there is not a separate Reporting policy for a given subdomain,
NEL reports for that subdomain will not be delivered, even if the NEL
policy includes the subdomain.
The OPTIONAL success_fraction
member defines the
sampling rate that should be applied to reports about
successful network requests for this origin. If present,
its value MUST be a number between 0.0
and
1.0
, inclusive; any other value will result in a parse
error. If this member is not present, the user agent will not
collect NEL reports about successful network requests for
this origin.
The OPTIONAL failure_fraction
member defines the
sampling rate that should be applied to reports about
failed network requests for this origin. If present, its
value MUST be a number between 0.0
and 1.0
,
inclusive; any other value will result in a parse error. If this member
is not present, the user agent will collect NEL reports about
all failed network requests for this origin.
The OPTIONAL request_headers
member defines the
list of request headers whose names and values will be included in network error reports
about this origin. If present, its value MUST be a list of
strings.
The OPTIONAL response_headers
member defines the
list of response headers whose
names and values will be included in network error reports
about this origin. If present, its value MUST be a list of
strings.
Given a network request (request) and its corresponding response (response), this algorithm extracts a NEL policy for request's origin, and updates the policy cache accordingly.
Potentially
Trustworthy
.
NEL
.
NEL
.
max_age
, or that
member's value is not a number, abort these steps.
max_age
member is
0
, then remove any NEL policy from the policy
cache whose origin is
origin, and skip the remaining steps.
report_to
, or that
member's value is not a string, abort these steps.
success_fraction
, whose
value is not a number in the range 0.0 to 1.0, inclusive, abort these
steps.
failure_fraction
, whose
value is not a number in the range 0.0 to 1.0, inclusive, abort these
steps.
request_headers
, whose
value is not a list, or if any element of that list is not a string,
abort these steps.
response_headers
, whose
value is not a list, or if any element of that list is not a string,
abort these steps.
Let policy be a new NEL policy whose properties are set as follows:
Plumb this through more explicitly in [FETCH].
include
if item has a member named
include_subdomains
whose value is true
,
exclude
otherwise
request_headers
member
response_headers
member
report_to
membermax_age
membersuccess_fraction
member, if
present; 0.0
otherwise
failure_fraction
member, if
present; 1.0
otherwise
Given a network request (request), this algorithm determines which NEL policy in the policy cache should be used to generate reports for that network request.
include
, return it.
no policy
.
Given a network request (request) and a NEL policy (policy), this algorithm extracts header values from the request as instructed by the policy.
Given a response (response) and a NEL policy (policy), this algorithm extracts header values from the response as instructed by the policy.
Given a network request (request) and its corresponding response (response), this algorithm generates a report about request if instructed to by any matching NEL policy, and returns the report and the NEL policy. Otherwise this algorithm returns null.
Potentially Trustworthy
, return
null.
no policy
, return null.
sampling_fraction
elapsed_time
phase
"application"
.
type
"ok"
.
phase
property is not
dns
, append the following properties to report
body:
server_ip
x:x:x:x:x:x:x:x
, where the 'x's are one to four hexadecimal
digits of the eight 16-bit pieces of the address). [RFC4291]
protocol
""
.
phase
property is not
dns
or connection
, append the following
properties to report body:
referrer
method
request_headers
response_headers
status_code
0
.
include
, and report
body's phase
property is not dns
,
return null.
This step ensures that subdomain NEL policies can only be used to generate reports about subdomains of the policy origin during the DNS resolution phase of a request. See 9. Privacy Considerations for more details.
phase
property is not
dns
, and report body's server_ip
property is non-empty and not equal to policy's received
IP address:
phase
to
dns
. type
to
dns.address_changed
.
request_headers
,
response_headers
, status_code
, and
elapsed_time
properties.
This step "downgrades" a NEL report if the IP addresses of the server and the policy don't match. This is a privacy protection, ensuring that NEL reports are only sent to the owner of the service that the report describes. If the IP addresses don't match, then the user agent can only verify that the NEL policy was sent by the owner of the origin's domain name; it cannot verify that the policy was sent by the owner of the server this domain name resolves to. We therefore downgrade the report to only contain information about DNS resolution. See 9. Privacy Considerations and 7.5 Origins with multiple IP addresses for more details.
Given a ECMAScript object (report body, usually returned from Generate a network error report and then augmented by the calling specification) and its matching NEL policy (policy) and network request (request), this algorithm queues the report for delivery.
Let url be request's URL.
Clear url's fragment.
If report body's phase
property is
dns
or connection
:
Generate a network report given these parameters:
network-error
There are several predefined network error types.
The user agent MAY extend this list with custom network error
types — e.g. to accommodate new protocols, or more detailed error
descriptions of existing ones. When doing so, the user agent SHOULD
follow the dot-delimited pattern
([group].[optional-subgroup].[error-name]
) for the
type names to facilitate simple and consistent processing of the
error reports — e.g. the collector may provide aggregation by category
and/or one or multiple subgroups.
All of the network errors in this section occur during DNS
resolution, and therefore have a phase
of dns
.
dns.unreachable
dns.name_not_resolved
dns.failed
dns.address_changed
All of the network errors in this section occur during secure
connection establishment, and therefore have a
phase of connection
.
tcp.timed_out
tcp.closed
tcp.reset
tcp.refused
tcp.aborted
tcp.address_invalid
tcp.address_unreachable
tcp.failed
tls.version_or_cipher_mismatch
tls.bad_client_auth_cert
tls.cert.name_invalid
tls.cert.date_invalid
tls.cert.authority_invalid
tls.cert.invalid
tls.cert.revoked
tls.cert.pinned_key_not_in_cert_chain
tls.protocol.error
tls.failed
All of the network errors in this section occur during the
transmission of request and response, and therefore have a phase of application
.
http.error
http.protocol.error
http.response.invalid
http.response.redirect_loop
http.failed
abandoned
unknown
> GET / HTTP/1.1
> Host: example.com
< HTTP/1.1 200 OK
< ...
< Report-To: {"group": "network-errors", "max_age": 2592000,
"endpoints": [{"url": "https://example.com/upload-reports"}]}
< NEL: {"report_to": "network-errors", "max_age": 2592000}
This NEL
header defines a NEL policy, instructing
the user agent to report network errors about example.com
to the endpoint group named network-errors
. The
policy applies for 2592000 seconds (30 days).
Note that above registration will only succeed if the response is communicated from a potentially trustworthy origin.
> GET / HTTP/1.1
> Host: example.com
< HTTP/1.1 200 OK
< ...
< NEL: {"max_age": 0}
This NEL
header instructs the user agent to remove any
existing NEL policy for example.com
.
This section contains example network error reports the
user agent might queue when a network error is encountered for an
origin with a registered NEL policy. We show the full
report payload that would be created by the [REPORTING] API when
uploading the report; the payload's body
field contains the
network error report body.
{
"age": 0,
"type": "network-error",
"url": "https://www.example.com/",
"body": {
"sampling_fraction": 0.5,
"referrer": "http://example.com/",
"server_ip": "2001:DB8:0:0:0:0:0:42",
"protocol": "h2",
"method": "GET",
"request_headers": {},
"response_headers": {},
"status_code": 200,
"elapsed_time": 823,
"phase": "application",
"type": "http.protocol.error"
}
}
This report indicates that the user agent attempted to navigate from
example.com
to www.example.com
, which
successfully resolved to 2001:DB8::42
. However, while
the user agent received a 200 response from the server via the
HTTP/2 (h2
) protocol, it encountered a protocol error in
the exchange and was forced to abandon the navigation. The user agent
aborted the navigation 823 milliseconds after it started. Finally, the
user agent sent this report immediately after the network error was
encountered – i.e. the report age is 0.
{
"age": 0,
"type": "network-error",
"url": "https://widget.com/thing.js",
"body": {
"sampling_fraction": 1.0,
"referrer": "https://www.example.com/",
"server_ip": "",
"protocol": "",
"method": "GET",
"request_headers": {},
"response_headers": {},
"status_code": 0,
"elapsed_time": 143,
"phase": "dns",
"type": "dns.name_not_resolved"
}
}
The above report indicates that the user agent attempted to fetch
https://widget.com/thing.js
from
https://www.example.com/
. However, the user agent was
unable to resolve the DNS name (widget.com
) and the request
was aborted by the user agent after 143 milliseconds. Because a
previous request to widget.com
delivered a valid NEL
policy, the user agent generates a network error
report for this request. The report was uploaded immediately
after the network error was encountered – i.e. the report age is
0.
> GET / HTTP/1.1
> Host: example.com
< HTTP/1.1 200 OK
< ...
< Report-To: {"group": "network-errors", "max_age": 2592000,
"endpoints": [{"url": "https://example.com/upload-reports"}]}
< NEL: {"report_to": "network-errors", "max_age": 2592000, "include_subdomains": true}
This NEL
header allows the owner of
example.com
to detect when they have misconfigured their
DNS servers — for instance, when they have forgotten to add a new
resource record resolving new-subdomain.example.com
to an
IP address. If a user agent tries to make a request to
new-subdomain.example.com
, it might generate the following
report:
{
"age": 0,
"type": "network-error",
"url": "https://new-subdomain.example.com/",
"body": {
"sampling_fraction": 1.0,
"server_ip": "",
"protocol": "http/1.1",
"method": "GET",
"request_headers": {},
"response_headers": {},
"status_code": 0,
"elapsed_time": 48,
"phase": "dns",
"type": "dns.name_not_resolved"
}
}
> GET / HTTP/1.1
> Host: example.com
< HTTP/1.1 200 OK
< ...
< Report-To: {"group": "network-errors", "max_age": 2592000,
"endpoints": [{"url": "https://example.com/upload-reports"}]}
< NEL: {"report_to": "network-errors", "max_age": 2592000, "success_fraction": 1.0,
"request_headers": ["If-None-Match"], "response_headers": ["ETag"]}
< ETag: 01234abcd
In this example, the owner of example.com
uses
response headers to identify different versions
of the resources hosted on the server. User agents can then use
ETag
request headers to inform the server
which version of a resource is presently cached client-side, allowing
the server to avoid generating and sending the content of the resource
if the client's existing copy is up to date.
If-None-Match
By including request_headers
and response_headers
fields
in the NEL
header for this domain, the browser will include
a copy of the
request header and
If-None-Match
response header in any NEL report that it
creates for that request, allowing the site owner to track the
effectiveness of their caching policies.
ETag
Given the above, consider the following sequence of events:
The user agent sends a request to example.com
,
and receives a successful response from the server,
with an
header indicating the version of
the resource. The user agent will generate the following NEL
report:
ETag
{
"age": 0,
"type": "network-error",
"url": "https://example.com/",
"body": {
"sampling_fraction": 1.0,
"server_ip": "192.0.2.1",
"protocol": "http/1.1",
"method": "GET",
"request_headers": {},
"response_headers": {
"ETag": ["01234abcd"]
},
"status_code": 200,
"elapsed_time": 1392,
"phase": "application",
"type": "ok"
}
}
Some time later, the user agent sends another request to
example.com
. The user agent still has a copy of the
original resource in its local cache, and includes its version in a
request header. The server checks
this version, notices that it is still current, and sends a
If-None-Match
304
response informing the user agent that its cached copy of
the resource is still valid. The user agent will generate the
following report:
{
"age": 0,
"type": "network-error",
"url": "https://example.com/",
"body": {
"sampling_fraction": 1.0,
"server_ip": "192.0.2.1",
"protocol": "http/1.1",
"method": "GET",
"request_headers": {
"If-None-Match": ["01234abcd"]
},
"response_headers": {
"ETag": ["01234abcd"]
},
"status_code": 304,
"elapsed_time": 45,
"phase": "application",
"type": "ok"
}
}
Even later, the user agent sends yet another request to
example.com
. The user agent still has the same copy of the
resource in its local cache, and includes its version in a
request header, as in the previous
example. However, this time the server notices that there is a new
version of the resource available. It generates the content of this
resource, and sends it to the client, with the new version encoded
in a new If-None-Match
response header value. The user
agent will generate the following report:
ETag
{
"age": 0,
"type": "network-error",
"url": "https://example.com/",
"body": {
"sampling_fraction": 1.0,
"server_ip": "192.0.2.1",
"protocol": "http/1.1",
"method": "GET",
"request_headers": {
"If-None-Match": ["01234abcd"]
},
"response_headers": {
"ETag": ["56789ef01"]
},
"status_code": 200,
"elapsed_time": 935,
"phase": "application",
"type": "ok"
}
}
For origins whose domain name resolves to multiple IP addresses, NEL will sometimes "downgrade" an error report, providing less information about the cause of the error, since it cannot verify that the owner of the origin is the same as the owner of the server handling the request.
As an example, assume that example.com
is handled by three
servers, each with a different IP address. The owner of the
service configures DNS to resolve example.com
to
192.0.2.1
, 192.0.2.2
, and
192.0.2.3
, and relies on user agents to balance their
requests across these three IP addresses. The service owner delivers
the following NEL policy:
> GET / HTTP/1.1
> Host: example.com
< HTTP/1.1 200 OK
< ...
< Report-To: {"group": "network-errors", "max_age": 2592000,
"endpoints": [{"url": "https://example.com/upload-reports"}]}
< NEL: {"report_to": "network-errors", "max_age": 2592000,
"success_fraction": 1.0, "failure_fraction": 1.0}
Given the above, consider the following sequence of events:
The user agent sends a request to 192.0.2.1
,
and receives a successful response from the server.
This response includes the above NEL policy, and the user
agent sets the policy's received IP address to
192.0.2.1
. Since the received IP address
matches the server's IP address (which it must for any
successful request), it generates the following NEL report:
{
"age": 0,
"type": "network-error",
"url": "https://example.com/",
"body": {
"sampling_fraction": 1.0,
"server_ip": "192.0.2.1",
"protocol": "http/1.1",
"method": "GET",
"request_headers": {},
"response_headers": {},
"status_code": 200,
"elapsed_time": 57,
"phase": "application",
"type": "ok"
}
}
The user agent sends a new request to
192.0.2.2
, and receives another successful
response. This response also includes the NEL policy,
and the user agent updates the policy's received IP address
to 192.0.2.2
. Since the received IP address
matches the server's IP address (which it must for any
successful request), it generates the following NEL report:
{
"age": 0,
"type": "network-error",
"url": "https://example.com/",
"body": {
"sampling_fraction": 1.0,
"server_ip": "192.0.2.2",
"protocol": "http/1.1",
"method": "GET",
"request_headers": {},
"response_headers": {},
"status_code": 200,
"elapsed_time": 34,
"phase": "application",
"type": "ok"
}
}
The user agent then tries to send a request to
192.0.2.3
, but isn't able to establish a connection
to the server. The user agent still has the NEL policy in
the policy cache, and would ideally use this policy to
generate a tcp.timed_out
report about the failed
network request. However, the because policy's received
IP address (192.0.2.2
) doesn't match the IP
address that this request was sent to, the user agent cannot
verify that the server at 192.0.2.3
is actually owned
by the owners of example.com
. The user agent must
therefore downgrade the report to dns.address_changed
:
{
"age": 0,
"type": "network-error",
"url": "https://example.com/",
"body": {
"sampling_fraction": 1.0,
"server_ip": "192.0.2.3",
"protocol": "http/1.1",
"method": "GET",
"request_headers": {},
"response_headers": {},
"status_code": 0,
"elapsed_time": 0,
"phase": "dns",
"type": "dns.address_changed"
}
}
The user agent then tries to send another request to
192.0.2.1
, but once again isn't able to establish a
connection to the server. Even though the user agent received the
NEL policy from 192.0.2.1
at some point in the
past, the policy's received IP address only records where it
was most recently received from — in this case,
192.0.2.2
. The user agent must therefore downgrade
the report to dns.address_changed
:
{
"age": 0,
"type": "network-error",
"url": "https://example.com/",
"body": {
"sampling_fraction": 1.0,
"server_ip": "192.0.2.1",
"protocol": "http/1.1",
"method": "GET",
"request_headers": {},
"response_headers": {},
"status_code": 0,
"elapsed_time": 0,
"phase": "dns",
"type": "dns.address_changed"
}
}
A typical application requires dozens of resources, the fetching of which is typically initiated via HTML, CSS, or JavaScript. The application requesting such resources can observe failures of most such fetches (e.g. via onerror
callbacks), but it does not have access to the detailed network error report of why the failure has occurred - e.g. DNS failure, TCP error, TLS protocol violation, etc.
To address this, the application can register relevant NEL policies with the user agent for the first-party hosts from which the subresources are being fetched. Then, if such a policy is present and a network error is encountered for a resource from an origin with a registered NEL policy, the user agent will report the detailed network error report and enable the application developers to investigate the error.
In the case where a resource is embedded by a third party, the provider of the resource is often unable to instrument and observe the failure. For example, if example.com
embeds a widget.com/thing.js
resource on its site, and the user visiting example.com
fails to fetch such resource due to a network error, the widget.com
host is both unaware of the failure and unable to detect it.
To address this, widget.com
can register an NEL policy for its host. Then, if such policy is present and a network error is encountered while fetching a resource — regardless of whether it is being requested from a first-party or third-party origin — from the origin with a registered NEL policy, the user agent will report the network error and enable the provider to investigate the error.
NEL provides network error reports that could expose new information about the user's network configuration. For example, an attacker could abuse NEL reporting to probe the user's network configuration, or to scan for servers on the user's internal network. Also, similar to HSTS, HPKP, and pinned CSP policies, the stored NEL policy could be used as a "supercookie" by setting a distinct policy with a custom (per-user) reporting URI to act as an identifier in combination with (or instead of) HTTP cookies.
To mitigate some of the above risks, NEL registration is restricted to potentially trustworthy origins, and delivery of network error reports is similarly restricted to potentially trustworthy origins. This disallows a transient HTTP MITM from trivially abusing NEL as a persistent tracker.
Additionally, the NEL policy cache is partitioned using the network partition key, so that a NEL policy stored for a site in one embedding context will not be used in a different context (for instance, when embedded by a different top-level site.)
NEL is intended to augment existing server-side monitoring. NEL reports should only be sent to the owner of the service being requested. For errors that occur during DNS resolution, NEL reports are only generated when the NEL policy was received from the owner of the domain namespace tree that contains the policy origin. For errors that occur during secure connection establishment or transmission of request and response, NEL reports are only generated when the NEL policy was received from the owner of the server that the request was sent to.
This rationale explains the treatment of the received IP address and subdomains flag of a NEL policy. By checking that the policy's received IP address matches the IP address of the server, NEL extends the trust boundary of the policy to include not just the policy's origin, but also the specific server that the user agent is communicating with. This helps prevent (for instance) DNS rebinding attacks, where an attacker delivers a long-lived NEL policy from a server that they own, and then changes their name servers to resolve the policy origin to a server they don't control. Without the received IP address verification, this would cause user agents to send reports about the second server to the attacker.
Similarly, subdomain NEL policies are limited, and can only be used to generate reports about subdomains of the policy origin during the DNS resolution phase of a request. During this phase, there is no server to verify ownership of, and the fact that the policy was received from a superdomain of the request's origin is enough to establish ownership of the error. This allows the owners of a particular portion of the domain namespace tree to use NEL to detect 7.3 DNS misconfiguration errors, while preventing them from using malicious DNS entries to collect information about servers they don't control.
To prevent information leakage, NEL reports about a request do not contain any information that is not visible to the server when processing the request. For errors during DNS resolution, a NEL report only contains information available from DNS itself. This prevents servers from abusing NEL to collect more information about their users than they already have access to.
As an example, NEL reports specifically do not contain any information about which DNS resolver was used to resolve a request's domain name into an IP address.
In addition to above restrictions, the user agents MUST:
When deploying NEL the developer SHOULD consider privacy implications of NEL reports delivered to the specified collectors. For example, reports may contain URLs with sensitive data (e.g. "Capability URLs") that may need special precautions (see [CAPABILITY-URLS]), and may require the developer to operate their own NEL collectors to prevent reporting of such URLs to third parties.
The permanent message header field registry should be updated with the following registrations ([RFC3864]):
NEL
NEL
response header)This document reuses text from the [CSP] and [RFC6797] specification, as permitted by the licenses of those specifications. Additionally, sincere thanks to Julia Tuttle, Chris Bentzel, Todd Reifsteck, Aaron Heady, and Mark Nottingham for their helpful comments and contributions to this work.
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in: