The OSCT (Operational Security Coordination Team) has produced this document with the aim to minimize the impact of security incidents, to encourage post-mortem analysis and to promote the cooperation between the sites. It is based on the EGEE Incident Response policy.
The procedure described in the document is aiming at complementing the security policies already existing at the sites giving, at the same time, a simplified guideline of the most important steps to be taken in case of a security incident.
This procedure is aimed at minimising the impact of security incidents, by encouraging post-mortem analysis and promoting cooperation between grid sites. It is based on the EGEE Incident Response policy.
This grid incident response procedure is aiming at complementing local security procedures.
A security incident is the act of violating an explicit or implied security policy (ie: local security policy, EGEE Acceptable Use Policy)
This document is intended for grid site security contacts and site administrators and is primarily aimed at reporting security incidents.
When a security incident affecting grid hosts is suspected, the following procedure MUST be followed:
- Report a downtime for the affected hosts on the GOCDB
- Send an EGEE broadcast announcing the downtime for the affected hosts Use "Security operations in progress" as the reason with no additional detail both for the broadcast and the GOCDB. This can be done at the GOCDB broadcast page
This step MUST be completed within one working day after the incident has been detected.
- Identify and kill suspicious process(es) as appropriate, but aim at preserving the information they could have generated, if possible both in memory and on disk.The objective is to understand the source and the cause of the incident, the affected credentials and services, and the possible implications for the infrastructure.
- If it is suspected that some grid credentials have been abused or compromised, you MUST ensure the relevant accounts have been suspended.
- If it is suspected that some grid credentials have been abused, you MUST ensure that the relevant VO manager(s) have been informed. VO contacts are available at the CIC Portal
- If it is suspected that some grid credentials have been compromised, you MUST ensure that the relevant CA has been informed. CA contacts are available at the EuGRIDPMA web site.
- If needed, seek for help from your local security team or from your ROC Security Contact or from OSCT (project-egee-security-support (at) cern.ch)
- If relevant, additional reports containing suspicious patterns, IP addresses, files or evidence that may be of use to other Grid participants SHOULD be sent to project-egee-security-csirts (at) in2p3.fr. Never send potentially sensitive information (ex: IP addresses, usernames) without clearance from your local security team and/or your ROC Security Contact.
- Host(s) affected (ex: compromised hosts, hosts running suspicious user code)Throughout step 5, requests from the Operational Security Coordination Team MUST be followed-up within 4 hours.
- Host(s) used as a local entry point to the site (ex: UI or WMS IP address)
- Remote IP address(es) of the attacker
- Evidence of the compromise, including timestamps (ex: suspicious files or log entry)
- What was lost, details of the attack (ex: compromised credentials, (root) compromised host)
- If available and relevant, the list of other sites possibly affected
- If available and relevant, possible vulnerabilities exploited by the attacker
- The actions taken to resolve the incident
Should a security incident be suspected, the use of the following email templates is encouraged.
The first template is aimed at notifying the grid participants soon after the incident has been discovered (heads-up), as described in Step 3 of the procedure above.
FROM: <your_email_address@your_organisation>
TO: < project-egee-security-csirts@in2p3.fr >
SUBJECT: Security incident suspected at <your site>
** PLEASE DO NOT REDISTRIBUTE ** EGEE-<DATE> (ex: EGEE-20090531)
** This message is sent to the EGEE CSIRTs and must NOT be publicly archived **
Dear CSIRTs,
It seems a security incident has been detected at <your site>.
Summary of the information available so far:
<Ex: A malicious SSH connection was detected from 012.012.012.012. The extent of the incident is
unclear for now, and more information will be published in the coming hours as forensics are
progressing at our site. However, all sites should check for successful SSH connection from
012.012.012.012 as a precautionary measure.>
The second template can be used to provide a detailed view of the incident, and may be completed and reposted as the investigation progresses, as described in Step 5 of the procedure above.
It is also possible to use the following moderated Web form.
FROM: <your_email_address@your_organisation>
TO: < project-egee-security-csirts (at) in2p3.fr >
SUBJECT: Security incident suspected at <your site>
** PLEASE DO NOT REDISTRIBUTE ** EGEE-<DATE> (ex: EGEE-20090531)
** This message is sent to the EGEE CSIRTs and must NOT be publicly archived **
Dear CSIRTs,
It seems a security incident has been detected at <your site>.
- Short summary of the incident
<Provide a high level overview of the incident>
- Host(s) affected
< List of compromised hosts and/or hosts running suspicious user code.
ex: grid-worker-node-124.mysite.org (123.123.123.123)>
- Host(s) used as a local entry point to the site (ex: UI or WMS IP address)
CERN-LCG-EDMS-867454 version 1.3 Last Saved on 20 Apr 2009
<The host that the attacker is likely to have used to access the site.
ex: grid-ui-101.mysite.org (123.123.123.124)>
- Remote IP address(es) of the attacker
<The remote host from where the attacker is likely to have connected from.
ex: 123.adsl.somecorp.com (012.012.012.012)>
- Evidence of the compromise, including timestamps (ex: suspicious files or log entry)
<Ex: the attacker logged in has root from 123.adsl.somecorp.com. Times are UTC:
Mar 24 12:00:09 grid-ui-101 sshd[13896]: Accepted password for root from 012.012.012.012>
- What was lost, details of the attack
< Provide available details on the extent of the compromise. For ex:
System logs revealed the attacker guested the root password of grid-ui-101 on Mar 24 12:00:09
(UTC) after hundreds of attempts. Then, the attacker [...] etc.>
- If available and relevant, the list of other sites possibly affected
<Ex: firewall logs reveals suspicious SSH connections from the compromised node to grid-
ui.friendlysite.org on Mar 24 13:01:03 (UTC). friendlysite.org has been contacted.>
- Possible vulnerabilities exploited by the attacker
<Ex: the attacker exploited a weak root password and gained further access by exploiting
CVE-20091234 against [...] etc.>
- The actions taken to resolve the incident
<Ex: Disc images have been saved, hosts have been reinstalled from scratched with new, strong root
passwords, and SSH has been configured to prevent "root" logins with password.>
- Recommendations for other sites, actions suggested
<Ex: Sites should check and report any successful SSH connection grid-ui-101 between Mar 24
12:00:09 (UTC) and Mar 24 17:00:00 (UTC).
It is also recommended to avoid direct SSH access, and to configure sshd with "PermitRootLogin
without-password".>
- Timeline of the incident
<Ex:
2009-03-24 09:12:43 Multiple SSH connection attempts from 012.012.012.012
2009-03-24 12:00:09 Attacker connects as root on grid-ui-101.mysite.org from 012.012.012.012
2009-03-24 13:01:03 SSH scan from grid-ui-101 against grid-ui.friendlysite.org
[,,,]
2009-03-24 15:00:00 Site security team investigating
2009-03-24 15:34:00 EGEE CSIRTs informed via project-egee-security-csirts@in2p3.fr
[...]>
A security incident coordinator from the EGEE Operational Security Coordination Team (OSCT,
http://cern.ch/OSCT) appointed for each incident.
The role of OSCT incident coordinator includes:
RFC 2350 - Expectations for Computer Security Incident Response
RFC 2196 - Site Security Handbook
RFC 3013 – Guidelines for Evidence Collection and Archiving
IETF Extended Incident Handling (INCH)
IETF Incident Object Description Exchange Format (IODEF)
LCG Security Group, Agreement on Incident Response
CERT/CC - Handbook for Computer Security Incident Response Teams
CERT/CC - Incident Reporting Guidelines
CERT/CC - Creating a Computer Security Incident Response Team: A Process for Getting
Started
CERT/CC - State of the Practice of Computer Security Incident Response Teams (CSIRTs)