Email sender authentication: advantages and shortcomings

2007-08-01

Alberto Treviño

Brigham Young University, USA

J. J. Ekstrom Ph.D.

Brigham Young University, USA

Editor: Helen Martin

Abstract

Alberto Treviño and Dr J. J. Ekstrom detail the advantages and shortcomings of email sender authentication as an anti-spam technique, explaining why they consider it a useful technique in the fight against spam.

Table of contents

Introduction
Email authentication
Pros and cons of email verification
Proper use of email authentication
Sender authentication without SPF or SenderID
How sender authentication can help reduce spam
Conclusions and future work

Introduction

The severity of the spam and malware calamity should not be new to readers of this publication; neither should the close relationship between the two, nor the difficulty in combating them. For new readers, things can be summed up as follows: the situation is perilous, spam and malware go hand in hand, and defence is difficult and expensive. It often seems we are in an arms race, with each side trying to out-gun their opponent. Although we all secretly hope for and seek the magic weapon that will obliterate the problem once and for all, the reality is such a weapon does not currently exist, and we still rely on our tried and tested methods of defence to help us survive.

Email authentication

During the last few years we have seen the emergence of several technologies commonly referred to as ‘email authentication’. Email authentication attempts to verify and certify the authenticity of an email. Email authentication is usually divided into two categories [1]:

Sender authentication attempts to verify the validity of an email’s source. More specifically, it attempts to detect header forgeries and ensures that emails from a particular domain are sent from trusted ystems. Well-known specifications in this category include Sender Policy Framework (SPF) and SenderID.
Content authentication ensures that the contents of an email have not been tampered with or modified in any way during delivery. A well-known specification in this category is DomainKeys, which recently received RFC standard status.

This paper will focus mainly on sender authentication.

Pros and cons of email verification

In reality it is hard to find a multi-purpose tool that is the best at everything. In order to use a tool effectively, it is essential to understand its strengths and weaknesses. The same applies to email authentication. In order to use it effectively, it is necessary to understand what it can do well, what it can’t do very well, and what it can’t do at all.

To understand properly the best application of email authentication technologies let’s begin with their deficiencies:

Email authentication is not a silver bullet. It will not eliminate spam once and for all and it will certainly not do it overnight.
It will not stop spammers from attempting to spew spam.
It will not tell you directly if a message is spam or ham.
Spammers can also make use of these technologies and ‘authenticate’ in the same way as legitimate email.

These deficiencies have been the cause of heavy criticism from some in the internet community. Several have even labelled authentication technologies as ‘useless’. We disagree. If used properly, email authentication can help reduce (not eliminate) spam.

Proper use of email authentication

So how do you use email authentication properly? You use it to help establish trust. For example, sender authentication helps answer the questions ‘is this email coming from the place it should come from?’ If the sender authentication fails, we have reason to doubt the legitimacy of the email. But if the authentication is successful it won’t automatically mean the email is legitimate.

In other words, failed authentication does not make an email spam and successful authentication does not make it ham. This is the area where many have problems understanding the proper use of email authentication. They erroneously believe that authentication by default equates to legitimacy. To their credit, spammers were quick to point out this flawed idea. Unfortunately, many who made this mistake were quick to blame the technology rather than their own misunderstanding of the technology for its failures and precipitately dubbed email authentication useless.

So what does email authentication do for us? Let’s begin with sender authentication in the form of SPF and its cousin SenderID.

SPF publishes rules via DNS records about which SMTP servers are allowed to send email for a particular domain. If we receive an email from a server that does not conform to the published rules, the possibility of header forgery for that email increases. SPF (if not set up too broadly) effectively reduces the number of SMTP servers that can send email for a given domain from hundreds of millions to a mere handful. This can help reduce the impact of botnets and help catch phishing attacks.

Message authentication can also help authenticate email by providing a verifiable signature for each email message. This helps us determine whether the message was signed by a valid server and whether the message has arrived unmodified.

Sender authentication without SPF or SenderID

One major shortcoming of sender authentication technologies is that they are not currently mandated. Not all domains have deployed these technologies and they don’t enjoy broad deployment. Politics have also played a role, with many early adopters taking sides with one technology or the other.

During the MIT Spam Conference 2007 [2] we presented a technique similar to SPF that uses existing DNS records to authenticate email. We called this technique ‘Relay Detection’ [3]. It works under the assumption that legitimate email is rarely relayed these days, and that spammers continue to relay spam, mostly through botnets. In other words, a Hotmail user will send email through Hotmail servers while a spammer claiming to come from Hotmail will use an open relay or a machine in a botnet to send the email.

The first step in Relay Detection is to verify the HELO identity of the sender SMTP, either during the SMTP transmission or through the correct Received header (see Figure 1). The SMTP standards indicate the sender SMTP should send its Fully Qualified Domain Name (FQDN). Therefore we verify the validity of the HELO identity as follows:

Verify it is a FQDN.
Verify it against existing DNS entries by performing a forward DNS lookup and comparing the DNS record’s IP address to the sender SMTP IP address.
As a last resort, perform a reverse DNS lookup on the sender SMTP IP address and compare the two domain names for consistency.
Figure 1. Sender SMTP identity verification.

In our preliminary testing, these simple tests were capable of detecting over 60% of spam with less than 0.35% false positives. Although these numbers may not be terribly impressive, they can be extremely useful if these checks are implemented at the SMTP level before the email is even received.

The second step in Relay Detection is to verify the second command in the SMTP transmission: MAIL FROM (see Figure 2). If done after the SMTP transmission, you can use the Return-Path header. The main goal of this test will be to establish through DNS records a relationship between the domain information given in the HELO command with the domain in the MAIL FROM address. You can verify the validity of the MAIL FROM address as follows:

Extract the domain part of the email address (hotmail.com, gmail.com, etc.).
Compare the domain in the email address with the domain in the HELO identity. If they match (e.g. hotmail.com and mx4.hotmail.com would match), the verification passes.
If the domains don’t match, you perform an MX DNS lookup on the email domain. If any domain in the MX record matches the HELO identity domain, the verification passes.
If the MX and HELO domains don’t match, you perform a forward DNS lookup on the MX records to obtain their IP address. Then you compare the MX record IP addresses with the address of the sender SMTP. If the IP addresses are in the same IP address range (even in the same set of /16 is OK), the verification passes.
As a last resort, you can perform a DNS lookup on the email address domain and see if any of the IP addresses are in the same IP address range as the sender SMTP.
Figure 2. MAIL FROM verification.

In our preliminary testing we were able to identify 78% of the spam messages not flagged by the Sender SMTP Verification tests with roughly a 0.5% false positive rate. Overall, these two forms of verification flagged over 90% of spam with a less than 1% false positive rate.

Be aware that the tests and their order are not arbitrary. They are designed to work for nearly all email configurations, including domains in virtual hosting environments.

The only cases where these tests fail is when headers are forged or email is relayed (after all, that’s what they are looking for). However, these tests are usually very fast and consume very little processing time, with most of the time being spent waiting for DNS queries.

How sender authentication can help reduce spam

It is important to note that sender authentication technologies have been adopted by many spammers. This greatly reduces the efficacy of email authentication as a spam-fighting tool.

Many who have encountered this unfortunate fact have concluded that these technologies simply ‘don’t work’ and are not worth the effort. Luckily, email authentication is still helpful in the fight against spam. Just because a screwdriver doesn’t make a good hammer doesn’t mean it should be thrown out.

As we mentioned earlier, email authentication helps the initial establishment of trust. Sender authentication in any or all of its flavours does one thing very well: it reduces the number of servers that can legitimately send email for a domain to a privileged few.

If used properly, sender authentication prevents header forgeries, forces spammers to be more honest and pushes them to start using their own domains. This has important positive implications for mitigating phishing attacks. For example, a scam artist trying to impersonate a bank (such as Bank of America) cannot simply forge the email headers any more.

In order to bypass these email authentication techniques the spammer would have to:

Set up a new domain.
Set up all proper DNS records, including MX records for the machines in the botnet.
Set up SPF/Sender ID records.

Not only does this start to affect the economics of spam, it also plants the seed of distrust for the receiver. An astute recipient receiving an email from ‘Bank of America Support’ <[email protected]> may wonder why the email did not originate from bankofamerica.com. Not all email users may be so savvy but anti-phishing techniques like those used by modern browsers could also help diffuse the attack.

Conclusions and future work

Although our research may not be seen as revolutionary, it has helped us confirm several things:

Many spammers continue to relay email.
Simple header analysis combined with DNS lookups can be used to authenticate an email’s sender.
Sender authentication (if used properly) can still help reduce spam.
The only way for spammers to circumvent any form of email authentication is to be more honest and comply with existing and emerging standards.
Email authentication technologies work. They may not eliminate all spam, but they can help eliminate some of the spam.

In continuing with our research we may investigate further the relationship that could be established between sender identification and domain trust.

With sender authentication technologies helping verify the source of incoming email, we can more fully develop trust relationships between domains.

With the help of other spam-filtering technologies and the confidence that an email is coming from a verified source, it should be possible to measure a domain’s tendency to spew spam. Those measurements could be calculated automatically from incoming email and could provide another measure to help determine the probability of a message being spam. Even though several companies specialize in the analysis of spam for the purposes of publishing blacklists and such, this technique may improve blacklist and whitelist efficiency by ensuring the lists are relevant to the spam and legitimate email received by a particular host.

Another possible direction for our research is to use sender authentication as a means of prioritizing spam filtering, virus scanning and delivery queues. Along with the domain spam measurements mentioned above, incoming SMTP connections could be prioritized and handled appropriately, allowing efficient routing. For example, an untrusted domain could be greeted by a tar pit in an effort to slow the spammer and wait for the sender to give up. At the same time messages from another domain with a good ham record but poor virus record could immediately be sent through a very thorough virus scanner. And, in some cases, it may even be possible that some domains could bypass spam filtering and virus scanning altogether and be queued for immediate delivery.

These ideas are still in their early infancy. We are barely exploring the many possibilities provided by sender authentication. In either case, we feel confident that email authentication (both sender authentication and message authentication) can help reduce email if it is used as a tool for establishing trust and not mistaken as a complete spam solution in and of itself.

Bibliography

[1] http://www.openspf.org/Related_Solutions.

[2] http://www.spamconference.org/.

[3] For the full paper and presentation follow-up see http://mel.byu.edu/spam/.

Latest articles:

Nexus Android banking botnet – compromising C&C panels and dissecting mobile AppInjects

Aditya Sood & Rohit Bansal provide details of a security vulnerability in the Nexus Android botnet C&C panel that was exploited to compromise the C&C panel in order to gather threat intelligence, and present a model of mobile AppInjects.

Cryptojacking on the fly: TeamTNT using NVIDIA drivers to mine cryptocurrency

TeamTNT is known for attacking insecure and vulnerable Kubernetes deployments in order to infiltrate organizations’ dedicated environments and transform them into attack launchpads. In this article Aditya Sood presents a new module introduced by…

Collector-stealer: a Russian origin credential and information extractor

Collector-stealer, a piece of malware of Russian origin, is heavily used on the Internet to exfiltrate sensitive data from end-user systems and store it in its C&C panels. In this article, researchers Aditya K Sood and Rohit Chaturvedi present a 360…

Fighting Fire with Fire

In 1989, Joe Wells encountered his first virus: Jerusalem. He disassembled the virus, and from that moment onward, was intrigued by the properties of these small pieces of self-replicating code. Joe Wells was an expert on computer viruses, was partly…

Run your malicious VBA macros anywhere!

Kurt Natvig wanted to understand whether it’s possible to recompile VBA macros to another language, which could then easily be ‘run’ on any gateway, thus revealing a sample’s true nature in a safe manner. In this article he explains how he recompiled…

Bulletin Archive