Software security
=================

Software bugs lead to security problems.
  Rule of thumb: one bug per 1000 lines of code.
  Surprisingly often, bugs lead to security compromises.
  Good mindset to have: any bug can lead to a potential security exploit.
    Even bugs in code that might not seem to be security-critical.
  Another view: security requires much of the program to work correctly.

This module in the class: security in the presence of software bugs.
  Today: overview, motivation, broad approaches.
  Next 3 lectures: specific techniques.

What kinds of bugs might lead to security problems?
  Bugs can be arbitrary, so how do we make some progress here?
  Turns out, we (broadly speaking) have lots of experience with bugs.
  Common classes of bugs that programmers make, leading to security issues.

Memory corruption.
  Used to be extremely prevalent; still reasonably common in some software.
  Sloppy memory operations translate into arbitrary code execution.

Simple example: buffer overflow.

    void f() {
      char buf[128];
      gets(buf);
    }

  What does gets() do?
    Keeps writing input bytes to the buffer, which is passed as a pointer.
    When the end of input is reached, writes a zero byte indicating
    end-of-string.
  What happens if the input is longer than 128 bytes?
    gets() keeps writing data, incrementing the pointer.
    What does that do?  Depends on what's around buf in memory.
    Typically there is stack-frame data, including the return address for f.
    The return address determines what code gets executed when f returns.
    Adversary can completely control execution on f's return.
    (A bounds-checked alternative to gets() is sketched at the end of this
    section.)
  What happens if the stack grows up, instead of down?
    The overflow then runs into another return address (the one for gets).
    Adversary gets to control what code executes on return from gets.

Checked example:

    void g() {
      char buf[N];
      uint32_t n = get_input();   // will get n 16-byte chunks
      for (uint32_t i = 0; i < n; i++) {
        // read into buf[i*16] .. buf[i*16+15]
      }
    }

  What check should we add?
    Candidate: if (n * 16 >= N) { return; }
    Potential problem: what if n = 2^30?
      In 32-bit arithmetic, 2^30 * 16 wraps around to 0.
      The check passes just fine, but the overflow still happens.
    (An overflow-free version of this check is sketched at the end of this
    section.)

Many variants of memory corruption.
  C requires programmers to follow many rules to ensure memory safety.
  Easy to make a mistake in C code, with dramatic consequences.

Use-after-free example:

    void h() {
      char *buf = malloc(N);
      int err = 0;
      read(0, buf, N);
      if (strncmp(buf, "GET", 3)) {
        // not a GET request: flag an error and release the buffer
        err = 1;
        free(buf);
      }
      ...
      if (err) {
        printf("Error processing request: %s\n", buf);
      }
    }

  What might go wrong?
    The error path prints the contents of buf.
    But buf was freed and might have been reused for something else.
    E.g., another code path might allocate memory for a cryptographic key.
    Adversary could send other concurrent requests to trigger such code paths.
    Could reveal sensitive data (e.g., a crypto key) to the adversary!

Use-after-free bugs are the most prevalent memory errors today.
  Lead either to leakage of sensitive data or to corruption (e.g., of
  function pointers).
  Tricky to prevent with just range checks; need lifetime checks.
    Either in the type system at compile time (e.g., Rust).
    Or in "band-aid" runtime checks (but tricky with memory re-allocation,
    etc.).

What if you write all of your code in Python?
  Python runtime written in C.
  Python modules use libraries written in C.
  Underlying OS kernel, etc., written in C.

Some hope: newer languages like Rust provide stronger memory-safety
guarantees.
  Harder to make mistakes that lead to these kinds of memory corruption.
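Aside (not in the original notes): a minimal sketch of how f() could bound its
read using the standard fgets() function, which writes at most size-1 input
bytes plus a terminating zero byte and therefore cannot run past the end of
buf.

    #include <stdio.h>

    void f_safe(void) {
      char buf[128];
      // fgets() stops after sizeof(buf)-1 bytes (or at end-of-line / EOF)
      // and always zero-terminates, so the write stays inside buf.
      if (fgets(buf, sizeof(buf), stdin) == NULL) {
        return;   // end of input, or a read error
      }
      // ... use buf (it may include a trailing newline) ...
    }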
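Aside (not in the original notes): one way to write the missing check for g()
is to compare n against N/16, so that no multiplication can wrap around before
the check.  Sketch only: N and get_input() are taken from the example above
(N given an arbitrary concrete value here), and the per-chunk read is left as
a comment, as in the original.

    #include <stdint.h>

    #define N 4096                 /* arbitrary buffer size for this sketch */

    uint32_t get_input(void);      /* untrusted chunk count, as in the example */

    void g_checked(void) {
      char buf[N];
      uint32_t n = get_input();

      /* Reject any n for which n*16 would exceed N, without ever computing
         n*16 (which could wrap around in 32-bit arithmetic). */
      if (n > N / 16) {
        return;
      }

      for (uint32_t i = 0; i < n; i++) {
        // read into buf[i*16] .. buf[i*16+15]; the index now provably
        // stays within buf
      }
    }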
Another common category of problems: encoding / decoding.
  Challenging to correctly encode or decode untrusted data.

Encoding example: SQL injection.
  Applications often store data in a SQL database.
  Database is accessed over a text-oriented query interface.
  Application formulates a query, sends it to the database.
    E.g., SELECT name FROM users WHERE phone="617-253-6005"
    Might be used by the application to look up the name for a phone number.
  Common pattern (perhaps becoming less so): just use string concatenation.
  What if the adversary supplies the phone number?
    Suppose the adversary supplies a phone number of:
      617" OR email="nz@mit.edu
    Can find the name for a given email address.
    Or even:
      617"; DELETE FROM users
    Might cause the database to select some name, then delete all user data.

Encoding example: cross-site scripting.
  Web pages can contain Javascript code.
  Javascript code can access sensitive state in the user's web browser.
    E.g., an HTTP cookie often contains a secret token for accessing the
    user's login session.
  Web applications might embed user data when constructing web pages.
  Setup:
    Adversary --[adversary's data]--> Web application --[web page]--> Victim
  Suppose the web application wants to include a list of user names
  (incl. the adversary's).
  Build up a list like this: