Providence Salumu

Naïve Haskell data representation

A value requires a constructor, plus arguments
- At runtime, need to determine a value's constructor, but not its type
  (Compiler already type-checked program, so no runtime type checks)
```
struct Val {
  unsigned long constrno; /* constructor # */
  struct Val *args[];     /* flexible array */
};
```
- For a type like [Int], constrno might be 0 for [] and 1 for (:), where [] has 0-sized args and (:) has 2-element args
- For a type like Int, constrno can be the actual integer, with no args
- For a single-constructor type (e.g., Point) constrno not used
Problems with our approach so far
- No way to represent exceptions or thunks
- Garbage collector needs to know how many elements are in args
- Small values such as Ints always require chasing a pointer

Metadata for values

Let's add a level of indirection to describe values

typedef struct Val {
  const struct ValInfo *info;
  struct Val *args[];
} Val;

struct ValInfo {
  struct GCInfo gcInfo;  /* for garbage collector */
  enum { THUNK, CONSTRNO, FUNC, IND } tag;
  union {
    Exception *(*thunk) (Val *closure);
    unsigned int constrno;
    Val *(*func) (const Val *closure, const Val *arg);
  };
};

gcInfo says how many Val *s are in args and where they are
tag == CONSTRNO means constrno valid, used as on last slide
tag == IND means args[0] is an indirect forwarding pointer to another Val and union is unused; useful if size of args grows

Function values

A Val whose ValInfo has tag == FUNC uses the func field

    Val *(*func) (const Val *closure, const Val *arg);

closure is the Val whose ValInfo contains func
- Provides an environment so ValInfo/func can be re-used
arg is the function argument
Assume all functions take one argument
- Logically this is fine since we have currying
- For performance, real compilers must optimize multi-argument case
To apply function f to argument a, where both are type Val *:
```
        f->info->func (f, a);
```

Closures

Top-level bindings don't need closures
```
addOne :: Int -> Int
addOne x = x + 1
```
- The Val for function addOne can have zero-length args
Local bindings may need environment values in closure
```
add :: Int -> (Int -> Int)
add n = \m -> addn m
    where addn m = n + m
```
- Compiler will only emit code for local function addn once
- But logically, there is a separate addn function (with a different n) for each invocation of add
- Each addn instance is a different Val, but all share same ValInfo
- Use args[0] in each Val to specify value of n

Thunk values

A Val with tag == THUNK uses the thunk field in ValInfo
```
    Exception *(*thunk) (Val *closure);
```
- Updates v (turns it into non-thunk) or returns a non-NULL Exception *
To evaluate a thunk:
```
        v->info->thunk (v);
```
Two big differences between thunks and functions
- A function takes an argument, while a thunk does not
- A function value is immutable, while a thunk updates itself
Note also that a thunk may throw an exception
- Functions can, too, but for simplicity let's implement it by having the function return a thunk that throws an exception

Forcing

Turning a thunk into a non-thunk is known as forcing it
What if a thunk's return value doesn't fit in thunk's args?
- This is why we have the IND ValInfo tag--Allocate new Val, place indirect forwarding pointer in old Val

A possible implementation of forcing that walks IND pointers:

Exception *force (Val **vp)
{
  for (;;) {
    if ((*vp)->info->tag == IND)
      *vp = (*vp)->args[0];
    else if ((*vp)->info->tag == THUNK) {
      Exception *e = (*vp)->info->thunk (*vp);
      if (e)
        return e;
    }
    else
      return NULL;
  }
}

Currying

Let's use a simple implementation of currying (GHC's is much more complex)
Set closure->args to head of list of previously curried args
```
const3 :: a -> b -> c -> a
const3 a b c = a
```
- Compiler emits 3 ValInfos and 3 functions for const3
- Top-level binding's ValInfo has func = const3_1
- const3_1 creates a Val where args[0] is the first argument (a) and info->func is const3_2
- const3_2 creates a Val where args[0] is the second argument (b), args[1] is closure, and info->func is const3_3
- const3_3 has access to all arguments and actually implements const3

Shared arguments have common arg tails, only evaluated once

    let f = const3 (superExpensive 5) -- evaluated once
    in (f 1 2, f 3 4)
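One way to check the sharing claim is to instrument the expensive argument with `Debug.Trace.trace`. This is only a sketch: `superExpensive` here is a stand-in that squares its input, and `shared` is a name introduced for illustration. Under GHC the trace message should be printed to stderr a single time, because both applications of `f` refer to the same captured thunk.

```haskell
import Debug.Trace (trace)

const3 :: a -> b -> c -> a
const3 a _ _ = a

-- Stand-in for a costly computation; announces when it is actually run.
superExpensive :: Int -> Int
superExpensive n = trace "superExpensive evaluated" (n * n)

-- Both components reuse the same partially applied f.
shared :: (Int, Int)
shared =
  let f = const3 (superExpensive 5)
  in (f (1 :: Int) (2 :: Int), f (3 :: Int) (4 :: Int))

main :: IO ()
main = print shared  -- (25,25)
```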

Unboxed types

Unfortunately, now Int has even more overhead
- To use, must check i->info->tag then access i->info->constrno
- Moreover, each number needs a distinct ValInfo structure

Idea: Have special unboxed types that don't use struct Val

union Arg {
  struct Val *boxed;     /* most values are boxed */
  unsigned long unboxed; /* "primitive" values */
};

typedef struct Val {
  const struct ValInfo *info;
  union Arg args[];   /* args can be boxed or unboxed */
} Val;

Unboxed types have no constructor and cannot be thunks
Can fit in a single register or take the place of a Val * arg
Must extend GCInfo to identify which args are and are not boxed

Unboxed types in GHC

GHC exposes unboxed types (even though not part of Haskell)
- Symbols use # character--must enable with -XMagicHash option
- Have unboxed types (Int#) and primitive operations on them (+#)
- See GHC.Prim or type ":browse GHC.Prim" in GHCi
- Also have unboxed constants--2#, 'a'#, 2## (unsigned), 2.0##

What is Int really?

Single-constructor data type, with a single, unboxed argument

Prelude> :set -XMagicHash
Prelude> :m +GHC.Types GHC.Prim
Prelude GHC.Types GHC.Prim> :i Int
data Int = I# Int#      -- Defined in GHC.Types
...
Prelude GHC.Types GHC.Prim> case 1 of I# u -> I# (u +# 2#)
3

Lets Int contain thunk, but avoids pointer dereference once evaluated
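The GHCi session above can also be written as a compiled module. A small sketch, assuming the names are imported from `GHC.Exts` (which re-exports `I#` and `+#`); `addTwo` is an illustrative name, not part of the original.

```haskell
{-# LANGUAGE MagicHash #-}
import GHC.Exts (Int (I#), (+#))

-- Unwrap the boxed Int, add on the raw machine integer, and re-box.
addTwo :: Int -> Int
addTwo (I# u) = I# (u +# 2#)

main :: IO ()
main = print (addTwo 1)  -- 3
```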

Restrictions on unboxed types

Cannot instantiate type variables with unboxed types

{-# LANGUAGE MagicHash #-}
import GHC.Prim

data FastPoint = FastPoint Double# Double#  -- ok
fp = FastPoint 2.0## 2.0##                  -- ok

-- Error: can't pass unboxed type to polymorphic function
fp' = FastPoint 2.0## (id 2.0##)

-- Error: can't use unboxed type as type parameter
noInt :: Maybe Int#
noInt = Nothing

Enforced by making unboxed types a different kind of type
```
Prelude GHC.Types GHC.Prim> :kind Int#
Int# :: #
```
- Recall type variables have kinds with stars (∗, ∗ → ∗, etc.), never #

`seq` revisited

Recall seq :: a -> b -> b
- If seq a b is forced, then first a is forced, then b is forced and returned

Consider the following code:

infiniteLoop = infiniteLoop :: Char   -- loops forever

seqTest1 = infiniteLoop `seq` "Hello" -- loops forever

seqTest2 = str `seq` length str       -- returns 6
    where str = infiniteLoop:"Hello"

seqTest1 hangs forever, while seqTest2 happily returns 6

seq only forces a Val, not the arg fields of the Val
- seqTest2's seq forces str's constructor (:), but not the head or tail
- This is known as putting str in Weak Head Normal Form (WHNF)
- Can't fully evaluate an arbitrary data type (but see Control.DeepSeq)
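A further sketch of the WHNF point, using a pair instead of a list. The names `pair`, `whnfOnly`, and `spineOnly` are introduced here for illustration. Forcing the pair with `seq` exposes only the `(,)` constructor, so the undefined first component is never touched; likewise `length` walks the list spine without forcing elements.

```haskell
-- The first component is bottom, but the pair constructor itself is fine.
pair :: (Int, Int)
pair = (undefined, 1)

-- seq forces pair only to WHNF, so the undefined field is not evaluated.
whnfOnly :: Int
whnfOnly = pair `seq` snd pair

-- length inspects the (:) cells but never the elements.
spineOnly :: Int
spineOnly = length [undefined, undefined :: Int]

main :: IO ()
main = do
  print whnfOnly   -- 1
  print spineOnly  -- 2
```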

Example: `seq` implementation

Val *seq_2 (Val *a, Val *b)
{ /* a is the closure built by seq_1 (first arg in a->args[0]);
     b is the second argument itself */
  Val *val = gc_malloc (offsetof (Val, args[2]));
  val->info = &seq_info;
  val->args[0] = a->args[0];
  val->args[1] = b;
  return val;
}

struct ValInfo seq_info = {
  some_gcinfo, THUNK, .thunk = &seq_thunk
};

Exception *seq_thunk (Val *c)
{
  Exception *e = force (&c->args[0]);
  if (!e) {
    c->info = &ind_info;     /* ValInfo with tag IND */
    c->args[0] = c->args[1]; /* forward to b */
  }
  return e;
}

Strictness revisited

Recall strictness flag on fields in data declarations
```
data IntWrapper = IntWrapper !Int
```
- Int has ! before it, meaning it must be strict
- Strict means the Int's ValInfo cannot have tag THUNK or IND
Accessing a strict Int touches only one cache line
- Recall data Int = I# Int# has only one constructor
- Plus strict flag means tag == CONSTRNO, so know what's in ValInfo
- Plus Int# is unboxed
- Thus, once IntWrapper forced, immediately safe to access Int as
```
    myIntWrapper.args[0].boxed->args[0].unboxed
```

Semantic effects of strictness

Strictness is primarily used for optimization
- To avoid building up long chains of thunks
- To save overhead of checking whether thunk evaluated
But has semantic effects: A non-strict Int is not just a number
- Can also throw an exception or loop forever when evaluated
- Such behavior can be modeled as a special value ⊥ ("bottom")
- So the values of Int are {0, 1}⁶⁴ ∪ {⊥}
- Types that include value ⊥ are called lifted
Note 1: an unboxed type is necessarily unlifted

Note 2: !Int not a first-class type, only valid for data fields

data SMaybe a = SJust !a | SNothing   -- ok, data field
strictAdd :: !Int -> !Int -> !Int     -- error
type StrictMaybeInt = Maybe !Int      -- error

`case` statements revisited

case statement pattern matching can force thunks
- An irrefutable pattern is one that always matches
- A pattern consisting of a single variable or _ is irrefutable
- Any non-irrefutable pattern forces evaluation of the argument
- Matching happens top-to-bottom, and left-to-right within alternatives

Function pattern matching is the same as (desugared into) case

undefined :: a is Prelude symbol with value ⊥, handy for testing

f ('a':'b':rest) = rest
f _              = "ok"
test1 = f (undefined:[])   -- error
test2 = f ('a':undefined)  -- error
test3 = f ('x':undefined)  -- "ok" (didn't force tail)

Adding ~ before a pattern makes it irrefutable

three = (\ ~(h:t) -> 3) undefined  -- evaluates to 3
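The three tests above can be run without crashing the program by catching the exception that `undefined` raises. A sketch under the assumption that forcing the result to WHNF with `evaluate` is enough to trigger the pattern match; `diverges` is a helper introduced here, not part of the original.

```haskell
import Control.Exception (SomeException, evaluate, try)

f :: String -> String
f ('a':'b':rest) = rest
f _              = "ok"

-- Hypothetical helper: True if forcing the string to WHNF throws.
diverges :: String -> IO Bool
diverges s = do
  r <- try (evaluate s) :: IO (Either SomeException String)
  return (either (const True) (const False) r)

-- The lazy pattern is never inspected, so undefined is never forced.
three :: Int
three = (\ ~(_:_) -> 3) (undefined :: String)

main :: IO ()
main = do
  diverges (f (undefined : [])) >>= print   -- True  (head forced)
  diverges (f ('a' : undefined)) >>= print  -- True  (tail forced)
  diverges (f ('x' : undefined)) >>= print  -- False (tail untouched)
  print three                               -- 3
```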

`newtype` declarations

We've seen two ways to introduce new types
- data -- creates a new (boxed) type, adding overhead of a Val wrapper
- type -- creates an alias for an existing type, with no overhead
Sometimes you want a new type implemented by an existing type
- E.g., might want Meters, Seconds, Grams, all implemented by Double
- Using type would make them all synonymous, facilitating errors
- Might want different instances of Show for each, impossible with type
- Could say data Meters = Meters Double -- but will add overhead
The newtype keyword introduces new type with no overhead
- Use just like data, but limited to one constructor and one field
- This is possible because all type-checking is compile-time

`newtype` semantics

What's the semantic difference between these two declarations?

newtype NTInt = NTInt Int deriving (Show)

data SInt = SInt !Int deriving (Show)

The NTInt constructor is a "fake" compile-time-only construct

A case statement deconstructing a newtype compiles to nothing

newtype NTInt = NTInt Int deriving (Show)
uNTInt = NTInt undefined
testNT = case uNTInt of NTInt _ -> True   -- returns True

Conversely, forcing a value (by matching constructor) forces strict fields

data SInt = SInt !Int deriving (Show)
uSInt = SInt undefined
testS = case uSInt of SInt _ -> True      -- undefined
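Both cases can be checked side by side in one program. This sketch catches the exception from the strict field with `try` and `evaluate` so that the program keeps running; `strictFieldThrows` is a name introduced here.

```haskell
import Control.Exception (SomeException, evaluate, try)

newtype NTInt = NTInt Int
data SInt = SInt !Int

-- Matching a newtype constructor does no work at runtime.
testNT :: Bool
testNT = case NTInt undefined of NTInt _ -> True

-- Matching SInt forces the value, which forces the strict field.
testS :: Bool
testS = case SInt undefined of SInt _ -> True

-- Hypothetical helper: True if evaluating testS throws.
strictFieldThrows :: IO Bool
strictFieldThrows = do
  r <- try (evaluate testS) :: IO (Either SomeException Bool)
  return (either (const True) (const False) r)

main :: IO ()
main = do
  print testNT                 -- True
  strictFieldThrows >>= print  -- True
```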

The `UNPACK` pragma

newtype almost always better than data when it applies
What about a multi-field data type?
```
data TwoInts = TwoInts !Int !Int
```
- Fields are strict, we know they'll have CONSTRNO ValInfo
- Why not stick the Int#s directly into the args of a TwoInts Val?
- GHC provides an UNPACK pragma to do just this
```
data TwoInts = TwoInts {-# UNPACK #-} !Int {-# UNPACK #-} !Int
```
- Works for any strict field with a single-constructor datatype
Unlike newtype, UNPACK is not always a win
- If you pass field as argument, will need to re-box it
-funbox-strict-fields flag unpacks all strict fields

User-managed memory

Opaque type Ptr a represents pointers to type a
- Pointers are not typesafe--allow pointer arithmetic and casting
```
nullPtr :: Ptr a
plusPtr :: Ptr a -> Int -> Ptr b
minusPtr :: Ptr a -> Ptr b -> Int
castPtr :: Ptr a -> Ptr b
```
- Pointer arithmetic is always in units of bytes (unlike in C, where unit is size of the pointed-to object)

Class Storable provides raw access to memory using Ptrs

class Storable a where
    sizeOf :: a -> Int
    alignment :: a -> Int
    peek :: Ptr a -> IO a
    poke :: Ptr a -> a -> IO ()
    ...

Most basic types (Bool, Int, Char, Ptr a, etc.) are Storable

`alloca`

Easiest way to get a valid Ptr is alloca:
```
alloca :: Storable a => (Ptr a -> IO b) -> IO b
```
- Allocates enough space for an object of type a
- Calls function with a Ptr to the space
- Reclaims the memory when the function returns (much like C alloca)
- Can also ask for a specific number of bytes:
```
allocaBytes :: Int -> (Ptr a -> IO b) -> IO b
```
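A minimal sketch of `alloca` in use: write a value into the temporary buffer, read it back, and return a result computed from it. The function name `roundTrip` is illustrative. The pointer must not escape the callback, since the memory is reclaimed when it returns.

```haskell
import Foreign.Marshal.Alloc (alloca)
import Foreign.Storable (peek, poke)

-- Stores n in scratch memory, reads it back, and returns it plus one.
roundTrip :: Int -> IO Int
roundTrip n = alloca $ \p -> do
  poke p n
  x <- peek p
  return (x + 1)

main :: IO ()
main = roundTrip 41 >>= print  -- 42
```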

The Foreign module provides the handy with utility

with :: Storable a => a -> (Ptr a -> IO b) -> IO b
with val f  =
  alloca $ \ptr -> do
    poke ptr val
    res <- f ptr
    return res

More `Storable` types

Foreign.C contains wrappers for C types
- CInt, CUInt, CChar, CDouble, CIntPtr etc.
Data.Int and Data.Word have all sizes of machine integer
- Int8, Int16, Int32, Int64 -- signed integers
- Word8, Word16, Word32, Word64 -- unsigned integers

Example: extract all the bytes from a Storable object

import Foreign
import System.IO.Unsafe (unsafePerformIO)

toBytes :: (Storable a) => a -> [Word8]
toBytes a = unsafePerformIO $
    with a $ \pa -> go (castPtr pa) (pa `plusPtr` sizeOf a)
    where go p e | p < e = do b <- peek p
                              bs <- go (p `plusPtr` 1) e
                              return (b:bs)
                 | otherwise = return []

unsafePerformIO might be okay here since toBytes is pure
Notice how plusPtr lets us change from Ptr a to Ptr Word8

`malloc` and `mallocForeignPtr`

Can also allocate longer-lived memory with malloc

malloc :: Storable a => IO (Ptr a)
mallocBytes :: Int -> IO (Ptr a)
free :: Ptr a -> IO ()
realloc :: Storable b => Ptr a -> IO (Ptr b)
reallocBytes :: Ptr a -> Int -> IO (Ptr a)

Disadvantage: bad programming can lead to memory leaks/corruption

ForeignPtr lets you delegate deallocation to garbage collector

mallocForeignPtr :: Storable a => IO (ForeignPtr a)
mallocForeignPtrBytes :: Int -> IO (ForeignPtr a)

Working with `ForeignPtr`s

To use ForeignPtr, must convert it to Ptr
- Problem: How does GC know ForeignPtr in scope when using Ptr?
- Solution: use Ptr within function that keeps reference to ForeignPtr
```
withForeignPtr :: ForeignPtr a -> (Ptr a -> IO b) -> IO b
```
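A short sketch of the pattern: allocate with `mallocForeignPtr`, then touch the memory only inside `withForeignPtr`. The helper name `roundTripFP` and the choice of `Word32` are illustrative.

```haskell
import Data.Word (Word32)
import Foreign.ForeignPtr (mallocForeignPtr, withForeignPtr)
import Foreign.Storable (peek, poke)

-- Memory is freed by the garbage collector once fp is unreachable.
roundTripFP :: Word32 -> IO Word32
roundTripFP x = do
  fp <- mallocForeignPtr
  withForeignPtr fp $ \p -> poke p x  -- raw Ptr is valid only in here
  withForeignPtr fp peek

main :: IO ()
main = roundTripFP 42 >>= print  -- 42
```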

Can also convert Ptrs to ForeignPtrs

type FinalizerPtr a = FunPtr (Ptr a -> IO ())
newForeignPtr :: FinalizerPtr a -> Ptr a
              -> IO (ForeignPtr a)
newForeignPtr_ :: Ptr a -> IO (ForeignPtr a)
addForeignPtrFinalizer :: FinalizerPtr a -> ForeignPtr a
                       -> IO ()

Can add multiple finalizers, will run in reverse order

Note use of FunPtr -- this is type wrapper for C function pointer
- Need foreign function interface to create these
- finalizerFree symbol conveniently provides function pointer for free
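As a sketch of converting a `Ptr` into a `ForeignPtr`, the following hands a `malloc`ed buffer to the garbage collector with `finalizerFree` as its finalizer. The helper name `adopt` and the use of `CInt` are assumptions made for the example.

```haskell
import Foreign.C.Types (CInt)
import Foreign.ForeignPtr (ForeignPtr, newForeignPtr, withForeignPtr)
import Foreign.Marshal.Alloc (finalizerFree, malloc)
import Foreign.Storable (peek, poke)

-- Allocates with malloc, then lets the GC call free when fp dies.
adopt :: CInt -> IO (ForeignPtr CInt)
adopt x = do
  raw <- malloc
  poke raw x
  newForeignPtr finalizerFree raw

main :: IO ()
main = do
  fp <- adopt 7
  v <- withForeignPtr fp peek
  print v  -- 7
```

After `newForeignPtr`, the raw pointer should no longer be passed to `free` by hand, since the finalizer will do so.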

`ByteString`s

Haskell Strings obviously not very efficient
Strict ByteStrings efficiently manipulate raw bytes
```
import qualified Data.ByteString as S
import qualified Data.ByteString.Char8 as S8
```
- Implements a similar interface to lists: S.head, S.tail, S.length, S.foldl, S.cons (like :), S.empty (like []), S.hPut (like hPutStr), S.readFile
- Must import qualified to avoid name clashes
- S.pack and S.unpack translate to/from [Word8]
- S8 has same functions as S, but uses Char instead of Word8--means you lose upper bits of Char (use toString from utf8-string to avoid loss)
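A brief sketch of the two interfaces working on the same type, including the truncation that `S8.pack` applies to characters above 255. The names `hello` and `truncated` are illustrative.

```haskell
import qualified Data.ByteString as S
import qualified Data.ByteString.Char8 as S8
import Data.Word (Word8)

-- Built through the Char8 interface, inspected through the Word8 one.
hello :: S.ByteString
hello = S8.pack "hello"

-- U+03BB (Greek lambda) keeps only its low 8 bits: 0xBB = 187.
truncated :: [Word8]
truncated = S.unpack (S8.pack "\955")

main :: IO ()
main = do
  print (S.length hello)  -- 5
  print (S.unpack hello)  -- [104,101,108,108,111]
  print truncated         -- [187]
```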

Implementation

data ByteString = PS {-# UNPACK #-} !(ForeignPtr Word8)
                     {-# UNPACK #-} !Int  -- offset
                     {-# UNPACK #-} !Int  -- length

Lazy `ByteString`s

Same package implements lazy ByteStrings

import qualified Data.ByteString.Lazy as L
import qualified Data.ByteString.Lazy.Char8 as L8

Provides mostly the same functions as strict ByteString modules

Confusing that both modules use same names for many things
- Important to look at import qualifications to understand code
- Worse: documentation does not qualify symbol names
  Tip: hover your mouse over symbol and look at URL to figure out module
- Also, S.ByteString and S8.ByteString are the same type (re-exported), and similarly for L.ByteString and L8.ByteString
- S.ByteString and L.ByteString not same type, but can convert:
```
fromChunks :: [S.ByteString] -> L.ByteString
toChunks :: L.ByteString -> [S.ByteString]
```
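A small sketch of converting between the two representations. The name `lazyFooBar` is introduced here; the strict chunks are built with `S8.pack` for readability.

```haskell
import qualified Data.ByteString.Char8 as S8
import qualified Data.ByteString.Lazy as L

-- Two strict chunks become one lazy ByteString without copying the bytes.
lazyFooBar :: L.ByteString
lazyFooBar = L.fromChunks [S8.pack "foo", S8.pack "bar"]

main :: IO ()
main = do
  print (L.length lazyFooBar)                    -- 6
  print (map S8.unpack (L.toChunks lazyFooBar))  -- ["foo","bar"]
```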

Lazy `ByteString` implementation

Lazy ByteStrings are implemented in terms of strict ones
```
data ByteString = Empty
                | Chunk {-# UNPACK #-} !S.ByteString ByteString
```
- Invariant: Chunk's first argument (S.ByteString) never null
- Basically a linked list of strict ByteStrings
- Head is strict, tail is not, allowing lazy computation or I/O
When to use strict/lazy ByteStrings?
- Obviously use lazy when you need laziness (e.g., lazy I/O, infinite or cyclical strings, etc.)
- Lazy also much faster at concatenation (need to build a new list of S.ByteStrings, but not copy the data they contain)
- Strict makes it much easier to implement things like string search
- Converting strict to lazy ByteStrings is cheap, reverse is not (so if a library can work efficiently on lazy ByteStrings, good to expose that functionality)
