Haskell (programming language)

From Wikipedia, the free encyclopedia
Jump to: navigation, search
Haskell
Logo of Haskell
Paradigm(s) functional, lazy/non-strict, modular
Appeared in 1990
Designed by Simon Peyton Jones, Lennart Augustsson, Dave Barton, Brian Boutel, Warren Burton, Joseph Fasel, Kevin Hammond, Ralf Hinze, Paul Hudak, John Hughes, Thomas Johnsson, Mark Jones, John Launchbury, Erik Meijer, John Peterson, Alastair Reid, Colin Runciman, Philip Wadler
Stable release Haskell 2010[1] (July 2010; 3 years ago (2010-07))
Preview release to be announced as Haskell 2012[2]
Typing discipline static, strong, inferred
Major implementations GHC, Hugs, NHC, JHC, Yhc, UHC
Dialects Helium, Gofer
Influenced by Clean,[3] FP,[3] Gofer,[3] Hope and Hope+,[3] Id,[3] ISWIM,[3] KRC,[3] Lisp,[3] Miranda,[3] ML and Standard ML,[3] Orwell, SASL,[3] SISAL,[3] Scheme[3]
Influenced Agda,[4] Bluespec,[5] C++11/Concepts,[6] C#/LINQ,[7][8][9][10] CAL,[citation needed] Cayenne,[7] Clean,[7] Clojure,[11] CoffeeScript,[12] Curry,[7] Elm, Epigram,[citation needed] Escher,[citation needed] F#,[13] Isabelle,[7] Java/Generics,[7] Kaya,[citation needed] Mercury,[7] Omega,[citation needed] Perl 6,[14] Python,[7][15] Qi,[citation needed] Scala,[7][16] Timber,[citation needed] Visual Basic 9.0[7][8]
OS Cross-platform
Usual filename extensions .hs, .lhs

Haskell /ˈhæskəl/[17] is a standardized, general-purpose purely functional programming language, with non-strict semantics and strong static typing.[18] It is named after logician Haskell Curry.[19] In Haskell, "a function is a first-class citizen" of the programming language.[20] As a functional programming language, the primary control construct is the function.

History[edit source | edit]

Following the release of Miranda by Research Software Ltd, in 1985, interest in lazy functional languages grew: by 1987, more than a dozen non-strict, purely functional programming languages existed. Of these, Miranda was the most widely used, but was not in the public domain. At the conference on Functional Programming Languages and Computer Architecture (FPCA '87) in Portland, Oregon, a meeting was held during which participants formed a strong consensus that a committee should be formed to define an open standard for such languages. The committee's purpose was to consolidate the existing functional languages into a common one that would serve as a basis for future research in functional-language design.[21]

Haskell 1.0 to 1.4[edit source | edit]

The first version of Haskell ("Haskell 1.0") was defined in 1990.[19] The committee's efforts resulted in a series of language definitions (1.0, 1.1, 1.2, 1.3, 1.4).

Haskell 98[edit source | edit]

In late 1997, the series culminated in Haskell 98, intended to specify a stable, minimal, portable version of the language and an accompanying standard library for teaching, and as a base for future extensions. The committee expressly welcomed the creation of extensions and variants of Haskell 98 via adding and incorporating experimental features.[21]

In February 1999, the Haskell 98 language standard was originally published as "The Haskell 98 Report".[21] In January 2003, a revised version was published as "Haskell 98 Language and Libraries: The Revised Report".[18] The language continues to evolve rapidly, with the Glasgow Haskell Compiler (GHC) implementation representing the current de facto standard.[22]

Haskell Prime[edit source | edit]

In early 2006, the process of defining a successor to the Haskell 98 standard, informally named Haskell Prime, began.[23] This is an ongoing incremental process to revise the language definition, producing a new revision once per year. The first revision, named Haskell 2010, was announced in November 2009[1] and published in July 2010.

Haskell 2010[edit source | edit]

Haskell 2010 adds the Foreign Function Interface (FFI) to Haskell, allowing for bindings to other programming languages, fixes some syntax issues (changes in the formal grammar) and bans so-called "n-plus-k-patterns", that is, definitions of the form fact (n+1) = (n+1) * fact n are no longer allowed. It introduces the Language-Pragma-Syntax-Extension which allows for designating a Haskell source as Haskell 2010 or requiring certain Extensions to the Haskell Language. The names of the extensions introduced in Haskell 2010 are DoAndIfThenElse, HierarchicalModules, EmptyDataDeclarations, FixityResolution, ForeignFunctionInterface, LineCommentSyntax, PatternGuards, RelaxedDependencyAnalysis, LanguagePragma, NoNPlusKPatterns.[1]

Features[edit source | edit]

Haskell features lazy evaluation, pattern matching, list comprehension, type classes, and type polymorphism. It is a purely functional language, which means that in general, functions in Haskell do not have side effects. There is a distinct construct for representing side effects, orthogonal to the type of functions. A pure function may return a side effect which is subsequently executed, modeling the impure functions of other languages.

Haskell has a strong, static type system based on Hindley–Milner type inference. Haskell's principal innovation in this area is to add type classes, which were originally conceived as a principled way to add overloading to the language,[24] but have since found many more uses.[25]

The construct which represents side effects is an example of a monad. Monads are a general framework which can model different kinds of computation, including error handling, nondeterminism, parsing, and software transactional memory. Monads are defined as ordinary datatypes, but Haskell provides some syntactic sugar for their use.

The language has an open, published specification,[18] and multiple implementations exist.

There is an active community around the language, and more than 4900 third-party open-source libraries and tools are available in the online package repository Hackage.[26]

The main implementation of Haskell, GHC, is both an interpreter and native-code compiler that runs on most platforms. GHC is noted for its high-performance implementation of concurrency and parallelism,[27] and for having a rich type system incorporating recent innovations such as generalized algebraic data types and Type Families.

Code examples[edit source | edit]

The following is a Hello world program written in Haskell (note that all but the last line can be omitted):

module Main where
 
main :: IO ()
main = putStrLn "Hello, World!"

Here is the factorial function in Haskell, defined in a few different ways:

-- Type annotation (optional)
factorial :: Integer -> Integer
 
-- Using recursion
factorial 0 = 1
factorial n | n > 0 = n * factorial (n - 1)
 
-- Using recursion but written without pattern matching
factorial n = if n > 0 then n * factorial (n-1) else 1
 
-- Using a list
factorial n = product [1..n]
 
-- Using fold (implements product)
factorial n = foldl1 (*) [1..n]
 
-- Point-free style
factorial = foldr (*) 1 . enumFromTo 1
 
-- Point-free style analytic solution
import Math.Gamma
factorial = round . exp . lnGamma . fromIntegral . (+1)

An efficient implementation of the Fibonacci numbers, as an infinite list, is this:

-- Type annotation (optional)
fib :: Int -> Integer
 
-- With self-referencing data
fib n = fibs !! n
        where fibs = 0 : scanl (+) 1 fibs
        -- 0,1,1,2,3,5,...
 
-- Same, coded directly
fib n = fibs !! n
        where fibs = 0 : 1 : next fibs
              next (a : t@(b:_)) = (a+b) : next t
 
-- Similar idea, using zipWith
fib n = fibs !! n
        where fibs = 0 : 1 : zipWith (+) fibs (tail fibs)
 
-- Using a generator function
fib n = fibs (0,1) !! n
        where fibs (a,b) = a : fibs (b,a+b)

The Int type refers to a machine-sized integer (used as a list subscript with the !! operator), while Integer is an arbitrary-precision integer. For example, the above code quickly computes "fib 10000" as a 2090-digit number.

This is an implementation of the quick sort algorithm over lists, in which the first element is taken as the pivot:

quickSort :: Ord a => [a] -> [a]
quickSort []     = []
quickSort (x:xs) = quickSort[a | a <- xs, a < x] ++ -- Sort the left part of the list
                   [x] ++                           -- Insert pivot between two sorted parts
                   quickSort[b | b <- xs, b >= x]   -- Sort the right part of the list

Implementations[edit source | edit]

All listed implementations are distributed under open source licenses. There are currently no proprietary Haskell implementations.[28]

The following implementations comply fully, or very nearly, with the Haskell 98 standard.

  • The Glasgow Haskell Compiler (GHC) compiles to native code on a number of different architectures—as well as to ANSI C—using C-- as an intermediate language. GHC has become the de facto standard Haskell dialect.[29] There are libraries (e.g. bindings to OpenGL) that will work only with GHC. GHC is also distributed along with the Haskell platform.
  • The Utrecht Haskell Compiler (UHC) is a Haskell implementation from Utrecht University. UHC supports almost all Haskell 98 features plus many experimental extensions. It is implemented using attribute grammars and is currently mainly used for research into generated type systems and language extensions.
  • Jhc is a Haskell compiler written by John Meacham emphasising speed and efficiency of generated programs as well as exploration of new program transformations. LHC is a recent fork of Jhc.

The following implementations are no longer being actively maintained:

  • Hugs, the Haskell User's Gofer System, is a bytecode interpreter. It used to be one of the most widely used implementations alongside the GHC compiler,[30] but has now been mostly replaced by GHCi. It also comes with a graphics library.
  • nhc98 is another bytecode compiler. Nhc98 focuses on minimizing memory usage.
  • Yhc, the York Haskell Compiler was a fork of nhc98, with the goals of being simpler, more portable and more efficient, and integrating support for Hat, the Haskell tracer. It also featured a JavaScript backend allowing users to run Haskell programs in a web browser.
  • HBC is an early implementation supporting Haskell 1.4. It was implemented by Lennart Augustsson in, and based on, Lazy ML. It has not been actively developed for some time.

Implementations below are not fully Haskell 98 compliant, and use a language that is a variant of Haskell:

  • Gofer was an educational dialect of Haskell, with a feature called "constructor classes", developed by Mark Jones. It was supplanted by Hugs (see above).
  • Helium is a newer dialect of Haskell. The focus is on making it easy to learn by providing clearer error messages. It currently lacks full support for type classes, rendering it incompatible with many Haskell programs.

Applications[edit source | edit]

Audrey Tang's Pugs is an implementation for the long-forthcoming Perl 6 language with an interpreter and compilers that proved useful after just a few months of its writing; similarly, GHC is often a testbed for advanced functional programming features and optimizations. Darcs is a revision control system written in Haskell, with several innovative features. Linspire GNU/Linux chose Haskell for system tools development.[31] Xmonad is a window manager for the X Window System, written entirely in Haskell.[32]

Bluespec SystemVerilog (BSV) is a language for semiconductor design that is an extension of Haskell. Additionally, Bluespec, Inc.'s tools are implemented in Haskell. Cryptol, a language and toolchain for developing and verifying cryptographic algorithms, is implemented in Haskell. Notably, the first formally verified microkernel, seL4 was verified using Haskell.

There are Haskell web frameworks,[33] such as:

Related languages[edit source | edit]

Clean is a close relative of Haskell. Its biggest deviation from Haskell is in the use of uniqueness types instead of monads for I/O and side-effects.

A series of languages inspired by Haskell, but with different type systems, have been developed, including:

  • Agda, a functional language with dependent types
  • Idris, a general purpose functional language with dependent types
  • Epigram, a functional language with dependent types suitable for proving properties of programs
  • Cayenne
  • Ωmega

JVM-based:

  • Frege, a Haskell-like language with Java's scalar types and good Java integration.[35][36][37]
  • Jaskell, a functional scripting programming language that runs in Java VM.[38]

Other related languages include:

  • Curry, a functional/logic programming language based on Haskell

Haskell has served as a testbed for many new ideas in language design. There have been a wide number of Haskell variants produced, exploring new language ideas, including:

  • Parallel Haskell:
  • Distributed Haskell (formerly Goffin) and Eden.[citation needed]
  • Eager Haskell, based on speculative evaluation.
  • Several object-oriented versions: Haskell++, and Mondrian.
  • Generic Haskell, a version of Haskell with type system support for generic programming.
  • O'Haskell, an extension of Haskell adding object-orientation and concurrent programming support which "has reportedly been superseded by Timber."[43]
  • Disciple, a strict-by-default (laziness available by annotation) dialect of Haskell which supports destructive update, computational effects, type directed field projections and allied functional goodness.
  • Scotch, a kind of hybrid of Haskell and Python[44]
  • Hume, a strict functional programming language for embedded systems based on processes as stateless automata over a sort of tuples of single element mailbox channels where the state is kept by feedback into the mailboxes, and a mapping description from outputs to channels as box wiring, with a Haskell-like expression language and syntax.

Criticism[edit source | edit]

Jan-Willem Maessen, in 2002, and Simon Peyton Jones, in 2003, discussed problems associated with lazy evaluation while also acknowledging the theoretical motivation for it,[45][46] in addition to purely practical considerations such as improved performance.[47] They note that, in addition to adding some performance overhead, lazy evaluation makes it more difficult for programmers to reason about the performance of their code (particularly its space usage).

Bastiaan Heeren, Daan Leijen, and Arjan van IJzendoorn in 2003 also observed some stumbling blocks for Haskell learners: "The subtle syntax and sophisticated type system of Haskell are a double edged sword — highly appreciated by experienced programmers but also a source of frustration among beginners, since the generality of Haskell often leads to cryptic error messages."[48] To address these, researchers from Utrecht University developed an advanced interpreter called Helium which improved the user-friendliness of error messages by limiting the generality of some Haskell features, and in particular removing support for type classes.

Ben Lippmeier designed Disciple[49] as a strict-by-default (lazy by explicit annotation) dialect of Haskell with a type-and-effect system, to address Haskell's difficulties in reasoning about lazy evaluation and in using traditional data structures such as mutable arrays.[50] He argues (p. 20) that "destructive update furnishes the programmer with two important and powerful tools... a set of efficient array-like data structures for managing collections of objects, and ... the ability to broadcast a new value to all parts of a program with minimal burden on the programmer."

Robert Harper, one of the authors of Standard ML, has given his reasons for not using Haskell to teach introductory programming. Among these are the difficulty of reasoning about resource usage with non-strict evaluation, that lazy evaluation complicates the definition of data types and inductive reasoning,[51] and the "inferiority" of Haskell's class system compared to ML's module system.[52]

Conferences and workshops[edit source | edit]

The Haskell community meets regularly for research and development activities. The primary events are:

Since 2006, there have been a series of organized "hackathons", the Hac series, aimed at improving the programming language tools and libraries.[53]

Since 2005, a growing number of Haskell User Groups have formed, in the United States, Canada, Australia, South America, Europe and Asia.

References[edit source | edit]

  1. ^ a b c Marlow, Simon (24 November 2009). "Announcing Haskell 2010". Haskell mailing list. http://www.haskell.org/pipermail/haskell/2009-November/021750.html. Retrieved 12 March 2011.
  2. ^ "[Haskell] Announce: ~Haskell 2011". Haskell.org. Retrieved 2013-06-26. 
  3. ^ a b c d e f g h i j k l m Peyton Jones 2003, p. xi
  4. ^ Norell, Ulf (2008). "Dependently Typed Programming in Agda". Gothenburg: Chalmers University. Retrieved 9 February 2012. 
  5. ^ Hudak et al. 2007, p. 12-38,43.
  6. ^ Stroustrup, Bjarne; Sutton, Andrew (2011). Design of Concept Libraries for C++. 
  7. ^ a b c d e f g h i j Hudak et al. 2007, pp. 12-45–46.
  8. ^ a b Meijer, Erik. "Confessions of a Used Programming Language Salesman: Getting the Masses Hooked on Haskell". OOPSLA 2007. 
  9. ^ Meijer, Erik (1 October 2009). "C9 Lectures: Dr. Erik Meijer - Functional Programming Fundamentals, Chapter 1 of 13". Channel 9. Microsoft. Retrieved 9 February 2012. 
  10. ^ Drobi, Sadek (4 March 2009). "Erik Meijer on LINQ". InfoQ (QCon SF 2008: C4Media Inc.). Retrieved 9 February 2012. 
  11. ^ Hickey, Rich. "Clojure Bookshelf". Listmania!. Amazon.com. Retrieved 9 February 2012. 
  12. ^ Heller, Martin (18 October 2011). "Turn up your nose at Dart and smell the CoffeeScript". JavaWorld (InfoWorld). Retrieved 9 February 2012. 
  13. ^ Syme, Don; Granicz, Adam; Cisternino, Antonio (2007). Expert F#. Apress. p. 2. "F# also draws from Haskell particularly with regard to two advanced language features called sequence expressions and workflows." 
  14. ^ "Glossary of Terms and Jargon". Perl Foundation Perl 6 Wiki. The Perl Foundation. 28 February. Retrieved 9 February 2012. 
  15. ^ Kuchling, A. M. "Functional Programming HOWTO". Python v2.7.2 documentation. Python Software Foundation. Retrieved 9 February 2012. 
  16. ^ Fogus, Michael (6 August 2010). "MartinOdersky take(5) toList". Send More Paramedics. Retrieved 9 February 2012. 
  17. ^ Chevalier, Tim (28 January 2008). "anybody can tell me the pronuncation of "haskell"?". Haskell-cafe mailing list. http://www.haskell.org/pipermail/haskell-cafe/2008-January/038756.html. Retrieved 12 March 2011.
  18. ^ a b c Peyton Jones 2003.
  19. ^ a b Hudak et al. 2007.
  20. ^ Burstall, Rod (2000). "Christopher Strachey—Understanding Programming Languages". Higher-Order and Symbolic Computation 13 (52). 
  21. ^ a b c Peyton Jones 2003, Preface.
  22. ^ "Haskell Wiki: Implementations". Retrieved 18 December 2012. 
  23. ^ "Welcome to Haskell'". The Haskell' Wiki. 
  24. ^ Wadler, P.; Blott, S. (1989). "How to make ad-hoc polymorphism less ad hoc". Proceedings of the 16th ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages (ACM): 60–76. doi:10.1145/75277.75283. ISBN 0-89791-294-2. 
  25. ^ Hallgren, T. (January 2001). "Fun with Functional Dependencies, or Types as Values in Static Computations in Haskell". Proceedings of the Joint CS/CE Winter Meeting (Varberg, Sweden). 
  26. ^ "HackageDB statistics". Hackage.haskell.org. Retrieved 2013-06-26. 
  27. ^ Computer Language Benchmarks Game
  28. ^ "Implementations" at the Haskell Wiki
  29. ^ C. Ryder and S. Thompson (2005). "Porting HaRe to the GHC API"
  30. ^ Hudak et al. 2007, p. 12-22.
  31. ^ "Linspire/Freespire Core OS Team and Haskell". Debian Haskell mailing list. May 2006. 
  32. ^ xmonad.org
  33. ^ HaskellWiki - Haskell web frameworks
  34. ^ "Snap: A Haskell Web Framework: Home". Snapframework.com. Retrieved 2013-06-26. 
  35. ^ The Frege prog. lang.
  36. ^ Project Frege at google code
  37. ^ Hellow World and more with Frege
  38. ^ Jaskell
  39. ^ Glasgow Parallel Haskell
  40. ^ GHC Language Features: Parallel Haskell
  41. ^ Using GHC: Using SML parallelism
  42. ^ MIT Parallel Haskell
  43. ^ OHaskell at HaskellWiki
  44. ^ Scotch
  45. ^ Jan-Willem Maessen. Eager Haskell: Resource-bounded execution yields efficient iteration. Proceedings of the 2002 ACM SIGPLAN workshop on Haskell.
  46. ^ Simon Peyton Jones. Wearing the hair shirt: a retrospective on Haskell. Invited talk at POPL 2003.
  47. ^ Lazy evaluation can lead to excellent performance, such as in The Computer Language Benchmarks Game [1]
  48. ^ Heeren, Bastiaan; Leijen, Daan; van IJzendoorn, Arjan (2003). "Helium, for learning Haskell". Proceedings of the 2003 ACM SIGPLAN workshop on Haskell. 
  49. ^ "DDC - HaskellWiki". Haskell.org. 2010-12-03. Retrieved 2013-06-26. 
  50. ^ Ben Lippmeier, Type Inference and Optimisation for an Impure World, Australian National University (2010) PhD thesis, chapter 1
  51. ^ Robert Harper. "The point of laziness". 
  52. ^ Robert Harper. "Modules matter most.". 
  53. ^ "Hackathon - HaskellWiki". 

Further reading[edit source | edit]

Reports
Textbooks
History

External links[edit source | edit]

Tutorials
Various
Applications