Share on Facebook Share on Twitter Email
Answers.com

List of important publications in computer science

 
Wikipedia: List of important publications in computer science

This is a list of important publications in computer science, organized by field.

Some reasons why a particular publication might be regarded as important:

  • Topic creator – A publication that created a new topic
  • Breakthrough – A publication that changed scientific knowledge significantly
  • Introduction – A publication that is a good introduction or survey of a topic
  • Influence – A publication which has significantly influenced the world
  • Latest and greatest – The current most advanced result in a topic

Contents

Theoretical computer science

Topics covered: theoretical computer science, including computability theory, computational complexity theory, algorithms, algorithmic information theory, information theory and formal verification.

Concurrent, parallel, and distributed computing

Topics covered: concurrent computing, parallel computing, and distributed computing.

Networks and security

Topics covered: cryptography and computer security, computer networks and the Internet.

Computational linguistics

Realization of Natural-Language Interfaces Using Lazy Functional Programming

Description: This survey documents relatively less researched importance of lazy functional programming languages (i.e. Haskell) to construct Natural Language Processors and to accommodated many linguistic theories.

Operating systems

An experimental timesharing system.

Description: This paper discuss time-sharing as a method of sharing computer resource. This idea changed the interaction with computer systems.

The Working Set Model for Program Behavior

Description: The beginning of cache. For more information see SIGOPS Hall of Fame.

Virtual Memory, Processes, and Sharing in MULTICS

Description: The classic paper on the most ambitious operating system in the early history of computing. Difficult reading, but it describes the implications of trying to build a system that takes information sharing to its logical extreme. Most operating systems since Multics have incorporated a subset of its facilities.

A note on the confinement problem

Description: This paper addresses issues in constraining the flow of information from untrusted programs. It discusses covert channels, but more importantly it addresses the difficulty in obtaining full confinement without making the program itself effectively unusable. The ideas are important when trying to understand containment of malicious code, as well as aspects of trusted computing.

The UNIX Time-Sharing System

Description: The Unix operating system and its principles were described in this paper. The main importance is not of the paper but of the operating system, which had tremendous effect on operating system and computer technology.

Weighted voting for replicated data

  • David K. Gifford
  • Proceedings of the 7th ACM Symposium on Operating Systems Principles, pages 150-159, December 1979. Pacific Grove, California
  • Online copy (few formats)

Description: This paper describes the consistency mechanism known as quorum consensus. It is a good example of algorithms that provide a continuous set of options between two alternatives (in this case, between the read-one write-all, and the write-one read-all consistency methods). There have been many variations and improvements by researchers in the years that followed, and it is one of the consistency algorithms that should be understood by all. The options available by choosing different size quorums provide a useful structure for discussing of the core requirements for consistency in distributed systems.

Experiences with Processes and Monitors in Mesa

Description: This is the classic paper on synchronization techniques, including both alternate approaches and pitfalls.

Scheduling Techniques for Concurrent Systems

Description: Algorithms for coscheduling of related processes were given

A Fast File System for UNIX

Description: The file system of UNIX. One of the first papers discussing how to manage disk storage for high-performance file systems. Most file-system research since this paper has been influenced by it, and most high-performance file systems of the last 20 years incorporate techniques from this paper.

The Design and Implementation of a Log-Structured File System

Description: Log-structured file system.

Microkernel operating system architecture and Mach

  • David L. Black, David B. Golub, Daniel P. Julin, Richard F. Rashid, Richard P. Draves, Randall W. Dean, Alessandro Forin, Joseph Barrera, Hideyuki Tokuda, Gerald Malan, David Bohman
  • Proceedings of the USENIX Workshop on Microkernels and Other Kernel Architectures, pages 11–30, April 1992.

Description: This is a good paper discussing one particular microkernel architecture and contrasting it with monolithic kernel design. Mach underlies Mac OS X, and its layered architecture had a significant impact on the design of the Windows NT kernel and modern microkernels like L4. In addition, its memory-mapped files feature was added to many monolithic kernels.

An Implementation of a Log-Structured File System for UNIX

Description: The paper was the first production-quality implementation of that idea which spawned much additional discussion of the viability and short-comings of log-structured filesystems. While "The Design and Implementation of a Log-Structured File System" was certainly the first, this one was important in bringing the research idea to a usable system.

Soft Updates: A Solution to the Metadata Update problem in File Systems

Description: A new way of maintaining filesystem consistency.

Databases

A relational model for large shared data banks

Description: This paper introduced the relational model for databases. This model became the number one model.

Binary B-Trees for Virtual Memory

  • Rudolf Bayer
  • ACM-SIGFIDET Workshop 1971, San Diego, California, Session 5B, p. 219-235.

Description: This paper introduced the B-Trees data structure. This model became the number one model.

Relational Completeness of Data Base Sublanguages

  • E. F. Codd
  • In: R. Rustin (ed.): Database Systems: 65-98, Prentice Hall and IBM Research Report RJ 987, San Jose, California : (1972)
  • Online version (PDF)

Description: Completeness of Data Base Sublanguages

The Entity Relationship Model – Towards a Unified View of Data

Description: This paper introduced the Entity-relationship diagram(ERD) method of database design.

SEQUEL: A structured English query language

  • Donald D. Chamberlin, Raymond F. Boyce
  • International Conference on Management of Data, Proceedings of the 1974 ACM SIGFIDET (now SIGMOD) workshop on Data description, access and control, Ann Arbor, Michigan, pp. 249–264

Description: This paper introduced the SQL language.

The notions of consistency and predicate locks in a database system

  • K.P. Eswaran, J. Gray, R.A. Lorie, I.L. Traiger
  • Communications of the ACM 19, 1976, 624—633

Description: This paper defined the concepts of transaction, consistency and schedule.It also argued that a transaction needs to lock a logical rather than a physical subset of the database.

Mining association rules between sets of items in large databases

Description: Association rules, a very common method for data mining.

Principles of Transaction-Oriented Database Recovery

Description: Introduced the ACID paradigm for transactions.

Information retrieval

A Vector Space Model for Automatic Indexing

  • Gerard Salton, A. Wong, C. S. Yang
  • Commun. ACM 18(11): 613-620 (1975)

Description: Presented the vector space model.

Extended Boolean Information Retrieval

  • Gerard Salton, Edward A. Fox, Harry Wu
  • Commun. ACM 26(11): 1022-1036 (1983)

Description: Presented the inverted index

Artificial intelligence

Computing machinery and intelligence

Description: This paper discusses whether machine can think and suggested the Turing test as a method for checking it.

A Proposal for the Dartmouth Summer Research Project on Artificial Intelligence

Description: This summer research proposal inaugurated and defined the field. It contains the first use of the term artificial intelligence and this succinct description of the philosophical foundation of the field: "every aspect of learning or any other feature of intelligence can in principle be so precisely described that a machine can be made to simulate it." (See philosophy of AI) The proposal invited researchers to the Dartmouth conference, which is widely considered the "birth of AI". (See history of AI.)

Fuzzy sets

Description: The seminal paper published in 1965 provides details on the mathematics of fuzzy set theory.

Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference

Description: This book introduced Bayesian methods to AI.

Artificial Intelligence: A Modern Approach

Description: The standard textbook in Artificial Intelligence. The book web site lists over 1100 colleges and universities in 102 countries using it.

Machine learning

An Inductive Inference Machine

  • Ray Solomonoff
  • IRE Convention Record, Section on Information Theory, Part 2, pp. 56–62, 1957
  • (A longer version of this, a privately circulated report, 1956, is online).

Description: The first paper written on machine learning. Emphasized the importance of training sequences, and the use of parts of previous solutions to problems in constructing trial solutions to new problems.

Language identification in the limit

Description: This paper created Algorithmic learning theory.

On the uniform convergence of relative frequencies of events to their probabilities

Description: Computational learning theory, VC theory, statistical uniform convergence and the VC dimension.

A theory of the learnable

Description: The Probably approximately correct learning (PAC learning) framework.

Learning representations by back-propagating errors

Description: Development of Backpropagation algorithm for artificial neural networks. Note that the algorithm was first described by Paul Werbos in 1974.

Induction of Decision Trees

Description: Decision Trees are a common learning algorithm and a decision representation tool. Development of decision trees was done by many researchers in many areas, even before this paper. Though this paper is one of the most influential in the field.

Learning Quickly When Irrelevant Attributes Abound: A New Linear-threshold Algorithm

Description: One of the papers that started the field of on-line learning. In this learning setting, a learner receives a sequence of examples, making predictions after each one, and receiving feedback after each prediction. Research in this area is remarkable because (1) the algorithms and proofs tend to be very simple and beautiful, and (2) the model makes no statistical assumptions about the data. In other words, the data need not be random (as in nearly all other learning models), but can be chosen arbitrarily by "nature" or even an adversary. Specifically, this paper introduced the winnow algorithm.

Learning to predict by the method of temporal differences

Description: The temporal differences method for reinforcement learning.

Learnability and the Vapnik-Chervonenkis dimension

Description: The complete characterization of PAC learnability using the VC dimension.

Cryptographic limitations on learning boolean formulae and finite automata

Description: Proving negative results for PAC learning.

The strength of weak learnability

Description: Proving that weak and strong learnability are equivalent in the noise free PAC framework. The proof was done by introducing the boosting method.

Learning in the presence of malicious errors

Description: Proving possibility and impossibility result in the malicious errors framework.

A training algorithm for optimum margin classifiers

Description: This paper presented support vector machines, a practical and popular machine learning algorithm. Support vector machines utilize the kernel trick, a generally used method.

Knowledge-based analysis of microarray gene expression data by using support vector machines

Description: The first application of supervised learning to gene expression data, in particular Support Vector Machines. The method is now standard, and the paper one of the most cited in the area.

Computer vision

The Phase Correlation Image Alignment Method

  • C.D. Kuglin and D.C. Hines
  • IEEE 1975 Conference on Cybernetics and Society, 1975, New York, pp. 163–165, September

Description: A correlation method based upon the inverse Fourier transform

Determining Optical Flow

  • B.K.P. Horn and B.G. Schunck
  • Artificial Intelligence, Volume 17, 185–203, 1981

Description: A method for estimating the image motion of world points between 2 frames of a video sequence.

An Iterative Image Registration Technique with an Application to Stereo Vision

Description: This paper provides efficient technique for image registration

The Laplacian Pyramid as a compact image code

Description: A technique for image encoding using local operators of many scales.

Snakes: Active contour models

  • Michael Kass, Andrew Witkin, and Demetri Terzopoulos
  • International Journal of Computer Vision, 1(4):321–331, 1988. (Marr Prize Special Issue)
  • Online version

Description: An interactive variational technique for image segmentation and visual tracking.

Condensation – conditional density propagation for visual tracking

  • M. Isard and A. Blake
  • International Journal of Computer Vision, 29(1):5–28, 1998.
  • Online version

Description: A technique for visual tracking

Object recognition from local scale-invariant features

Description: A technique (Scale-invariant feature transform) for robust feature description

Compilers

On the translation of languages from left to right

Description: Bottom up parsing for deterministic context-free languages from which later the LALR approach of Yacc developed.

Semantics of Context-Free Languages.

Description: About grammar attribution, the base for yacc's s-attributed and zyacc's LR-attributed approach.

A program data flow analysis procedure

  • F.E. Allen, J. Cocke
  • Commun. ACM, 19, 137—147.

Description: From the abstract: "The global data relationships in a program can be exposed and codified by the static analysis methods described in this paper. A procedure is given which determines all the definitions which can possibly reach each node of the control flow graph of the program and all the definitions that are live on each edge of the graph."

A Unified Approach to Global Program Optimization

Description: Formalized the concept of data-flow analysis as fixpoint computation over lattices, and showed that most static analyses used for program optimization can be uniformly expressed within this framework.

YACC: Yet another compiler-compiler

Description: Yacc is a tool that made compiler writing much easier.

gprof: A Call Graph Execution Profiler

Description: The gprof profiler

Compilers: Principles, Techniques and Tools

Description: This book became a classic in compiler writing. It is also known as the Dragon book, after the (red) dragon that appears on its cover.

Software engineering

Software engineering: Report of a conference sponsored by the NATO Science Committee

Description: Conference of leading figures in software field circa 1968
The paper defined the field of Software engineering

Go To Statement Considered Harmful

Description: Don't use goto – the beginning of structured programming.

On the criteria to be used in decomposing systems into modules

Description: The importance of modularization and information hiding. Note that information hiding was first presented in a different paper of the same author - "Information Distributions Aspects of Design Methodology", Proceedings of IFIP Congress '71, 1971, Booklet TA-3, pp. 26–30

Hierarchical Program Structures

  • Ole-Johan Dahl, C. A. R. Hoare
  • in Dahl, Dijkstra and Hoare, Structured Programming, Academic Press, London and New York, pp. 175–220, 1972.

Description: The beginning of Object-oriented programming. This paper argued that programs should be decomposed to independent components with small and simple interfaces. They also argued that objects should have both data and related methods.

A technique for software module specification with examples

Description: software specification.

Structured Design

Description: Seminal paper on Structured Design, data flow diagram, coupling, and cohesion.

The Emperor's Old Clothes

Description: A lovely story of how large software projects can go right, and then wrong, and then right again, told with humility and humor. Illustrates the "second-system effect" and the importance of simplicity.

The Mythical Man-Month: Essays on Software Engineering

Description: Throwing more people at the task will not speed its completion...

No Silver Bullet: Essence and Accidents of Software Engineering

Description: We will keep having problems with software...

The Cathedral and the Bazaar

Description: Open source methodology.

Design Patterns: Elements of Reusable Object Oriented Software

Description: This book was the first to define and list design patterns in computer science.

Statecharts: A Visual Formalism For Complex Systems

  • David Harel
  • D. Harel. Statecharts: A visual formalism for complex systems. Science of Computer Programming, 8:231—274, 1987
  • Online version

Description: Statecharts are a visual modeling method. They are an extension of state machine that might be exponentially more efficient. Therefore, statcharts enable formal modeling of applications that were too complex before. Statecharts are part of the UML diagrams.

Technology of Automata-based programming

Description: Technology of Automata-based programming is a programming techology based on a principle of using finite state machine as a description of behavior and isomorphical transformation from state machine to code.

Programming languages

The FORTRAN Automatic Coding System

Description: This paper describes the design and implementation of the first FORTRAN compiler by the IBM team. Fortran is a general-purpose, procedural, imperative programming language that is especially suited to numeric computation and scientific computing.

Recursive functions of symbolic expressions and their computation by machine, part I

Description: This paper introduced LISP, the first functional programming language, which was used heavily in many areas of computer science, especially in AI. LISP also has powerful features for manipulating LISP programs within the language.

ALGOL 60

Description: Algol 60 introduced block structure.

The next 700 programming languages

Description: This seminal paper proposed an ideal language ISWIM, which without being ever implemented influenced the whole later development.

Lambda Papers

Description: This series of papers and reports first defined the influential Scheme programming language and questioned the prevailing practices in programming language design, employing lambda calculus extensively to model programming language concepts and guide efficient implementation without sacrificing expressive power.

Structure and Interpretation of Computer Programs

Description: This textbook explains core computer programming concepts, and is widely considered a classic text in computer science.

The C Programming Language

Description: Co-authored by the man who designed the C programming language, the first edition of this book served for many years as the language's de facto standard. As such, the book is regarded by many to be the authoritative reference on C.

The Java (programming language)

Computer architecture

Colossus computer

Description: The Colossus machines were early computing devices used by British codebreakers to read encrypted German messages during World War II. Colossus was an early binary electronic digital computer. The design of Colossus was later described in the referenced paper.

First Draft of a Report on the EDVAC

Description: It contains the first published description of the logical design of a computer using the stored-program concept, which has come to be known as the von Neumann architecture.

Architecture of the IBM System/360

Description: The IBM System/360 (S/360) is a mainframe computer system family announced by IBM on April 7, 1964. It was the first family of computers making a clear distinction between architecture and implementation.

The case for the reduced instruction set computer

Description: The reduced instruction set computer( RISC) CPU design philosophy. The RISC is a CPU design philosophy that favors a reduced instruction set as well as a simpler set of instructions.

Comments on "the Case for the Reduced Instruction Set Computer"

Description:

The CRAY-1 Computer System

  • DW Clark, WD Strecker
  • Communications of the ACM, January 1978, volume 21, number 1, pages 63–72.
  • Online version(PDF)

Description: The Cray-1 was a supercomputer designed by a team including Seymour Cray for Cray Research. The first Cray-1 system was installed at Los Alamos National Laboratory in 1976, and it went on to become one of the best known and most successful supercomputers in history.

Validity of the Single Processor Approach to Achieving Large Scale Computing Capabilities

Description: The Amdahl's Law.

A Case for Redundant Arrays of Inexpensive Disks (RAID)

Description: This paper discusses the concept of RAID disks, outlines the different levels of RAID, and the benefits of each level. It is a good paper for discussing issues of reliability and fault tolerance of computer systems, and the cost of providing such fault-tolerance.

Computer graphics

The Rendering Equation

  • J. Kajiya
  • SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques pages 143—150 [4]

Elastically deformable models

  • D. Terzopoulos, J. Platt, A. Barr, K. Fleischer
  • Computer Graphics, 21(4), 1987, 205-214, Proc. ACM SIGGRAPH'87 Conference, Anaheim, CA, July, 1987.
  • Online version(PDF)

Description: The Academy of Motion Picture Arts and Sciences cited this paper as a "milestone in computer graphics".

History of computation

The Computer from Pascal to von Neumann

Description: Perhaps the first book on the history of computation.

A History of Computing in the Twentieth Century

edited by:

Description: Several chapters by pioneers of computing.

Computer science humor

The Complexity of Songs

Description: The article capitalizes on the tendency of popular songs to evolve from long and content-rich ballads to highly repetitive "content-free" texts.

On Folk Theorems

Description: A paper that is both serious and funny about "the things we all know".

How to prove it

  • Dana Angluin
  • Sigact News, Winter-Spring 1983, Volume 15 #1
  • Online version

Description: Angluin presents some common proof techniques that should become less common.

See also

Further reading

  • Laplante, Phillip (ed). (1996) Great Papers in Computer Science. New York: IEEE Press. ISBN 031406365X.
  • Randell, Brian (ed). (1982). The Origins of Digital Computers: Selected Papers. 3rd ed. Berlin: Springer-Verlag. ISBN 0387113193.
  • Turning Points in Computing: 1962-1999, Special Issue, IBM Systems Journal, 38 (2/3),1999.
  • Yourdon, Edward (ed.) (1979) Classics in Software Engineering. New York: Yourdon Press. ISBN 0917072146

External links

Academic Search Engines


Search unanswered questions...
Enter a question here...
Search: All sources Community Q&A Reference topics
 
 
Learn More
List of important publications in science
David S. Johnson
Operating Systems Design and Implementation

Post a question - any question - to the WikiAnswers community:

 

Copyrights:

Wikipedia. This article is licensed under the Creative Commons Attribution/Share-Alike License. It uses material from the Wikipedia article "List of important publications in computer science" Read more