R Graphics, R

[ Pobierz całość w formacie PDF ]
Chapman & Hall/CRC
Computer Science and Data Analysis Series
The interface between the computer and statistical sciences is increasing,
as each discipline seeks to harness the power and resources of the other.
This series aims to foster the integration between the computer sciences
and statistical, numerical, and probabilistic methods by publishing a broad
range of reference works, textbooks, and handbooks.
SERIES EDITORS
John Lafferty, Carnegie Mellon University
David Madigan, Rutgers University
Fionn Murtagh, Royal Holloway, University of London
Padhraic Smyth, University of California, Irvine
Proposals for the series should be sent directly to one of the series editors
above, or submitted to:
Chapman & Hall/CRC
23-25 Blades Court
London SW15 2NU
UK
Published Titles
Bayesian Artificial Intelligence
Kevin B. Korb and Ann E. Nicholson
Pattern Recognition Algorithms for Data Mining
Sankar K. Pal and Pabitra Mitra
Exploratory Data Analysis with MATLAB
®
Wendy L. Martinez and Angel R. Martinez
Clustering for Data Mining: A Data Recovery Approach
Boris Mirkin
Correspondence Analysis and Data Coding with Java and R
Fionn Murtagh
R Graphics
Paul Murrell
Computer Science and Data Analysis Series
R Graphics
Paul Murrell
The University of Auckland
New Zealand
Boca Raton London New York Singapore
Published in 2006 by
Chapman & Hall/CRC
Taylor & Francis Group
6000 Broken Sound Parkway NW, Suite 300
Boca Raton, FL 33487-2742
© 2006 by Taylor & Francis Group, LLC
Chapman & Hall/CRC is an imprint of Taylor & Francis Group
No claim to original U.S. Government works
Printed in the United States of America on acid-free paper
10987654321
International Standard Book Number-10: 1-58488-486-X (Hardcover)
International Standard Book Number-13: 978-1-58488-486-6 (Hardcover)
Library of Congress Card Number 2005046278
This book contains information obtained from authentic and highly regarded sources. Reprinted material is
quoted with permission, and sources are indicated. A wide variety of references are listed. Reasonable efforts
have been made to publish reliable data and information, but the author and the publisher cannot assume
responsibility for the validity of all materials or for the consequences of their use.
No part of this book may be reprinted, reproduced, transmitted, or utilized in any form by any electronic,
mechanical, or other means, now known or hereafter invented, including photocopying, microfilming, and
recording, or in any information storage or retrieval system, without written permission from the publishers.
For permission to photocopy or use material electronically from this work, please access
Danvers, MA 01923, 978-750-8400. CCC is a not-for-profit organization that provides licenses and registration
for a variety of users. For organizations that have been granted a photocopy license by the CCC, a separate
system of payment has been arranged.
Trademark Notice:
Product or corporate names may be trademarks or registered trademarks, and are used only
for identification and explanation without intent to infringe.
Library of Congress Cataloging-in-Publication Data
Murrell, Paul.
R graphics / Paul Murrell.
p. cm.
Includes bibliographical references and index.
ISBN 1-58488-486-X
1. Computer graphics. 2. R (Computer program language) I. Title.
T385.M9 2005
006.6—dc22
2005046278
Visit the Taylor & Francis Web site at
Taylor & Francis Group
is the Academic Division of T&F Informa plc.
and the CRC Press Web site at
Preface
R
is a popular open source software tool for statistical analysis and graphics.
This book focuses on the very powerful graphics facilities that
R
provides for
the production of publication-quality diagrams and plots.
What this book is about
This book describes the graphics system in
R
. The first chapter provides an
overview of the
R
graphics facilities. There are many pictures that demon-
strate the variety and complexity of plots and diagrams that can be produced
using
R
. There is a description of the different output formats that
R
graphics
can produce and there is a description of the overall organization of the
R
graphics facilities, so that the user has some idea of where to find a function
for a particular purpose.
The most important feature of the
R
graphics setup is the existence of two
distinct graphics systems within
R
: the traditional graphics system and the
grid graphics system. Section 1.2.2 offers some advice on which system to use.
Part I of this book is concerned with the traditional graphics system, which
implements many of the “traditional”graphics facilities of the
S
language[11][5]
(originally developed at Bell Laboratories and available in a commercial im-
plementation as
S-PLUS
). The majority of
R
graphics functions available at
the time of writing are based upon this system. The chapters in this part of
the book describe how to work with the traditional graphics functions, with
a particular emphasis on how to modify or add output to a plot to produce
exactly the right final output.
Chapter 2
describes the functions that are avail-
able to produce complete plots and
Chapter 3
focuses on how to customize
the details of plots, combine multiple plots, and add further output to plots.
Part II describes the grid graphics system, which is unique to
R
and is much
more powerful than the traditional system. At the time of writing, there
are fewer functions based on grid for producing complete plots, but there is
more power to produce a wider range of final results. Most of the functions
that produce complete plots using grid come from Deepayan Sarkar’s lattice
package, which implements Bill Cleveland’s Trellis graphics system. This
is described in
Chapter 4.
The remaining chapters describe how the grid
system can be used to produce graphical scenes starting from a blank page.
In particular, there is a discussion of how to develop new graphical functions
that are easy for other people to use and build on.
Appendix A provides a very brief introduction to the
R
system in general and
Appendix B discusses ways in which the traditional and grid graphics systems
can be combined.
The main part of the book assumes a basic familiarity with the
R
language
and environment. For more detailed information, the reader is directed to
the home page of the
R
Project (the URL is given below), which has links to
on-line documents and references to printed material.
There are a number of projects working on graphical user interfaces to
R
,
but the common underlying method of interaction is via a command line.
This book focuses on the production of graphical output by entering
R
code
interactively at the command-line interface to
R
and writing code in scripts
to load into
R
or to run as a batch job.
What this book is not about
This book does
not
contain discussions about which sort of plot is most appro-
priate for a particular sort of data, nor does it contain guidelines for correct
graphical presentation. In fact, instructions are provided for producing some
types of plots and graphical elements that are generally disapproved of, such
as pie charts and cross-hatched fill patterns.
The information in this book is meant to be used to produce a plot once the
format of the plot has been decided upon and to experiment with different
ways of presenting a set of data. No plot types are deliberately excluded,
partly because no plot type is all bad (e.g., a pie chart can be a very effec-
tive way to present a simple proportion) and partly because some graphical
elements, such as cross-hatching, are sometimes required by a particular pub-
lisher.
The flexibility of
R
graphics encourages the user
not
to be constrained to
thinking in terms of just the traditional types of plots. The aim of this book
is to provide lots of useful tools and to describe how to use them. There are
many other sources of information on graphical guidelines and recommended
plot types, some of which are mentioned below.
Most introductory statistics text books will contain basic guidelines for se-
lecting an appropriate type of plot. Examples of books that deal specif-
ically with the construction of effective plots and are aimed at a general
audience are “Creating More Effective Graphs” by Naomi Robbins[51] and
Edward Tufte’s “Visual Display of Quantitative Information”[60] and “Envi-
sioning Information”[61]. For more technical discussions of these issues, see
“Visualizing Data”and “Elements of Graphing Data” by Bill Cleveland[12][13],
and “The Grammar of Graphics” by Leland Wilkinson[67].
[ Pobierz całość w formacie PDF ]

  • zanotowane.pl
  • doc.pisz.pl
  • pdf.pisz.pl
  • tejsza.htw.pl
  •