Useful SAS Procedures
Data Related
- IMPORT
Import an external file to a SAS data set
- SORT
Order SAS data set observations by the values of one or more character or numeric variables
Descriptive Analytics
- ANOVA
Performs analysis of variance for balanced data
- FREQ
Produces one-way to n-way frequency and contingency (crosstabulation) tables
- MEANS
Compute descriptive statistics for variables
- PRINT
Print observations in a data set
- SUMMARY
Compute desctiptive statistics
- TABULATE
displays descriptive statistics in tabular format
- TTEST
Performs t tests and computes confidence limits for one sample, paired observations, two independent samples,
and the AB/BA crossover design. Two-sided, TOST (two one-sided test) equivalence, and upper and lower one-sided hypotheses are supported for means, mean differences, and mean ratios for either
normal or lognormal data
Graphs
- BOXPLOT
Creates side-by-side box-and-whiskers plots of measurements organized in groups
- SGPLOT
Performs analysis of variance for balanced data
- SGPLOT Example by Horstman
Getting Started with the SGPLOT Procedure
- SGPLOT Example by Slaughter and Delwiche
Getting Started with the SGPLOT Procedure
Classification and Clustering
- CANDISC
Performs a canonical discriminant analysis, computes squared Mahalanobis distances
between class means, and performs both univariate and multivariate one-way analyses of variance
- CLUSTER
Hierarchically clusters the observations in a SAS data
- DISCRIM
Develops a discriminant criterion to classify each observation into groups
- DISTANCE
Computes various measures of distance, dissimilarity, or similarity between the
observations (rows) of a SAS data set. Proximity measures are stored as a lower triangular matrix or a square matrix in an output data set that can then be used as input to the CLUSTER, MDS, and
MODECLUS procedures.
- FASTCLUS
Performs a disjoint cluster analysis on the basis of distances computed from one
or more quantitative variables
- TREE
Produces a tree diagram, also known as a dendrogram or phenogram, from a data set created by
the CLUSTER or VARCLUS procedure that contains the results of hierarchical clustering as a tree structure
Regression Analysis
- GLM
Fits general linear models
- LOGISTIC
Fits models with binary, ordinal, or nominal dependent variables
- MCMC
A general purpose Markov chain Monte Carlo (MCMC) simulation procedure that is designed to fit Bayesian models
- MIXED
Fits general linear models with fixed and random effects
- MODECLUS
Clusters observations in a SAS data set by using any of several algorithms based
on nonparametric density estimates
- PROBIT
Calculates maximum likelihood estimates of regression parameters and the natural (or threshold)
response rate for quantal response data from biological assays or other discrete event data
- QUANTREG
Fits quantile regression models
- QUANTSELECT
Performs effect selection for linear quantile regression models
- REG
General purpose procedure for ordinary least squares regression
- ROBUSTREG
Provides resistant (stable) results for linear regression models in the presence of outliers
- SURVEYSELECT
Provides a variety of methods for selecting probability-based random samples.