| Type: | Package |
| Title: | Project MOSAIC Data Sets |
| Version: | 0.20.4 |
| Depends: | R (≥ 4.1.0) |
| Suggests: | lattice, mosaic, reshape2, ggplot2, dplyr, tidyr, ggformula |
| Author: | Randall Pruim <rpruim@calvin.edu>, Daniel Kaplan <kaplan@macalester.edu>, Nicholas Horton <nhorton@amherst.edu> |
| Maintainer: | Randall Pruim <rpruim@calvin.edu> |
| Description: | Data sets from Project MOSAIC (http://www.mosaic-web.org) used to teach mathematics, statistics, computation and modeling. Funded by the NSF, Project MOSAIC is a community of educators working to tie together aspects of quantitative work that students in science, technology, engineering and mathematics will need in their professional lives, but which are usually taught in isolation, if at all. |
| License: | GPL-2 | GPL-3 [expanded from: GPL (≥ 2)] |
| LazyLoad: | yes |
| LazyData: | yes |
| RoxygenNote: | 7.2.3 |
| Encoding: | UTF-8 |
| URL: | https://github.com/ProjectMOSAIC/mosaicData |
| BugReports: | https://github.com/ProjectMOSAIC/mosaicData/issues |
| NeedsCompilation: | no |
| Packaged: | 2023-11-05 00:17:51 UTC; rpruim |
| Repository: | CRAN |
| Date/Publication: | 2023-11-05 05:50:02 UTC |
Alcohol Consumption per Capita
Description
These data provide per capita alcohol consumption values for many countries in 2005 and 2008. There are also a few countries for which there are data in other years.
Usage
data(Alcohol)
Format
A data frame with 411 observations on the following variables.
countrycountry name
yearyear
alcoholestimated per capita alcohol consumption for adults (15+) in litres pure alcohol
Source
Gapminder (https://www.gapminder.org/data/)
Examples
data(Alcohol)
# There are only a few observations in years other than 2005 and 2008
subset(Alcohol, ! year %in% c(2005,2008))
US Births in 1969 - 1988
Description
A day by day record of the number of births in each US State.
Usage
data(Birthdays)
Format
A data frame with 374221 observations on the following variables.
statestate where child was born
yearyear (1969-1988)
monthmonth (1-12)
dayday of month
datedate as a date object
wdayDay of week (ordered factor)
birthsnumber of births
See Also
Births, Births78, Births2015, BirthsSSA, BirthsCDC for data sets that are
aggregated at the level of the entire country.
Examples
data(Birthdays)
if (require(mosaic)) {
MI <- Birthdays |> filter(state == "MI")
gf_point(births ~ date, Birthdays, data = MI)
gf_line(births ~ date, Birthdays, data = MI, color = ~ wday)
gf_line(births ~ date,
data = Birthdays |> group_by(date) |> summarise(births = sum(births)))
}
US Births
Description
Number of births in the United States. There are several data sets covering different date ranges and obtaining data from different sources.
Usage
data(Births)
data(Births78)
data(Births2015)
data(BirthsSSA)
data(BirthsCDC)
Format
A data.frame with the following 8 variables.
dateDate
birthsNumber of births on
date(integer)wdayDay of week (ordered factor)
yearYear (integer)
monthMonth (integer)
day_of_yearDay of year (integer)
day_of_monthDay of month (integer)
day_of_weekDay of week (integer)
Details
There are some overlapping dates in the various data sets, but the number of births does not always agree due to the different sources of the data. See the examples.
Source
Data source for
Births: National Vital Statistics System natality data, as provided by Google BigQuery and exported to csv by Robert Kern https://www.mechanicalkern.com:443/static/birthdates-1968-1988.csv.Data source for
BirthsSSAUS Social Security Administration, as curated at https://github.com/fivethirtyeight/data/tree/master/birthsData source for
BirthsCDCUS Centers for Disease Control, as curated at https://github.com/fivethirtyeight/data/tree/master/birthsData source for
Births2015: Obtained from the National Center for Health Statistics, National Vital Statistics System, Natality, 2015 data.
See Also
Birthdays for a data set aggregated at the state level.
Examples
data(Births78)
data(Births2015)
data(Births)
data(BirthsSSA)
data(BirthsCDC)
# date ranges for the different data sets
lapply(
list(Births = Births, Births78 = Births78, Biths2015 = Births2015, BirthsSSA = BirthsSSA,
BirthsCDC = BirthsCDC),
function(x) range(x$date))
range(Births78$date)
range(Births2015$date)
range(Births$date)
range(BirthsSSA$date)
range(BirthsCDC$date)
# Births and Births78 have slightly different numbers of births
if(require(ggplot2)) {
ggplot(data = Births, aes(x = date, y = births, colour = ~ wday)) +
stat_smooth(se = FALSE, alpha = 0.8, geom = "line")
ggplot(data = Births, aes(x = day_of_year, y = births, colour = ~ wday)) +
geom_point(size = 0.4, alpha = 0.5) +
stat_smooth(se = FALSE, geom = "line", alpha = 0.6, size = 1.5)
if (require(dplyr)) {
ggplot(
data = bind_cols(Births |> filter(year == 1978),
Births78 |> rename(births78 = births)),
aes(x = births - births78)
) +
geom_histogram(binwidth = 1)
}
}
if(require(ggplot2)) {
ggplot(data = Births, aes(x = date, y = births, colour = ~ wday)) +
stat_smooth(se = FALSE, alpha = 0.8, geom = "line")
ggplot(data = Births, aes(x = day_of_year, y = births, colour = ~ wday)) +
geom_point(size = 0.4, alpha = 0.5) +
stat_smooth(se = FALSE, geom = "line", alpha = 0.6, size = 1.5)
if (require(dplyr)) {
ggplot(
data = bind_cols(Births |> filter(year == 1978),
Births78 |> rename(births78 = births)),
aes(x = births - births78)
) +
geom_histogram(binwidth = 1)
# SSA records more births than CDC
ggplot(
data = bind_cols(BirthsSSA |> filter(year <= 2003) |> rename(SSA = births),
BirthsCDC |> filter(year >= 2000) |> rename(CDC = births)),
aes(x = SSA - CDC)
) +
geom_histogram(binwidth = 10)
}
}
Data from the 1985 Current Population Survey (CPS85)
Description
The Current Population Survey (CPS) is used to supplement census information between census years. These data consist of a random sample of persons from the CPS85, with information on wages and other characteristics of the workers, including sex, number of years of education, years of work experience, occupational status, region of residence and union membership.
Usage
data(CPS85)
Format
A data frame with 534 observations on the following variables.
wagewage (US dollars per hour)
educnumber of years of education
racea factor with levels
NW(nonwhite) orW(white)sexa factor with levels
FMhispanica factor with levels
HispNHsoutha factor with levels
NSSmarrieda factor with levels
MarriedSingleexpernumber of years of work experience (inferred from
ageandeduc)uniona factor with levels
NotUnionageage in years
sectora factor with levels
clericalconstmanagmanufotherprofsalesservice
Details
Data are from 1985. The data file is recoded from the original, which had entirely numerical codes.
Source
Data are from https://dasl.datadescription.com
References
Berndt, ER. The Practice of Econometrics 1991. Addison-Wesley.
Examples
data(CPS85)
Standard Deck of Cards
Description
A character vector with two or three character representations of each card in a standard 52-card deck.
Usage
Cards
Details
The 2 of clubs is represented as "2C", while the 10 of diamonds is "10D".
Examples
if (require(mosaic)) {
deal(Cards, 13) # bridge hand
deal(Cards, 5) # poker hand
shuffle(Cards) # shuffled deck
}
CoolingWater
Description
Temperature of a mug of water as it cools
Usage
data(CoolingWater)
Format
A data frame with 222 observations of the following variables.
timetime in minutes
temptemperature in Celsius
Details
The water was poured into a mug and a temperature probe inserted into the water with a few seconds of the pour.
Source
These data were collected Stan Wagon to help his mathematical modeling students explore Newton's Law of Cooling and the ways that the law is really only an approximation. More about Stan: http://stanwagon.com.
Examples
data(CoolingWater)
if (require(ggformula)) {
gf_point(temp ~ time, data = CoolingWater, alpha = 0.5)
}
Countries
Description
A data frame containing country names as used by Gapminder and the maps
package to facilitate conversation between the two.
Usage
data(Countries)
Format
A data frame with 258 observations on the following variables.
maptoolsregion name http://mappinghacks.com/ data sets
gapmindercountry name in Gapminder data sets
mapsregion name in
mapsdata sets
Details
The "countries" in the maps data include several other geographic regions (bodies
of water, islands belonging to other countries, Hawaii, etc.) that are not countries.
Furthermore, the maps countries do not include many of the countries that
have been created since ca. 2000. The mapping is therefore many-to-many, and also
includes some NAs when there is no appropriate mapping. Bodies of water in the
maps data, for example, are not assigned a country in the Gapminder.
Examples
data(Countries)
subset(Countries, maps=="Yugoslavia") # Where has Yugoslavia gone?
subset(Countries, is.na(gapminder)) # Things from maps with no Gapminder equivalent
subset(Countries, is.na(maps)) # Things from Gapminder with no maps equivalent
Weight of dimes
Description
Weights of a sample of dimes.
Usage
data(Dimes)
Format
A data frame with 30 observations on the following 2 variables.
massmass of dime in grams
yearyear the dime was minted
Details
These data were collected on a sample taken from a large sack of dimes for the purpose of estimating the total number of dimes in the sack based on the weights of the individual dimes.
Source
Data were collected by Michael Stob.
Galton's dataset of parent and child heights
Description
In the 1880's, Francis Galton was developing ways to quantify the heritability of traits. As part of this work, he collected data on the heights of adult children and their parents.
Usage
data(Galton)
Format
A data frame with 898 observations on the following variables.
familya factor with levels for each family
fatherthe father's height (in inches)
motherthe mother's height (in inches)
sexthe child's sex:
ForMheightthe child's height as an adult (in inches)
nkidsthe number of adult children in the family, or, at least, the number whose heights Galton recorded.
Details
Entries were deleted for those children whose heights were not recorded numerically by Galton, who sometimes used entries such as "tall", "short", "idiotic", "deformed" and so on.
Source
The data were transcribed by J.A. Hanley who has published them at http://www.medicine.mcgill.ca/epidemiology/hanley/galton/
References
"Transmuting" women into men: Galton's family data on human stature. (2004) The American Statistician, 58(3):237-243.
Examples
data(Galton)
Data from the Child Health and Development Studies
Description
Birth weight, date, and gestational period collected as part of the Child Health and Development Studies in 1961 and 1962. Information about the baby's parents — age, education, height, weight, and whether the mother smoked is also recorded.
Usage
data(Gestation)
Format
A data frame with 1236 observations on the following variables.
ididentification number
pluralityall "single fetus" in this data set
outcomeall "live birth" (survived at least 28 days) in this data set
datebirth date where 1096=January 1, 1961
gestationlength of gestation (in days)
wtbirth weight (in ounces)
paritytotal number of previous pregnancies (including fetal deaths and still births)
sex"male"
racemother's race: "asian", "black", "mex", "mixed", or "white"
agemother's age in years at termination of pregnancy
edmother's education
htmother's height in inches to the last completed inch
wt.1mother's prepregnancy weight (in pounds)
dracefather's race
dagefather's age (in years)
dedfather's education
dhtfather's height in inches to the last completed inch
dwtfather's weight (in pounds)
maritalmarital status
,
incfamily yearly income in $2500 increments
smokedoes mother smoke? (never, smokes now, until current pregnancy, once did, not now)
timetime since quitting smoking (never smoked, still smokes, during current preg, within 1 year, 1 to 2 years ago, 2 to 3 years ago, 3 to 4 years ago, 5 to 9 years ago, 10+ years ago, quit and don't know
numbernumber of cigarettes smoked per day for past and current smokers (never, 1-4, 5-9, 10-14, 15-19, 20-29, 30-39, 40-60, 60+, smoke but don't know)
Details
The data were presented by Nolan and Speed to address the question of whether there is a link between maternal smoking and the baby's health for male births.
Source
The book by Nolan and Speed describes the data in more detail and provides an Internet site for accessing them: https://www.stat.berkeley.edu/users/statlabs/
References
D Nolan and T Speed. Stat Labs: Mathematical Statistics Through Applications (2000), Springer-Verlag.
Examples
data(Gestation)
Goose Permit Study
Description
237 hunters were each offered one of 11 cash amounts (bids) ranging from $1 to $200 in return for their goose permits. Hunters returned either their permit or the cash.
Usage
data(GoosePermits)
Format
A data.frame with 11 observations on the following 3 variables.
bidamount offered for permit (US $) (numeric)
keepnumber of hunters who kept the permit and returned the cash (numeric)
sellnumber of hunters who kept the cash and returned the permit (numeric)
Source
Bishop and Heberlein. "Measuring values of extramarket goods: are indirect measures biased?". Amer. J. Agr. Econ. 61, 1979. Available at https://onlinelibrary.wiley.com/doi/abs/10.2307/3180348
Examples
data(GoosePermits)
goose.model <-
glm( cbind(keep, sell) ~ log(bid), data = GoosePermits, family = binomial())
if (require(ggformula)) {
y.hat <- makeFun(goose.model)
gf_point( (keep/(keep+sell)) ~ bid, data = GoosePermits, ylim = c(0,1.05)) |>
gf_fun(y.hat(b) ~ b, add = TRUE, color = "red", alpha = 0.5)
}
Health Evaluation and Linkage to Primary Care
Description
The HELP study was a clinical trial for adult inpatients recruited from a detoxification unit. Patients with no primary care physician were randomized to receive a multidisciplinary assessment and a brief motivational intervention or usual care, with the goal of linking them to primary medical care.
Usage
data(HELPfull)
Format
A data frame with 1472 observations on the following variables.
IDSubject ID
A10Marital Status (1=Married, 2=Remarried, 3=Widowed, 4= Separated, 5=Divorced, 6=Never Married
A11ADo you currently have a living mother? (0=No, 1= Yes
A11BDo you currently have a living father? (0=No, 1=Yes
A11CDo you currently have siblings? (0=No, 1=Yes
A11DDo you currently have a partner (0=No, 1=Yes)
A11EDo you currently have children? (0=No, 1=Yes)
A12B_RECHollingshead category (recode) (0=Cat 1,2,3, 1=Cat 4,5,6, 2=Cat 7,8,9)
A12BHollingshead categories (1=Major profess, 2=Lesser professional, 3=Minor professional, 4=Clerical/sales, 5=Skilled manual, 6=Semi-skilled, 7=Unskilled, 8= Homemaker, 9=No occupation)
A13Usual employment pattern in last 6 months (1=Full time, 2=Part time, 3=Student, 4=Unemployed, 5=Control envir)
A14ALived alone-last 6 months (0=No, 1=Yes)
A14BLived with a partner-last 6 months (0=No, 1=Yes
A14CLived with parent(s)-last 6 months (0=No, 1=Yes)
A14DLived with children-last 6 months (0=No, 1=Yes)
A14ELived with other family-last 6 months (0=No, 1=Yes
A14FLived with friend(s)-last 6 months (0=No, 1=Yes)
A14G_Ta factor with levels
1/2 WAY HOUSE3/4 HOUSEANCHOR INNARMYASSOCIATESBOARDERSBOYFRIENDS MOMCORRECTIONAL FACILITCRACK HOUSEDEALERENTRE FAMILIAFENWOODGAVIN HSEGIRLFRIENDS DAUGHTEGIRLFRIENDS SONGIRLFRIENDS CHILDRENGIRLFRIENDS DAUGHTERGROUP HOMEHALF-WAY HOUSEHALFWAY HOUSEHALFWAY HOUSESHALFWAY HSEHOLDING UNITHOME BORDERHOMELESSHOMELESS SHELTERIN JAILIN PROGRAMSINCARCERATEDJAILJAIL HALFWAY HOUSEJAIL, SHELTERJAIL, STREETJAIL/PROGRAMJAIL/SHELTERJAILSLANDLADYLANDLORDLODGING HOUSEMERIDIAN HOUSENURSING HOMEON THE STREETPARTNERS MOTHERPARTNERS CHILDPARTNERS CHILDRENPRDGRAMSPRISONPROGRAMPROGRAM MTHPPROGRAM ROOMMATESPROGRAM SOBER HOUSEPROGRAM-RESIDENTIALPROGRAM/HALFWAY HOUSPROGRAM/JAILPROGRAM/SHELTERPROGRAM/SHELTERSPROGRAMSPROGRAMS SUBSTANCEPROGRAMS/SHELTERPROGRAMS/SHELTERSPROGRAMS/SHELTERS/DEPROJECT SOARRESIDENTIAL FACILITYRESIDENTIAL PROGRAMROOMING HOUSEROOMING HOUSE (RELIGROOMMATEROOMMATESROOMMATES AT TRANSITRYAN HOUSESALVATION ARMYSHELTERSHELTER/HALFWAY HSESHELTER/HOTELSHELTER/PROGRAMSHELTERSSHELTERS/HOSPITALSSHELTERS/JAILSHELTERS/PROGRAMSSHELTERS/STREETSSOBER HOUSESOBER HOUSINGSOUTH BAY JAILSTEPSONSTREETSTREETSSUBSTANCE ABUSE TREATRANSITIONAL HOUSEVA SHELTERA14GLived w/ other-last 6 months (0=No, 1=Yes)
A15A#nights in overnight shelter-last 6 months
A15B# nights on street-last 6 months
A15C# months in jail-last 6 months
A16A# months in overnight shelter-last 5 years
A16B# moths on street-last 5 years
A16C# months in jail-last 5 years
A17AReceived SSI – past 6 months (0=No, 1=Yes)
A17BReceived SSDI – past 6 months (0=No, 1=Yes)
A17CReceived AFDC – past 6 months (0=No, 1=Yes)
A17DReceived EAEDC – past 6 months (0=No, 1=Yes)
A17EReceived WIC – past 6 months (0=No, 1=Yes)
A17FReceived unemployment benefits – past 6 months (0=No, 1=Yes)
A17GReceived Workman's Compensation – past 6 months (0=No, 1=Yes)
A17HReceived Child Support – past 6 months (0=No, 1=Yes)
A17I_Ta factor with levels
DISABLED VETERANEBT (FOOD STAMPS)EMERGENCY FOOD STAMPFOOD STAMPFOOD STAMPSFOOD STAMPS/VETERANFOOD STAMPS/VETERANSINSURANCE SETTLEMENTPENSION CHECKSECTION 8SERVICE CONNECTED DISOCIAL SECURITYSSDI FOR SONSURVIVORS BENEFITSTEMPORARY DISABILITYVA BENEFITS-DISABILIVA COMPENSATIONVA DISABILITY PENSIOVETERAN BENEFITSVETERANS SERVICESVETERANS AFFAIRSA17IReceived other income – past 6 months (0=No, 1=Yes)
A18_REC1Most money made in 1 year (recode) (0=$19,000 or less, 1=$20,000-$49,000, 2=$50,000 or more)
A18_REC2Most money made-continuous recode
A18Most money made in any 1 year-last 5 years (1=<5000, 2=5000-10000, 3=11000-19000, 4=20000-29000, 5=30000-39000, 6=40000-49000, 7=50000+
A1Gender (1=Male, 2=Female)
A9Years of education completed
ABUSE2Type of abuse (0=No abuse, 1=Physical only, 2=Sexual only, 3=Physical and sexual)
ABUSE3Type of abuse (0=No abuse, 1=Physical only, 2=Sexual +/- physical (0=No, 1=Yes)
ABUSEAbuse-physical or sexual (0=No abuse, 1=Family abuse, 2=Stranger only abuse)
AGEAge in years
ALCOHOL1st/2nd drug of coice=Alcohol (0=No, 1=Yes)
ALCQ_30Total number drinks past 30 days
ALONE6MUsually lived alone past 6 months (0=No, 1=Yes)
ALT_TRTAlternative tratments (0=No, 1=Yes)
ANYSUBSTATUSUsed alcohol, heroin, or cocaine since leaving detox-6 months
ANY_INSDid you have health insurance in past 6 months (0=No, 1=Yes)
ANY_UTILAny recent health utilization (0=No, 1=Yes)
ANY_VIS_CUMULCumulative # visits to regular doctor's office
ANY_VIS# visits to regular doctor's office–This time point
B10Any physcal/emotional problem interfere with social activities-last 4 weeks (1=All of the time, 2=Most of the time, 3=Some of the time, 4= A lttle of time, 5= None of the time)
B11AI seem to get sick easier than other people (1=Definitely true, 2=Mostly True, 3=Don't know, 4=Mostly false, 5=Definitely false)
B11BI am as healthy as anybody I know (1=Definitely true, 2=Mostly true, 3=Don't know, 4=Mostly false, 5=Definitely False)
B11CI expect my health to get worse (1=Definitely true, 2=Mostly true, 3=Don't know, 3=Mostly false, 5=Definitely false)
B11DMy health is excellent (1=Definitely true, 2=Mostly true, 3=Don't know, 4=Mostly false, 5=Definitely false)
B1In general, how is your health (1=Excellent, 2=Very Good, 3=Good, 4=Fair, 5=Poor)
B2Compared to 1 year ago, how is your health now (1=Much better, 2=Somewhat better, 3=About the same, 4=Somewhat worse, 5=Much worse)
B3ADoes health limit you in vigorous activity (1=Limited a lot, 2=Limited a little, 3=Not limited)
B3BDoes your health limit you in moderate activity (1=Limited a lot, 2=Limited a little, 3=Not limited)
B3CDoes health limit you in lift/carry groceries (1=Limited a lot, 2=Limited a little, 3=Not limited)
B3DDoes health limit you in climb several stair flights (1=Limited a lot, 2=Limited a little, 3=Not limited)
B3EDoes health limit you in climb 1 stair flight (1=Limited a lot, 2=Limited a little, 3=Not limited)
B3FDoes health limit you in bend/kneel/stoop (1=Limited a lot, 2=Limited a little, 3=Not limited)
B3GDoes health limit you in walking >1 mile (1=Limited a lot, 2=Limited a little, 3=Not limited)
B3HDoes health limit you in walking sevral blocks (1=Limited a lot, 2=Limited a little, 3=Not limited)
B3IDoes health limit you in walking 1 block (1=Limited a lot, 2=Limited a little, 3=Not limited)
B3JDoes health limit you in bathing/dressing self (1=Limited a lot, 2=Limited a little, 3=Not limited)
B4ACut down work/activity due to physical health-last 4 weeks (0=No, 1=Yes)
B4BAccomplish less due to phys health-last 4 weeks (0=No, 1=Yes)
B4CLim wrk/act type due to phys health-last 4 weeks (0=No, 1=Yes)
B4DDiff perf work due to phys health-last 4 weeks (0=No, 1=Yes)
B5ACut wrk/act time due to emot prbs-last 4 weeks (0=No, 1=Yes)
B5BAccomplish ess due to emot probs-last 4 weeks (0=No, 1=Yes)
B5C<carefl w/wrk/act due to em prb-last 4 weeks (0=No, 1=Yes)
B6Ext phys/em intf w/norm soc act-last 4 weeks (1=Not al all, 2=Slightly, 3=Moderately, 4=Quite a bit, 5=Extremely)
B7Amount of bodily pain – past 4 weeks (1=None, 2=Very mild, 3= Mild, 4=Moderate, 5= Severe, 6= Very severe)
B8Amount of pain interfering with normal work-last 4 weeks (1=Not at all, 2=A little bit, 3=Moderately, 4=Quite a bit, 5=Extremely
B9ADid you feel full of pep – past 4 weeks (1=All of the time, 2=Most of the time, 3 = Good bit of the time, 4=Some of the time, 5=A little of time, 6=None of the time)
B9BHave you been nervous – past 4 weeks (1=All of the time, 2=Most of the time, 3 = Good bit of the time, 4=Some of the time, 5=A little of time, 6=None of the time)
B9CFelt nothing could cheer you-last 4 weeks (1=All of the time, 2=Most of the time, 3 = Good bit of the time, 4=Some of the time, 5=A little of time, 6=None of the time)
B9DHave you felt calm/peaceful – past 4 weeks (1=All of the time, 2=Most of the time, 3 = Good bit of the time, 4=Some of the time, 5=A little of time, 6=None of the time)
B9EDid you have a lot of energy – past 4 weeks (1=All of the time, 2=Most of the time, 3 = Good bit of the time, 4=Some of the time, 5=A little of time, 6=None of the time)
B9FDid you feel downhearted – past 4 weeks (1=All of the time, 2=Most of the time, 3 = Good bit of the time, 4=Some of the time, 5=A little of time, 6=None of the time)
B9GDid you feel worn out – past 4 weeks (1=All of the time, 2=Most of the time, 3 = Good bit of the time, 4=Some of the time, 5=A little of time, 6=None of the time)
B9HHave you been a happy pers – past 4 weeks (1=All of the time, 2=Most of the time, 3 = Good bit of the time, 4=Some of the time, 5=A little of time, 6=None of the time)
B9IDid you feel tired – past 4 weeks (1=All of the time, 2=Most of the time, 3 = Good bit of the time, 4=Some of the time, 5=A little of time, 6=None of the time)
BIRTHPLCWhere born (recode) (0=USA, 1=Foreign)
BPSF-36 pain index (0-100)
C1ATolf by MD had seix, epil, convuls (0=No, 1=Yes)
C1BTold by MD had asthma, emphysema, chr lung dis (0=No, 1=Yes)
C1CTold by MD had MI (0=No, 1=Yes)
C1DTold by MD had CHF (0=No, 1=Yes)
C1ETold by MD had other heart dis (req med) (0=No, 1=Yes)
C1FTold by MD had HBP (0=No, 1=Yes)
C1GTold by MD had chronic liver disease (0=No, 1=Yes)
C1HTold by MD had kidney failure (0=No, 1=Yes)
C1ITold by MD had chronic art, osteoarth (0=No, 1=Yes)
C1JTold by MD had peripheral neuropathy (0=No, 1=Yes)
C1KEver told by MD had cancer (0=No, 1=Yes)
C1LEver told by MD had diabetes (0=No, 1=Yes)
C1MEver told by MD had stroke (0=No, 1=Yes)
C2A1Have you ever had skin infections (0=No, 1=Yes)
C2A2Have you had skin infections – past 6 months (0=No, 1=Yes)
C2B1Have you ever had pneumonia (0=No, 1=Yes)
C2B2Have you had pneumonia – past 6 months (0=No, 1=Yes)
C2C1Have you ever had septic arthritis (0=No, 1=Yes)
C2C2Have you had septic arthritis – past 6 months (0=No, 1=Yes)
C2D1Have you ever had TB (0=No, 1=Yes)
C2D2Have you had TB-last 6 months (0=No, 1=Yes)
C2E1Have you ever had endocarditis (0=No, 1=Yes)
C2E2Have you had endocarditis – past 6 months (0=No, 1=Yes)
C2F1Have you ever had an ulcer (0=No, 1=Yes)
C2F2Have you had an ulcer – past 6 months (0=No, 1=Yes)
C2G1Have you ever had pancreatitis (0=No, 1=Yes)
C2G2Have you had pancreatitis – past 6 months (0=No, 1=Yes)
C2H1Ever had abdom pain req overnt hosp stay (0=No, 1=Yes)
C2H2Abdom pain req ovrnt hosp stay-last 6 months (0=No, 1=Yes)
C2I1Have you ever vomited blood (0=No, 1=Yes)
C2I2Have you vomited blood – past 6 months (0=No, 1=Yes)
C2J1Have you ever had hepatitis (0=No, 1=Yes)
C2J2Have you had hepatitis – past 6 months (0=No, 1=Yes)
C2K1Ever had blood clots in legs/lungs (0=No, 1=Yes)
C2K2Blood clots in legs/lungs – past 6 months (0=No, 1=Yes)
C2L1Have you ever had osteomyelitis (0=No, 1=Yes)
C2L2Have you had osteomyelitis – past 6 months (0=No, 1=Yes)
C2M1Chest pain using cocaine req ER/hosp (0=No, 1=Yes)
C2M2Chest pain using coc req ER/hosp-last 6 months (0=No, 1=Yes)
C2N1Have you ever had jaundice (0=No, 1=Yes)
C2N2Have you had jaundice – past 6 months (0=No, 1=Yes)
C2O1Lower back pain > 3 months req med attn (0=No, 1=Yes)
C2O2Lwr back pain >3 months req med attention-last 6 months (0=No, 1=Yes)
C2P1Ever had seizures or convulsions (0=No, 1=Yes)
C2P2Had seizures or convulsions – past 6 months (0=No, 1=Yes)
C2Q1Ever had drug/alcohol overdose requiring ER attention (0=No, 1=Yes)
C2Q2Drug/alcohol overdose req ER attn (0=No, 1=Yes)
C2R1Have you ever had a gunshot wound (0=No, 1=Yes)
C2R2Had a gunshot wound – past 6 months (0=No, 1=Yes)
C2S1Have you ever had a stab wound (0=No, 1=Yes)
C2S2Have you had a stab wound – past 6 months (0=No, 1=Yes)
C2T1Ever had accident/falls req med attn (0=No, 1=Yes)
C2T2Had accident/falls req med attn – past 6 months (0=No, 1=Yes)
C2U1Ever had fract/disloc to bones/joints (0=No, 1=Yes)
C2U2Fract/disloc to bones/joints – past 6 months (0=No, 1=Yes)
C2V1Ever had injury from traffic accident (0=No, 1=Yes)
C2V2Had injury from traffic accident – past 6 months (0=No, 1=Yes)
C2W1Have you ever had a head injury (0=No, 1=Yes)
C2W2Have you had a head injury – past 6 months (0=No, 1=Yes)
C3A1Have you ever had syphilis (0=No, 1=Yes)
C3A2# times had syphilis
C3A3Have you had syphilis in last 6 months (0=No, 1=Yes)
C3B1Have you ever had gonorrhea (0=No, 1=Yes)
C3B2# times had gonorrhea
C3B3Have you had gonorrhea in last 6 months (0=No, 1=Yes)
C3C1Have you ever had chlamydia (0=No, 1=Yes)
C3C2# of times had Chlamydia
C3C3Have you had chlamydia in last 6 months (0=No, 1=Yes)
C3DHave you ever had genital warts (0=No, 1=Yes)
C3EHave you ever had genital herpes (0=No, 1=Yes)
C3F1Have you ever had other STD's (not HIV) (0=No, 1=Yes)
C3F2# of times had other STD's (not HIV)
C3F3Had other STD's (not HIV)-last 6 months (0=No, 1=Yes)
C3F_Ta factor with levels
7CRABSCRABS - TRICHONOMISCRABS, HEP BDOESNT KNOW NAMEHAS HAD ALL 3 ABCHEP BHEP B, TRICAMONASHEP. BHEPATITIS BHEPATITS BTRICHAMONAS VAGINALATRICHAMONISTRICHOMONASTRICHOMONIASISTRICHOMONISTRICHOMONIS VAGINITITRICHOMORASTRICHONOMISC3G1Have you ever been tested for HIV/AIDS (0=No, 1=Yes)
C3G2# times tested for HIV/AIDS
C3G3Have you been tested for HIV/AIDS-last 6 months (0=No, 1=Yes)
C3G4What was the result of last test (1=Positive, 2=Negative, 3=Refused, 4=Never got result, 5=Inconclusive
C3H1Have you ever had PID (0=No, 1=Yes)
C3H2# of times had PID
C3H3Have you had PID in last 6 months (0=No, 1=Yes)
C3IHave you ever had a Pap smear (0=No, 1=Yes)
C3JHave you had a Pap smear in last 3 years (0=No, 1=Yes)
C3K_MHow many months pregnant
C3KAre you pregnant (0=No, 1=Yes)
CESD_CUTCES-D score > 21 y/n (0=No, 1=Yes)
CES_DCES-D score, measure of depressive symptoms, high scores are worse
CHR_6MChronic medical conds/HIV – past 6m y/n (0=No, 1=Yes)
CHR_EVERChronic medical conds/HIV-ever y/n (0=No, 1=Yes)
CHR_SUMSum chronic medical conds/HIV ever
CNTRLInDUC-2L-Control score
COC_HER1st/2nd drug of choice=cocaine or heroine (0=No, 1=Yes)
CUAD_CCUAD-Cocaine
CUAD_HCUAD-Heroin
CURPHYABCurrent abuse-physical (0=No, 1=Yes)
CURPHYSEXABCurent abuse-physical or sexual (0=No abuse, 1=Physical only, 2=Sexual +/- physical)
CURSEXABCurrent abuse-sexual (0=No, 1=Yes)
C_AUASI-Composite score for alcohol use
C_DUASI-Composite score for drug use
C_MSASI-Composite medical status
D1$ of times hospitalized for med probs
D2Take prescription medicdation regularly for physical problem (0=No, 1=Yes)
D3_RECAny medical problems past 30d y/n (0=No, 1=Yes)
D3# days had med probs-30 days bef detox
D4_RECBothered by medical problems y/n (0=No, 1=Yes)
D4How bother by med prob-30days bef detox (0=Not at all, 1=Slightly, 2=Moderately, 3=Considerably, 4=Extremely)
D5_RECMedical trtmt is important y/n (0=No, 1=Yes)
D5How import is trtmnt for these med probs (0=Not at all, 1=Slightly, 2= Moderately, 3= Considerably, 4= Extremely
DAYSANYSUBtime (days) from baseline to first alcohol, heroin, or cocaine since leaving detox-6m
DAYSDRINKTime (days) from baseline to first drink since leaving detox-6m
DAYSLINKTime (days) to linkage to primary care within 12 months (by administrative record)
DAYS_SINCE_BL# of days from baseline to current interview
DAYS_SINCE_PREV# of days from previous to current interview
DEADa numeric vector
DEC_AMSOCRATES-Ambivalence-Decile
DEC_RESOCRATES-Recognition-Decile
DEC_TSSOCRATES-Taking steps-Decile
DRINKSTATUSDrank alcohol since leaving detox-6m
DRUGRISKRAB-Drug risk total
E10Ahave you been to med clinic-last 6 months (0=No, 1=Yes)
E10B1_RMental health treatment past 6m y/n (0=No, 1=Yes)
E10B1# x visit ment health clin/prof-last 6 months
E10B2_RMed clinic/private MD past 6m y/n (0=No, 1=Yes)
E10B2# x visited med clin/priv MD-last 6 months
E10C19Visited private MD-last 6 months (0=No, 1=Yes)
E11ADid you stay ovrnite/+ in hosp-last 6 months (0=No, 1=Yes)
E11B# times ovrnight/+ in hosp-last 6 months
E11CTotal # nights in hosp-last 6 months
E12AVisited Hosp ER for med care – past 6 months (0=No, 1=Yes)
E12B# times visited hosp ER-last 6 months
E13Tlt # visits to MDs-last 2 weeks bef detox
E14ARecd trtmt from acupuncturist-last 6 months (0=No, 1=Yes)
E14BRecd trtmt from chiropractor-last 6 months (0=No, 1=Yes)
E14CTrtd by hol/herb/hom med prac-last 6 months (0=No, 1=Yes)
E14DRecd trtmt from spirit healer-last 6 months (0=No, 1=Yes)
E14EHave you had biofeedback-last 6 months (0=No, 1=Yes)
E14FHave you underwent hypnosis-last 6 months (0=No, 1=Yes)
E14GReceived other treatment-last 6 months (0=No, 1=Yes)
E15ATried to get subst ab services-last 6 months (0=No, 1=Yes)
E15BAlways able to get subst ab servies (0=No, 1=Yes)
E15C10My insurance didn't cover services (0=No, 1=Yes)
E15C11There were no beds available at the prog (0=No, 1=Yes)
E15C12Other reason not get sub ab services (0=No, 1=Yes)
E15C1I could not pay for services (0=No, 1=Yes)
E15C2I did not know where to go for help (0=No, 1=Yes)
E15C3Couldn't get to services due to transp prob (0=No, 1=Yes)
E15C4The offie/clinic hrs were inconvenient (0=No, 1=Yes)
E15C5Didn't speak/understnd Englsh well enough (0=No, 1=Yes)
E15C6Afraid other might find out about prob (0=No, 1=Yes)
E15C7My substance abuse interfered (0=No, 1=Yes)
E15C8Didn't have someone to watch my children (0=No, 1=Yes)
E15C9I did not want to lose my job (0=No, 1=Yes)
E16A10I do not want to lose my job (0=No, 1=Yes)
E16A11My insurance doesn't cover charges (0=No, 1=Yes)
E16A12I do not feel I need a regular MD (0=No, 1=Yes)
E16A13Other reasons don't have regular MD (0=No, 1=Yes)
E16A1I cannot pay for services (0=No, 1=Yes)
E16A2I am not eligible for free care (0=No, 1=Yes)
E16A3I do not know where to go (0=No, 1=Yes)
E16A4Can't get to services due to trans prob (0=No, 1=Yes)
E16A5a numeric vectorOffice/clinic hours are inconvenient (0=No, 1=Yes)
E16A6I don't speak/understnd enough English (0=No, 1=Yes)
E16A7Afraid othrs find out about my health prob (0=No, 1=Yes)
E16A8My substance abuse interferes (0=No, 1=Yes)
E16A9I don't have someone to watch my children (0=No, 1=Yes)
E16A_DDBarrier to regular MD: dislike docs/system (0=No, 1=Yes)
E16A_IBBarrier to regular MD: internal barriers (0=No, 1=Yes)
E16A_RTBarrier to regular MD: red tape (0=No, 1=Yes)
E16A_TMBarrier to regular MD: time restrictions (0=No, 1=Yes)
E18AI could not pay for services (0=No, 1=Yes)
E18BI did not know where to go for help (0=No, 1=Yes)
E18CCouldn't get to services due to transp prob (0=No, 1=Yes)
E18DThe office/clinic hrs were inconvenient (0=No, 1=Yes)
E18FAfraid others might find out about prob (0=No, 1=Yes)
E18GMy substance abuse interfered (0=No, 1=Yes)
E18HDidn't have someone to watch my children (0=No, 1=Yes)
E18II did not want to lose my job (0=No, 1=Yes)
E18JMy insurance didn't cover services (0=No, 1=Yes)
E18KThere were no beds available at the prog (0=No, 1=Yes)
E18LI do not need substance abuse services (0=No, 1=Yes)
E18MOther reason not get sub ab services (0=No, 1=Yes)
E2ADetox prog for alcohol or drug prob-last 6 months (0=No, 1=Yes)
E2B# times entered a detox prog-last 6 months
E2C# nights ovrnight in detox prg-last 6 months
E3AHolding unit for drug/alcohol prob-last 6 months (0=No, 1=Yes)
E3B# times in holding unity=last 6 months
E3C# total nights in holding unit-last 6 months
E4AIn halfway hse/resid facil-last 6 months (0=No, 1=Yes)
E4B# times in hlfwy hse/res facil-last 6 months
E4CTtl nites in hlfwy hse/res fac-last 6 months
E5AIn day trtmt prg for alcohol/drug-last 6 months (0=No, 1=Yes)
E5BTotal # days in day trtmt prg-last 6 months
E6In methadone maintenance prg-last 6 months (0=No, 1=Yes)
E7AVisit outpt prg subst ab couns-last 6 months (0=No, 1=Yes)
E7B# visits outpt prg subst ab couns-last 6 months
E8A1Saw MD/H care worker regarding alcohol/drugs-last 6 months (0=No, 1=Yes)
E8A2Saw Prst/Min/Rabbi re alcohol/drugs-last 6 months (0=No, 1=Yes)
E8A3Employ Asst Prg for alcohol/drug prb-last 6 months (0=No, 1=Yes)
E8A4Oth source cnsl for alcohol/drug prb-last 6 months (0=No, 1=Yes)
E9AAA/NA/slf-hlp for drug/alcohol/emot-last 6 months (0=No, 1=Yes)
E9BHow often attend AA/NA/slf-hlp-last 6 months (1=Daily, 2=2-3 Times/week, 3=Weekly, 4=Every 2 weeks, 5=Once/month
EPI_6M2BEpisodic(C2A-C2O)-6m y/n (0=No, 1=Yes)
EPI_6MEpisodic (C2A-C2O,C2R-C2U, STD)-6m y/n (0=No, 1=Yes)
EPI_SUMSum episodic (C2A-C2O, C2R-C2U, STD)-6m
F1ABothered by thngs not generally bothered by (0=Rarely/never, 1=Some of the time, 2=Occas/moderately, 3=Most of the time)
F1BMy appetite was poor (0=Rarely/never, 1=Some of the time, 2=Occas/moderately, 3=Most of the time)
F1CCouldn't shake blues evn w/fam+frnds hlp (0=Rarely/never, 1=Some of the time, 2=Occas/moderately, 3=Most of the time)
F1DFelt I was just as good as other people (0=Rarely/never, 1=Some of the time, 2=Occas/moderately, 3=Most of the time)
F1EHad trouble keeping mind on what doing (0=Rarely/never, 1=Some of the time, 2=Occas/moderately, 3=Most of the time)
F1FI felt depressed (0=Rarely/never, 1=Some of the time, 2=Occas/moderately, 3=Most of the time)
F1GI felt everything I did was an effort (0=Rarely/never, 1=Some of the time, 2=Occas/moderately, 3=Most of the time)
F1HI felt hopeful about the future (0=Rarely/never, 1=Some of the time, 2=Occas/moderately, 3=Most of the time)
F1II thought my life had been a failure (0=Rarely/never, 1=Some of the time, 2=Occas/moderately, 3=Most of the time)
F1JI felt fearful (0=Rarely/never, 1=Some of the time, 2=Occas/moderately, 3=Most of the time)
F1KMy sleep was restless (0=Rarely/never, 1=Some of the time, 2=Occas/moderately, 3=Most of the time)
F1LI was happy (0=Rarely/never, 1=Some of the time, 2=Occas/moderately, 3=Most of the time)
F1MI talked less than usual (0=Rarely/never, 1=Some of the time, 2=Occas/moderately, 3=Most of the time)
F1NI felt lonely (0=Rarely/never, 1=Some of the time, 2=Occas/moderately, 3=Most of the time)
F1OPeople were unfriendly (0=Rarely/never, 1=Some of the time, 2=Occas/moderately, 3=Most of the time)
F1PI enjoyed life (0=Rarely/never, 1=Some of the time, 2=Occas/moderately, 3=Most of the time)
F1QI had crying spells (0=Rarely/never, 1=Some of the time, 2=Occas/moderately, 3=Most of the time)
F1RI felt sad (0=Rarely/never, 1=Some of the time, 2=Occas/moderately, 3=Most of the time)
F1SI felt that people dislike me (0=Rarely/never, 1=Some of the time, 2=Occas/moderately, 3=Most of the time)
F1TI could not get going (0=Rarely/never, 1=Some of the time, 2=Occas/moderately, 3=Most of the time)
FAMABUSEFamily abuse-physical or sexual (0=No, 1=Yes)
FRML_SATFormal substance abuse treatment y/n (0=No, 1=Yes)
G1A_30Diff contr viol beh-sig per last 30 days (0=No, 1=Yes)
G1ADiff contr viol beh for sig time per evr (0=No, 1=Yes)
G1B_30Had thoughts of suicide-last 30 days (0=No, 1=Yes)
G1B_RECSuicidal thoughts past 30 days y/n (0=No, 1=Yes)
G1BEver had thoughts of suicide (0=No, 1=Yes)
G1C_30Attempted suicide-last 30 days (0=No, 1=Yes)
G1CAttempted suicide ever (0=No, 1=Yes)
G1D_30Prescr med for psy/emot prob-last 30 days (0=No, 1=Yes)
G1D_RECPrescribed psych meds past 30 days y/n (0=No, 1=Yes)
G1DPrescr med for pst/emot prob ever (0=No, 1=Yes)
GHSF-36 general health perceptions (0-100)
GOV_SUPPReceived government support past 6 m (0=No, 1=Yes)
GROUPRandomization Group (0=Control, 1=Clinic)
H10_30# days in last 30 bef detox used cannabis
H10_LT# years regularly used cannabis
H10_PRBProblem sub: marijuana, cannabis (0=No, 1=Yes)
H10_RTRoute of admin of cannabis (0=N/A. 1=Oral, 2=Nasal, 3=Smoking, 4=Non-IV injection, 5=IV)
H11_30# days in last 30 bef detox used halluc
H11_LT# years regularly used hallucinogens
H11_PRBProblem sub: hallucinogens (0=No, 1=Yes)
H11_RTRoute of admin of hallucinogens (0=N/A. 1=Oral, 2=Nasal, 3=Smoking, 4=Non-IV injection, 5=IV)
H12_30# days in last 30 bef detox used inhalant
H12_LT# years regularly used inhalants
H12_PRBProblem sub: inhalants (0=No, 1=Yes)
H12_RTRoute of admin of inhalants (0=N/A. 1=Oral, 2=Nasal, 3=Smoking, 4=Non-IV injection, 5=IV)
H13_30# days used >1 sub/day-last 30 bef detox
H13_LT# years regularly used >1 subst/day
H13_RTRoute of admin of >1 subst/day (0=N/A. 1=Oral, 2=Nasal, 3=Smoking, 4=Non-IV injection, 5=IV)
H14According to interviewer, which substance is main problem (0=No problem, 1=Alcohol, 2=Alcohol to intox, 3=Heroin 4=Methadone, 5=Other opiate/analg, 6=Barbituates, 7=Sed/hyp/tranq, 8=Cocaine, 9=Amphetamines, 10=Marij/cannabis, 15=Alcohol and one or more drug, 16=More than one drug
H15A# times had alcohol DTs
H15B# times overdosed on drugs
H16A$ spent on alcohol-last 30 days bef detox
H16B$ spent on drugs-last 30 days bef detox
H17A# days had alcohol prob-last 30 days bef det
H17B# days had drug prob-last 30 days bef det
H18AHow troubled by alcohol probs-last 30 days (0=Not at all, 1=Slightly, 2=Moderately, 3=Considerably, 4=Extremely)
H18BHow troubled by drug probs-last 30 days (0=Not at all, 1=Slightly, 2=Moderately, 3=Considerably, 4=Extremely)
H19AHow import is treatment for alcohol problems now (0=Not at all, 1=Slightly, 2=Moderately, 3=Considerably, 4=Extremely)
H19BHow important is trtmnt for drug probs now (0=Not at all, 1=Slightly, 2=Moderately, 3=Considerably, 4=Extremely)
H1_30# days in past 30 bef detox used alcohol
H1_LT# years regularly used alcohol
H1_RTRoute of administration use alcohol (0=N/A. 1=Oral, 2=Nasal, 3=Smoking, 4=Non-IV injection, 5=IV)
H2_30#days in 3- bef detox use alcohol to intox
H2_LT# years regularly used alcohol to intox
H2_PRBProblem sub: alcohol to intox (0=No, 1=Yes)
H2_RTRoute of admin use alcohol to intox (0=N/A. 1=Oral, 2=Nasal, 3=Smoking, 4=Non-IV injection, 5=IV)
H3_30# days in past 30 bef detox used heroin
H3_LT# years regularly used heroin
H3_PRBProblem sub: heroin (0=No, 1=Yes)
H3_RTRoute of administration of heroin (0=N/A. 1=Oral, 2=Nasal, 3=Smoking, 4=Non-IV injection, 5=IV)
H4_30# days used methadone-last 30 bef detox
H4_LT# years regularly used methadone
H4_PRBProblem sub: methadone (0=No, 1=Yes)
H4_RTRoute of administration of methadone (0=N/A. 1=Oral, 2=Nasal, 3=Smoking, 4=Non-IV injection, 5=IV)
H5_30# days used opiates/analg-last 30 bef detox
H5_LT# years regularly used oth opiates/analg
H5_PRBProblem sub: other opiates/analg (0=No, 1=Yes)
H5_RTRoute of admin of other opiates/analg (0=N/A. 1=Oral, 2=Nasal, 3=Smoking, 4=Non-IV injection, 5=IV)
H6_30# days in past 30 before detox used barbiturates
H6_LT# years regularly used barbiturates
H6_PRBProblem sub: barbiturates (0=No, 1=Yes)
H6_RTRoute of admin of barbiturates (0=N/A. 1=Oral, 2=Nasal, 3=Smoking, 4=Non-IV injection, 5=IV)
H7_30# days used sed/hyp/trnq-last 30 bef det
H7_LT# years regularly used sed/hyp/trnq
H7_PRBProblem sub: sedat/hyp/tranq (0=No, 1=Yes)
H7_RTRoute of admin of sed/hyp/trnq (0=N/A. 1=Oral, 2=Nasal, 3=Smoking, 4=Non-IV injection, 5=IV)
H8_30# days in last 30 bef detox used cocaine
H8_LT# years regularly used cocaine
H8_PRBProblem sub: cocaine (0=No, 1=Yes)
H8_RTRoute of admin of cocaine (0=N/A. 1=Oral, 2=Nasal, 3=Smoking, 4=Non-IV injection, 5=IV)
H9_30# days in last 30 bef detox used amphet
H9_LT# years regularly used amphetamines
H9_PRBProblem sub: amphetamines (0=No, 1=Yes)
H9_RTRoute of admin of amphetamines (0=N/A. 1=Oral, 2=Nasal, 3=Smoking, 4=Non-IV injection, 5=IV)
HOMELESSHomeless-shelter/street past 6 m (0=No, 1=Yes)
HS_GRADHigh school graduate (0=No, 1=Yes)
HTRaw SF-36 health transition item
I1Avg # drinks in last 30 days bef detox
I2Most drank any 1 day in last 30 bef detox
I3On days used heroin, avg # bags used
I4Most bags heroin used any 1 day – 30 before det
I5Avg $ amt of heroin used per day
I6AOn days used cocaine, avg # bags used
I6BOn days used cocaine, avg # rocks used
I7AMst bgs cocaine use any 1 day-30 bef det
I7BMst rcks cocaine use any 1 day-30 bef det
I8Avg $ amt of cocaine used per day
IMPUL2Inventory of Drug Use Consequences InDUC-2L-Impulse control-Raw (w/0 M23)
IMPULInventory of Drug Use Consequences InDUL-2L-Impulse control-Raw
INDTOT2InDUC-2L-Total drlnC-Raw- w/o M23 and M48
INDTOTInDUC-2LTotal drlnC sore-Raw
INTERInDUC-2L-Interpersonal-Raw
INTRAInDUC-2L-Intrapersonal-Raw
INT_TIME1# of months from baseline to current interview
INT_TIME2# of months from previous to current interview
J10AGet physically sick when stop using heroin (0=No, 1=Yes)
J10BEver use heroin to prevent getting sick (0=No, 1=Yes)
J1Evr don't stop using cocaine when should (0=No, 1=Yes)
J2Ever tried to cut down on cocaine (0=No, 1=Yes)
J3Does cocaine take up a lot of your time (0=No, 1=Yes)
J4Need use > cocaine to get some feeling (0=No, 1=Yes)
J5AGet physically sick when stop using cocaine (0=No, 1=Yes)
J5BEver use cocaine to prevent getting sick (0=No, 1=Yes)
J6Ever don't stop using heroin when should (0=No, 1=Yes)
J7Ever tried to cut down on heroin (0=No, 1=Yes)
J8Does heroin take up a lot of your time (0=No, 1=Yes)
J9Need use > heroin to get some feeling (0=No, 1=Yes)
JAIL_5YRAny jail time past 5 years y/n (0=No, 1=Yes)
JAIL_MOSTotal months in jail past 5 years
K1Do you currently smoke cigarettes (1=Yes-every day, 2=Yes-some days, 3=No-former smoker, 4=No-never>100 cigarettes
K2Avg # cigarettes smoked per day
K3Considering quitting cigarettes within next 6 months (0=No, 1=Yes)
L10Have had blkouts as result of drinkng (0=No, never, 1=Sometimes, 2=Often, 3=Alm evry time drink)
L11Do you carry bottle or keep close by (0=No, 1=Some of the time, 2=Most of the time)
L12After abstin end up drink heavily again (0=No, 1=Sometimes, 2=Almost evry time)
L13Passed out due to drinking-last 12 months (0=No, 1=Once, 2=More than once)
L14Had convuls following period of drinkng (0=No, 1=Once, 2=Several times)
L15Do you drink throughout the day (0=No, 1=Yes)
L16After drinkng heavily was thinkng unclear (0=No, 1=Yes, few hrs, 2=Yes,1-2 days, 3=Yes, many days)
L17D/t drinkng felt heart beat rapidly (0=No, 1=Once, 2=Several times)
L18Do you constntly think about drinkng/alcohol (0=No, 1=Yes)
L19D/t drinkng heard things not there (0=No, 1=Once, 2= Several times)
L1How often drink last time drank (1=To get high/less, 2=To get drunk, 3=To pass out)
L20Had weird/fright sensations when drinkng (0=No, 1=Once or twice, 2=Often)
L21When drinkng felt things rawl not there (0=No, 1=Once, 2=Several times)
L22With respect to blackouts (0=Never had one, 1=Had for <1hr, 2=Had several hrs, 3=Had for day/+)
L23Ever tried to cut down on drinking & failed (0=No, 1=Once, 2=Several times)
L24Do you gulp drinks (0=No, 1=Yes)
L25After taking 1 or 2 drinks can you stop (0=No, 1=Yes)
L2Often have hangovers Sun or Mon mornings (0=No, 1=Yes)
L3Have you had the shakes when sobering (0=No, 1=Sometimes, 2=Alm evry time drink)
L4Do you get physically sick as reslt of drinking (0=No, 1=Sometimes, 2=Alm evry time drink)
L5have you had the DTs (0=No, 1=Once, 2=Several times
L6When drink do you stumble/stagger/weave (0=No, 1=Sometimes, 2=Often)
L7D/t drinkng felt overly hot/sweaty (0=No, 1=Once, 2=Several times)
L8As result of drinkng saw thngs not there (0=No, 1=Once, 2=Several times)
L9Panic because fear not have drink if need it (0=No, 1=Yes)
LINKSTATUSLinked to primary care within 12 months (by administrative record)
M10Using alcohol/1 drug caused > use othr drugs (0=No, 1=Yes)
M11I have been sick/vomited aft alcohol/drug use (0=No, 1=Yes)
M12I have been unhappy because of alcohol/drug use (0=No, 1=Yes)
M13Lost weight/eaten poorly due to alcohol/drug use (0=No, 1=Yes)
M14Fail to do what expected due to alcohol/drug use (0=No, 1=Yes)
M15Using alcohol/drugs has helped me to relax (0=No, 1=Yes)
M16Felt guilt/ashamed because of my alcohol drug use (0=No, 1=Yes)
M17Said/done emarras thngs when on alcohol/drug (0=No, 1=Yes)
M18Personality changed for worse on alcohol/drug (0=No, 1=Yes)
M19Taken foolish risk when using alcohol/drugs (0=No, 1=Yes)
M1Had hangover/felt bad aftr using alcohol/drugs (0=No, 1=Yes)
M20Gotten into trouble because of alcohol/drug use (0=No, 1=Yes)
M21Said cruel things while using alcohol/drugs (0=No, 1=Yes)
M22Done impuls thngs regret due to alcohol/drug use (0=No, 1=Yes)
M23Gotten in physical fights when use alcohol/drugs (0=No, 1=Yes)
M24My physical health was harmed by alcohol/drug use (0=No, 1=Yes)
M25Using alcohol/drug helped me have more + outlook (0=No, 1=Yes)
M26I have had money probs because of my alcohol/drug use (0=No, 1=Yes)
M27My love relat harmed due to my alcohol/drug use (0=No, 1=Yes)
M28Smoked tobacco more when using alcohol/drugs (0=No, 1=Yes)
M29My physical appearance harmed by alcohol/drug use (0=No, 1=Yes)
M2Felt bad about self because of alcohol/drug use (0=No, 1=Yes)
M30My family hurt because of my alcohol drug use (0=No, 1=Yes)
M31Close relationsp damaged due to alcohol/drug use (0=No, 1=Yes)
M32Spent time in jail because of my alcohol/drug use (0=No, 1=Yes)
M33My sex life suffered due to my alcohol/drug use (0=No, 1=Yes)
M34Lost interst in activity due to my alcohol/drug use (0=No, 1=Yes)
M35Soc life> enjoyable when using alcohol/drug (0=No, 1=Yes)
M36Spirit/moral life harmed by alcohol/drug use (0=No, 1=Yes)
M37Not had kind life want due to alcohol/drug use (0=No, 1=Yes)
M38My alcohol/drug use in way of personal growth (0=No, 1=Yes)
M39My alcohol/drug use damaged soc life/reputat (0=No, 1=Yes)
M3Missed days wrk/sch because of alcohol/drug use (0=No, 1=Yes)
M40Spent/lost too much $ because alcohol/drug use (0=No, 1=Yes)
M41Arrested for DUI of alcohol or oth drugs (0=No, 1=Yes)
M42Arrested for offenses rel to alcohol/drug use (0=No, 1=Yes)
M43Lost marriage/love relat due to alcohol/drug use (0=No, 1=Yes)
M44Susp/fired/left job/sch due to alcohol/drug use (0=No, 1=Yes)
M45I used drugs moderately w/o having probs (0=No, 1=Yes)
M46I have lost a friend due to my alcohol/drug use (0=No, 1=Yes)
M47Had an accident while using alcohol/drugs (0=No, 1=Yes)
M48Physically hurt/injured/burned when using alcohol/drugs (0=No, 1=Yes)
M49I injured someone while using alcohol/drugs (0=No, 1=Yes)
M4Fam/frinds worry/compl about alcohol/drug use (0=No, 1=Yes)
M50Damaged things/prop when using alcohol/drugs (0=No, 1=Yes)
M5I have enjoyed drinking/using drugs (0=No, 1=Yes)
M6Qual of work suffered because of alcohol/drug use (0=No, 1=Yes)
M7Parenting ability harmed by alcohol/drug use (0=No, 1=Yes)
M8Trouble sleeping/nightmares aftr alcohol/drugs (0=No, 1=Yes)
M9Driven motor veh while undr inf alcohol/drugs (0=No, 1=Yes)
MAR_STATMarital status (recode) (0=Married, 1=Not married)
MCSStandardized mental component scale-00
MD_LANGLang prefer to speak to MD (recode) (0=English, 1=Other lang)
MHSF-36 mental health index (0-100)
MMSECMMSEC
N1AMy friends give me the moral support I need (0=No, 1=Yes)
N1BMost people closer to friends than I am (0=No, 1=Yes)
N1CMy friends enjoy hearing what I think (0=No, 1=Yes)
N1DI rely on my friends for emot support (0=No, 1=Yes)
N1EFriend go to when down w/o feel funny later (0=No, 1=Yes)
N1FFrnds and I open re what thnk about things (0=No, 1=Yes)
N1GMy friends sensitive to my pers needs (0=No, 1=Yes)
N1HMy friends good at helping me solve probs (0=No, 1=Yes)
N1Ihave deep sharing relat w/ a # of frnds (0=No, 1=Yes)
N1JWhen confide in frnds makes me uncomfort (0=No, 1=Yes)
N1KMy friends seek me out for companionship (0=No, 1=Yes)
N1LNot have as int relat w/frnds as others (0=No, 1=Yes)
N1MRecent good idea how to do somethng frm frnd (0=No, 1=Yes)
N1NI wish my friends were much different (0=No, 1=Yes)
N2AMy family gives me the moral support I need (0=No, 1=Yes)
N2BGood ideas of how do/make thngs from fam (0=No, 1=Yes)
N2CMost peop closer to their fam than I am (0=No, 1=Yes)
N2DWhen confide make close fam membs uncomf (0=No, 1=Yes)
N2EMy fam enjoys hearing about what I think (0=No, 1=Yes)
N2FMembs of my fam share many of my intrsts (0=No, 1=Yes)
N2GI rely on my fam for emot support (0=No, 1=Yes)
N2HFam memb go to when down w/o feel funny (0=No, 1=Yes)
N2IFam and I open about what thnk about thngs (0=No, 1=Yes)
N2JMy fam is sensitive to my personal needs (0=No, 1=Yes)
N2KFam memb good at helping me solve probs (0=No, 1=Yes)
N2LHave deep sharing relat w/# of fam membs (0=No, 1=Yes)
N2MMakes me uncomf to confide in fam membs (0=No, 1=Yes)
N2NI wish my family were much different (0=No, 1=Yes)
NUM_BARR# of perceived barriers to linkage
NUM_INTERVALSNumber of 6-month intervals from previous to current interview
O1A# people spend tx w/who drink alcohol (1=None, 2= A few, 3=About half, 4= Most, 5=All)
O1B_RECFamily/friends heavy drinkers y/n (0=No, 1=Yes)
O1B# people spend tx w/who are heavy drinkrs (1=None, 2= A few, 3=About half, 4= Most, 5=All)
O1C_RECFamily/friends use drugs y/n (0=No, 1=Yes)
O1C# people spend tx w/who use drugs (1=None, 2= A few, 3=About half, 4= Most, 5=All)
O1D_RECFamily/fiends support abst. y/n (0=No, 1=Yes)
O1D# peop spend tx w/who supprt your abstin (1=None, 2= A few, 3=About half, 4= Most, 5=All)
O2_RECLive-in partner drinks/drugs y/n (0=No, 1=Yes)
O2Does live-in part/spouse drink/use drugs (0=No, 1=Yes, 2=N/A)
P1APhysical abuse/assault by family members/person I know (0=No, 1=Yes, 7=Not sure)
P1BAge first physically assaulted by person I know
P1CPhysically assaulted by person I know-last 6 months (0=No, 1=Yes)
P2APhysical abuse/assault by stranger (0=No, 1=Yes, 7=Not sure)
P2BAge first physically assaulted by stranger
P2CPhysically assaulted by stranger-last 6 months (0=No, 1=Yes)
P3Using drugs/alcohol when physically assaulted (1=Don't know, 2=Never, 3=Some cases, 4=Most cases, 5=All cases, 9=Never assaulted)
P4Person who physically assaulted you was using alcohol/drugs (1=Don't know, 2=Never, 3=Some cases, 4=Most cases, 5=All cases, 9=Never assaulted)
P5ASexual abuse/assault by family member/person you know (0=No, 1= Yes, 7=Not sure)
P5BAge first sexually assaulted by person you know
P5CSexually assaulted by person you know-last 6 months (0=No, 1=Yes)
P6ASexual abuse/assault by stranger (0=No, 1=Yes, 7=Not sure)
P6BAge first sexually assaulted by stranger
P6CSexually assaulted by stranger-last 6 months (0=No, 1=Yes)
P7Using drugs/alcohol when sexually assaulted (1=Don't know, 2=Never, 3=Some cases, 4=Most cases, 5=All cases, 9=Never assaulted)
P8Person who sexually assaulted you using alcohol/drugs (1=Don't know, 2=Never, 3=Some cases, 4=Most cases, 5=All cases, 9=Never assaulted)
PCP_IDa numeric vector
PCSStandardized physical component scale-00
PC_REC7Primary cared received: Linked & # visits (0=Not linked, 1=Linked, 1 visit, 2=Linked, 2 visits, 3=Linked, 3 visits, 4=Linked, 4 visits, 5= Linked, 5 visits, 6=Linked, 6+visits)
PC_RECPrimary care received: Linked & # visits (0=Not linked, 1=Linked, 1 visit, 2=Linked, 2+ visits)
PFSF-36 physical functioning (0-100)
PHSXABUSAny abuse (0=No, 1=Yes)
PHYABUSEPhysical abuse-stranger or family (0=No, 1=Yes)
PHYS2InDUC-2L-Physical 9Raw (w/o M48)
PHYSInDUC-2L-Physical-Raw
POLYSUBPolysubstance abuser y/n (0=No, 1=Yes)
PREV_TIMEPrevious interview time
PRIMLANGFirst language (recode) (0=English, 1=Other lang)
PRIMSUB2First drug of choice (no marijuana) (0=None, 1=Alcohol, 2=Cocaine, 3=Heroin, 4=Barbituates, 5=Benzos, 6=Marijuana, 7=Methadone, 8=Opiates)
PRIM_SUBFirst drug of choice (0=None, 1=Alcohol, 2=Cocaine, 3=Heroin, 4=Barbituates, 5=Benzos, 6=Marijuana, 7=Methadone, 8=Opiates)
PSS_FAPerceived social support-family
PSS_FRPerceived social support-friends
Q10How would you describe yourself (0=Straight, 1=Gay/bisexual)
Q11# men had sex w/in past 6 months (0=0 men, 1=1 man, 2=2-3 men, 3=4+ men
Q12# women had sex w/in past 6 months (0=0 women, 1=1woman, 2=2-3 women, 3=4+ women
Q13# times had sex In past 6 months (0=Never, 1=Few times or less, 2=Few times/month, 3=Once or more/week)
Q14How often had sex to get drugs-last 6 months (0=Never, 1=Few times or less, 2=Few times/month, 3=Once or more/week)
Q15How often given drugs to have sex-last 6 months (0=Never, 1=Few times or less, 2=Few times/month, 3=Once or more/week)
Q16How often were you paid for sex-last 6 months (0=Never, 1=Few times or less, 2=Few times/month, 3=Once or more/week)
Q17How often you pay pers for sex-last 6 months (0=Never, 1=Few times or less, 2=Few times/month, 3=Once or more/week)
Q18How often use condoms during sex=last 6 months (0=No sex/always, 1=Most of the time, 2=Some of the time, 3=None of the time)
Q19Condoms are too much of a hassle to use (1=Strongly disagree, 2=Disagree, 3= Agree, 4=Strongly agree)
Q1AHave you ever injected drugs (0=No, 1=Yes)
Q1BHave you injected drugs-last 6 months (0=No, 1=Yes)
Q20Safer sex is always your responsibility (1=Strongly disagree, 2=Disagree, 3= Agree, 4=Strongly agree)
Q2Have you shared needles/works-last 6 months (0=No/Not shot up, 3=Yes)
Q3# people shared needles w/past 6 months (0=No/Not shot up, 1=1 other person, 2=2-3 diff people, 3=4/+ diff people)
Q4How often been to shoot gall/hse-last 6 months (0=Never, 1=Few times or less, 2= Few times/month, 3= Once or more/week)
Q5How often been to crack house-last 6 months (0=Never, 1=Few times or less, 2=Few times/month, 3=Once or more/week)
Q6How often shared rinse-water-last 6 months (0=Nevr/Not shot up, 1=Few times or less, 2=Few times/month, 3=Once or more/week)
Q7How often shared a cooker-last 6 months (0=Nevr/Not shot up, 1=Few times or less, 2=Few times/month, 3=Once or more/week)
Q8How often shared a cotton-last 6 months (0=Nevr/Not shot up, 1=Few times or less, 2=Few times/month, 3=Once or more/week)
Q9How often use syringe to div drugs-last 6 months (0=Nevr/Not shot up, 1=Few times or less, 2=Few times/month, 3=Once or more/week)
R1AI really want to change my alcohol/drug use (1=Strongly disagree, 2=Disagree, 3= Agree, 4=Strongly agree)
R1BSometimes I wonder if I'm an alcohol/addict (1=Strongly disagree, 2=Disagree, 3= Agree, 4=Strongly agree)
R1CId I don't change alcohol/drug probs will worsen (1=Strongly disagree, 2=Disagree, 3= Agree, 4=Strongly agree)
R1DI started making changes in alcohol/drug use (1=Strongly disagree, 2=Disagree, 3= Agree, 4=Strongly agree)
R1EWas using too much but managed to change (1=Strongly disagree, 2=Disagree, 3= Agree, 4=Strongly agree)
R1FI wonder if my alcohol/drug use hurting othrs (1=Strongly disagree, 2=Disagree, 3= Agree, 4=Strongly agree)
R1GI am a prob drinker or have drug prob (1=Strongly disagree, 2=Disagree, 3= Agree, 4=Strongly agree)
R1HAlready doing thngs to change alcohol/drug use (1=Strongly disagree, 2=Disagree, 3= Agree, 4=Strongly agree)
R1Ihave changed use-trying to not slip back (1=Strongly disagree, 2=Disagree, 3= Agree, 4=Strongly agree)
R1JI have a serious problem w/ alcohol/drugs (1=Strongly disagree, 2=Disagree, 3= Agree, 4=Strongly agree)
R1KI wonder if I'm in control of alcohol/drug use (1=Strongly disagree, 2=Disagree, 3= Agree, 4=Strongly agree)
R1LMy alcohol/drug use is causing a lot of harm (1=Strongly disagree, 2=Disagree, 3= Agree, 4=Strongly agree)
R1MActively cutting down/stopping alcohol/drug use (1=Strongly disagree, 2=Disagree, 3= Agree, 4=Strongly agree)
R1NWant help to not go back to alcohol/drugs (1=Strongly disagree, 2=Disagree, 3= Agree, 4=Strongly agree)
R1OI know that I have an alcohol/drug problem (1=Strongly disagree, 2=Disagree, 3= Agree, 4=Strongly agree)
R1PI wonder if I use alcohol/drugs too much (1=Strongly disagree, 2=Disagree, 3= Agree, 4=Strongly agree)
R1QI am an alcoholic or drug addict (1=Strongly disagree, 2=Disagree, 3= Agree, 4=Strongly agree)
R1RI am working hard to change alcohol/drug use (1=Strongly disagree, 2=Disagree, 3= Agree, 4=Strongly agree)
R1SSome changes-want help from going back (1=Strongly disagree, 2=Disagree, 3= Agree, 4=Strongly agree)
RABSCALERAB scale sore
RACE2Race (recode) (1=White, 2=Minority)
RACERace (recode) (1=Afr Amer/Black, 2=White, 3=Hispanic, 4=Other)
RAWBPRaw SF-36 pain index
RAWGHRaw SF-36 general health perceptions
RAWMHRaw SF-36 mental health index
RAWPFRaw SF-36 physical functioning
RAWRERaw SF-36 role-emotional
RAWRPRaw SF-36 role-physical
RAWSFRaw SF-36 social functioning
RAWVTRaw SF-36 vitality
RAW_ADSADS score
RAW_AMSOCRATES-Ambivalence-Raw
RAW_RESOCRATES-Recognition-Raw
RAW_TSSOCRATES-Taking steps-Raw
RCT_LINKDid subject link to primary care (RCT)–This time point (0=No, 1=Yes)
REALM2REALM score (dichotomous) (1=0-60, 2=61-66)
REALM3REALM score (categorical) (1=0-44), 2=45-60), 3=61-66)
REALMREALM score
REG_MDDid subject report having regular doctor–This time point (0=No, 1=Yes)
RESF-36 role-emotional (0-100)
RPSF-36 role physical (0-100)
S1AAt interview pt obviously depressed/withdrawn (0=No, 1=Yes)
S1Bat interview pt obviously hostile (0=No, 1=Yes)
S1CAt interview patientt obviously anxious/nervous (0=No, 1=Yes)
S1DTrouble with real tst/thght dis/par at interview (0=No, 1=Yes)
S1EAt interview pt trbl w/ compr/concen/rememb (0=No, 1=Yes)
S1FAt interview pt had suicidal thoughts (0=No, 1=Yes)
SATREATAny BSAS substance abuse this time point (0=No, 1=Yes)
SECD_SUBSecond drug of choice (0=None, 1=Alcohol, 3=Cocaine, 3=Heroine, 4=Barbituates, 5=Benzos, 6=Marijuana, 7=Methadone, 8=Opiates)
SER_INJRecent (6m) serious injury y/n (0=No, 1=Yes)
SEXABUSESexual abuse-stranger or family (0=No, 1=Yes)
SEXRISKRAB-Sex risk total
SFSF-36 social functioning (0-100)
SMOKERCurrent smoker (every/some days) y/n (0=No, 1=Yes)
SRInDUC-2L-Social responsibility-Raw
STD_6MHad an STD past 6m y/n (0=No, 1=Yes)
STD_EVEREver had an STD y/n (0=No, 1=Yes)
STRABUSEStranger abuse-physical or sexual (0=No, 1=Yes)
T1B# days in row continued to drink
T1CLongest period abstain-last 6 months (alcohol)
T1Have used alcohol since leaving River St. (0=No, 1=Yes)
T2B# days in row continued to use heroin
T2CLongest period abstain-last 6 months (heroin)
T2Have used heroin since leaving River St (0=No, 1=Yes)
T3B# days in row continued to use cocaine
T3CLongest period abstain-last 6 months (cocaine)
T3Have used cocaine since leaving River St (0=No, 1=Yes)
TIMEInterview time point
TOTALRABRAB-Total RAB sore
U10A# times been to regular MDs office-pst 6 months
U10B# times saw regular MD in office-pst 6 months
U10C# times saw oth prof in office-pst 6 months
U11Rate convenience of MD office location (1=Very poor, 2=Poor, 3=Fair, 4=Good, 5=Very good, 6=Excellent)
U12Rate hours MD office open for medical appointments (1=Very poor, 2=Poor, 3=Fair, 4=Good, 5=Very good, 6=Excellent)
U13Usual wait for appointment when sick (unscheduled) (1=Very poor, 2=Poor, 3=Fair, 4=Good, 5=Very good, 6=Excellent)
U14Time wait for appointment to start at MD office (1=Very poor, 2=Poor, 3=Fair, 4=Good, 5=Very good, 6=Excellent)
U15ADO you pay for any/all of MD visits (0=No, 1=Yes)
U15BHow rate amt of $ you pay for MD visits (1=Very poor, 2=Poor, 3=Fair, 4=Good, 5=Very good, 6=Excellent)
U16ADo you pay for any/all of prescript meds (0=No, 1=Yes)
U16BRate amt $ pay for meds/prescript trtmnts (1=Very poor, 2=Poor, 3=Fair, 4=Good, 5=Very good, 6=Excellent)
U17Ever skip meds/trtmnts because too expensive (1=Yes, often, 2=Yes, occasionally, 3=No, never)
U18AAbility to reach MC office by phone (1=Very poor, 2=Poor, 3=Fair, 4=Good, 5=Very good, 6=Excellent)
U18BAbility to speak to MD by phone if need (1=Very poor, 2=Poor, 3=Fair, 4=Good, 5=Very good, 6=Excellent)
U19How often see regular MD when have regular check-up (1=Always, 2=Almost always, 3=A lot of the time, 4=Some of the time, 5=Almost never, 6=Never)
U1It is important to have a regular MD (1=Strongly agree, 2=Agree, 3=Uncertain, 4=Disagree, 5=Strongly Disagree)
U20When sick + go to MD how often see regular MD (1=Always, 2=Almost always, 3=A lot of the time, 4=Some of the time, 5=Almost never, 6=Never)
U21AHow thorough MD exam to check health prb (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
U21BHow often question if MD diagnosis right (1=Always, 2=Almost always, 3=A lot of the time, 4=Some of the time, 5=Almost never, 6=Never)
U22AThoroughness of MD questions re symptoms (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
U22BAttn MD gives to what you have to say (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
U22CMD explanations of health problems/treatments need (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
U22DMD instructions re symptom report/further care (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
U22EMD advice in decisions about your care (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
U23How often leave MD office with unanswd quests (1=Always, 2=Almost always, 3=A lot of the time, 4=Some of the time, 5=Almost never, 6=Never)
U24AAmount of time your MD spends with you (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
U24BMDs patience w/ your questions/worries (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
U24CMDs friendliness and warmth toward you (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
U24DMDs caring and concern for you (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
U24EMDs respect for you (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
U25AReg MD ever talked to you about smoking (0=No, 1=Yes)
U25BReg MD ever talked to you about alcohol use (0=No, 1=Yes)
U25CReg MD ever talk to you about seat belt use (0=No, 1=Yes)
U25DReg MD ever talked to you about diet (0=No, 1=Yes)
U25EReg Mdever talked to you about exercise (0=No, 1=Yes)
U25FReg MD ever talked to you about stress (0=No, 1=Yes)
U25GReg MD ever talked to you about safe sex (0=No, 1=Yes)
U25HReg MD ever talked to you about drug use (0=No, 1=Yes)
U25IReg MD ever talked to you about HIV testing (0=No, 1=Yes)
U26ACut/quit smoking because of MDs advice (0=No, 1=Yes)
U26BTried to drink less alcohol because of MD advice (0=No, 1=Yes)
U26CWore my seat belt more because of MDs advice (0=No, 1=Yes)
U26DChanged diet because of MDs advice (0=No, 1=Yes)
U26EDone more exercise because MDs advice (0=No, 1=Yes)
U26FRelax/reduce stress because of MDs advice (0=No, 1=Yes)
U26GPracticed safer sex because of MDs advice (0=No, 1=Yes)
U26HTried to cut down/quit drugs because MD advice (0=No, 1=Yes)
"
U26IGot HIV tested because of MDs advice (0=No, 1=Yes)
"
U27AI can tell my MD anything (1=Strongly agree, 2= Agree, 3= Not sure, 4=Disagree, 5=Strongly disagree)
"
U27BMy MD pretends to know thngs if not sure (1=Strongly agree, 2= Agree, 3= Not sure, 4=Disagree, 5=Strongly disagree)
"
U27CI trust my MDs judgment re my med care (1=Strongly agree, 2= Agree, 3= Not sure, 4=Disagree, 5=Strongly disagree)
"
U27DMy MD cares > about < costs than my health (1=Strongly agree, 2= Agree, 3= Not sure, 4=Disagree, 5=Strongly disagree)
"
U27EMy MD always tell truth about my health (1=Strongly agree, 2= Agree, 3= Not sure, 4=Disagree, 5=Strongly disagree)
"
U27FMy MD cares as much as I about my health (1=Strongly agree, 2= Agree, 3= Not sure, 4=Disagree, 5=Strongly disagree)
"
U27GMy MD would try to hide a mistake in trtmt (1=Strongly agree, 2= Agree, 3= Not sure, 4=Disagree, 5=Strongly disagree)
"
U28How much to you trust this MD (0=Not at all, 1=1, 2=2, 3=3, 4=4, 5=5, 6=6, 7=7, 8=8, 9=9, 10=Completely)
"
U29AMDs knowledge of your entire med history (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
"
U29BMD knowledge of your response-home/work/sch (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
"
U29CMD knowledge of what worries you most-health (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
"
U29DMDs knowledge of you as a person (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
"
U2AI cannot pay for services (0=No, 1=Yes)
U2BI am not eligible for free care (0=No, 1=Yes)
U2CI do not know where to go (0=No, 1=Yes)
U2DCan't get services due to transport probs (0=No, 1=Yes)
U2EOffice/clinic hours are inconvenient (0=No, 1=Yes)
U2FI do not speak/understand English well (0=No, 1=Yes)
U2GAfraid others discover health prb I have (0=No, 1=Yes)
U2HMy substance abuse interferes (0=No, 1=Yes)
U2II do not have a babysitter (0=No, 1=Yes)
U2JI do not want to lose my job (0=No, 1=Yes)
U2KMy insurance does not cover services (0=No, 1=Yes)
U2LMedical care is not important to me (0=No, 1=Yes)
U2MI do not have time (0=No, 1=Yes)
U2NMed staff do not treat me with respect (0=No, 1=Yes)
U2OI do not trust my doctors or nurses (0=No, 1=Yes)
U2POften been unsatisfied w/my med care (0=No, 1=Yes)
U2Q_Ta factor with many levels
U2QOther reason hard to get regular med care (0=No, 1=Yes)
U2Ra factor with levels
7ABCDEFGHIJKLMNOPQU30MD would know what want done if unconscious (1=Strongly agree, 2=Agree, 3=Not sure, 4= Disagree, 5=Strongly disagree)
"
U31Oth MDs/RNs who play role in your care (0=No, 1=Yes)
" *
U32ATheir knowledge of you as a person (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
U32BThe quality of care they provide (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
U32CCoordination between them and your regular MD (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
U32D_TN/A, only my regular MD does this
U32DTheir explanation of your health prbs/trtmts need (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
U33Amt regular MD knows about care from others (1=Knows everything, 2=Knows almost everything, 3=Knows some things, 4=Knows very little, 5=Knows nothing)
U34Has MD ever recommended you see MD specialists (0=No, 1=Yes)
U35AHow helpful MD in deciding on specialist (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
U35BHow helpful MD getting appointment with specialist (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
U35CMDs involvement when you trtd by specialist (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
U35DMDs communication w/your specialists/oth MDs (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
U35EMD help in explain what specialists said (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
U35FQuality of specialists MD sent you to (1=Very poor, 2= Poor, 3=Fair, 4=Good, 5= Very good, 6= Excellent)
U36How many minutes to get to MDs office (1=<15, 2=16-30. 3=31-60, 4=More than 60)
U37When sick+call how long take to see you (1=Same day, 2=Next day, 3=In 2-3 days, 4=In 4-5 days, 5=in >5 days)
U38How many minutes late appointment usually begin (1=None, 2=<5 minutes, 3=6-10 minutes, 4=11-20 minutes, 5=21-30 minutes, 6=31-45 minutes, 7=>45 minutes)
U39How satisfied are you w/your regular MD (1=Completely satisfied, 2=Very satisfied, 3=Somewhat satisfied, 4=Neither, 5=Somewhat dissatisfied, 6=Very dissatisfied, 7=Completely dissatisfied)
U3AHas MD evr talked to you about drug use (0=No, 1=Yes)
U3BHas MD evr talked to you about alcohol use (0=No, 1=Yes)
U4Is there an MD you consider your regular MD (0=No, 1=Yes)
U5Have you seen any MDs in last 6 months (0=No, 1=Yes)
U6AWould you go to this MD if med prb not emergency (0=No, 1=Yes)
U6BThink one of these could be your regular MD (0=No, 1=Yes)
U7A_Ta factor with levels
ARTHRITIS DOCTORCHIROPRACTORCOCAINE STUDYDETOX DOCTORDOEAR DOCTOREAR SPECIALISTEAR, NOSE, & THROAT.EAR/NOSE/THROATENTFAMILY PHYSICIANGENERAL MEDICINEGENERAL PRACTICEGENERAL PRACTITIONERGENERAL PRACTITIONERHEAD & NECK SPECIALISTHERBAL/HOMEOPATHIC/ACUPUNCTUREID DOCTORMAYBE GENERAL PRACTITIONERMEDICAL STUDENTNEUROLOGISTNURSENURSE PRACTITIONERNURSE PRACTITIONERONCOLOGISTPRENATALPRIMARYPRIMARY CAREPRIMARY CAREPRIMARY CARE DOCTORPRIMARY CARETHERAPISTUROLOGISTWOMENS CLINIC BMCU7AWhat type of MD is your regular MD/this MD (1=OB/GYN, 2=Family medicine, 3=Pediatrician, 4=Adolescent medicine, 5=Internal medicine, 6=AIDS doctor, 7=Asthma doctor, 8=Pulmonary doctor, 9=Cardiologist, 10=Gastroen)
U8AOnly saw this person once (=Only saw once)
U8BSaw this person for < 6 months (1 = <6 months)
U8CSaw this person for 6 months - 1 year (2=Between 6 months & 1 year)
U8DSaw this person for 1-2 years (3 = 1-2 years)
U8ESaw this person for 3-5 years (4 = 3-5 years)
U8FSaw this person for more than 5 years (5 = >5 years)
UNEMPLOYUsually unemployed last 6 months (0=No, 1=Yes)
V1Ever needed to drink much more to get effect (0=No, 1=Yes)
V2Evr find alcohol had < effect than once did (0=No, 1=Yes)
VTSF-36 vitality 0-100)
Z1Breath Alcohol Concentration:1st test
Z2Breath Alcohol Concentration:2nd test
Details
Eligible subjects were adults, who spoke Spanish or English, reported alcohol, heroin or cocaine as their first or second drug of choice, resided in proximity to the primary care clinic to which they would be referred or were homeless. Patients with established primary care relationships they planned to continue, significant dementia, specific plans to leave the Boston area that would prevent research participation, failure to provide contact information for tracking purposes, or pregnancy were excluded.
Subjects were interviewed at baseline during their detoxification stay and follow-up interviews were undertaken every 6 months for 2 years. A variety of continuous, count, discrete, and survival time predictors and outcomes were collected at each of these five occasions.
This dataset is a superset of the HELPmiss and HELPrct datasets which include far fewer variables. Full details of the survey instruments are available at the following link.
Source
https://nhorton.people.amherst.edu/help/
References
Samet JH, Larson MJ, Horton NJ, Doyle K, Winter M, and Saitz R. Linking alcohol and drug-dependent adults to primary medical care: A randomized controlled trial of a multi-disciplinary health intervention in a detoxification unit. Addiction, 2003; 98(4):509-516.
See Also
Examples
data(HELPfull)
Health Evaluation and Linkage to Primary Care
Description
The HELP study was a clinical trial for adult inpatients recruited from a detoxification unit. Patients with no primary care physician were randomized to receive a multidisciplinary assessment and a brief motivational intervention or usual care, with the goal of linking them to primary medical care.
Usage
data(HELPmiss)
Format
Data frame with 470 observations on the following variables.
agesubject age at baseline (in years)
anysubuse of any substance post-detox: a factor with levels
noyescesdCenter for Epidemiologic Studies Depression measure of depressive symptoms at baseline (higher scores indicate more symptoms)
d1lifetime number of hospitalizations for medical problems (measured at baseline)
daysanysubtime (in days) to first use of any substance post-detox
dayslinktime (in days) to linkage to primary care
drugriskRisk Assessment Battery drug risk scale at baseline
e2bnumber of times in past 6 months entered a detox program (measured at baseline)
female0 for male, 1 for female
sexa factor with levels
malefemaleg1bexperienced serious thoughts of suicide in last 30 days (measured at baseline): a factor with levels
noyeshomelesshousing status: a factor with levels
housedhomelessi1average number of drinks (standard units) consumed per day, in the past 30 days (measured at baseline)
i2maximum number of drinks (standard units) consumed per day, in the past 30 days (measured at baseline)
avg_drinksaverage number of drinks (standard units) consumed per day, in the past 30 days (measured at baseline). Same as
i1.max_drinksmaximum number of drinks (standard units) consumed per day, in the past 30 days (measured at baseline). Same as
i2.idsubject identifier
indtotInventory of Drug Use Consequences (InDUC) total score (measured at baseline)
linkstatuspost-detox linkage to primary care (0 = no, 1 = yes)
linkpost-detox linkage to primary care:
noyesmcsSF-36 Mental Component Score (measured at baseline, higher scores are better)
pcsSF-36 Physical Component Score (measured at baseline, higher scores are better)
pss_frperceived social support by friends (measured at baseline)
racegrprace/ethnicity: levels
blackhispanicotherwhitesatreatany BSAS substance abuse treatment at baseline:
noyessexriskRisk Assessment Battery sex risk score (measured at baseline)
substanceprimary substance of abuse:
alcoholcocaineherointreatrandomized to HELP clinic:
noyes
Details
Eligible subjects were adults, who spoke Spanish or English, reported alcohol, heroin or cocaine as their first or second drug of choice, resided in proximity to the primary care clinic to which they would be referred or were homeless. Patients with established primary care relationships they planned to continue, significant dementia, specific plans to leave the Boston area that would prevent research participation, failure to provide contact information for tracking purposes, or pregnancy were excluded.
Subjects were interviewed at baseline during their detoxification stay and follow-up interviews were undertaken every 6 months for 2 years. A variety of continuous, count, discrete, and survival time predictors and outcomes were collected at each of these five occasions.
This dataset is a superset of the HELPrct data with 17 subjects with partially observed data on some of the baseline variables. This is a subset of the HELPfull data which includes 5 timepoints and many additional variables.
Source
https://nhorton.people.amherst.edu/help/
References
Samet JH, Larson MJ, Horton NJ, Doyle K, Winter M, and Saitz R. Linking alcohol and drug-dependent adults to primary medical care: A randomized controlled trial of a multi-disciplinary health intervention in a detoxification unit. Addiction, 2003; 98(4):509-516.
See Also
Examples
data(HELPmiss)
Health Evaluation and Linkage to Primary Care
Description
The HELP study was a clinical trial for adult inpatients recruited from a detoxification unit. Patients with no primary care physician were randomized to receive a multidisciplinary assessment and a brief motivational intervention or usual care, with the goal of linking them to primary medical care.
Usage
data(HELPrct)
Format
Data frame with 453 observations on the following variables.
agesubject age at baseline (in years)
anysubuse of any substance post-detox: a factor with levels
noyescesdCenter for Epidemiologic Studies Depression measure at baseline (high scores indicate more depressive symptoms)
d1lifetime number of hospitalizations for medical problems (measured at baseline)
hospitalizationslifetime number of hospitalizations for medical problems (measured at baseline)
daysanysubtime (in days) to first use of any substance post-detox
dayslinktime (in days) to linkage to primary care
drugriskRisk Assessment Battery drug risk scale at baseline
e2bnumber of times in past 6 months entered a detox program (measured at baseline)
female0 for male, 1 for female
sexa factor with levels
malefemaleg1bexperienced serious thoughts of suicide in last 30 days (measured at baseline): a factor with levels
noyeshomelesshousing status: a factor with levels
housedhomelessi1average number of drinks (standard units) consumed per day, in the past 30 days (measured at baseline)
i2maximum number of drinks (standard units) consumed per day, in the past 30 days (measured at baseline)
idsubject identifier
indtotInventory of Drug Use Consequences (InDUC) total score (measured at baseline)
linkstatuspost-detox linkage to primary care (0 = no, 1 = yes)
linkpost-detox linkage to primary care:
noyesmcsSF-36 Mental Component Score (measured at baseline, lower scores indicate worse status)
pcsSF-36 Physical Component Score (measured at baseline, lower scores indicate worse status)
pss_frperceived social support by friends (measured at baseline, higher scores indicate more support)
racegrprace/ethnicity: levels
blackhispanicotherwhitesatreatany BSAS substance abuse treatment at baseline:
noyessexriskRisk Assessment Battery sex risk score (measured at baseline)
substanceprimary substance of abuse:
alcoholcocaineherointreatrandomized to HELP clinic:
noyes
Details
Eligible subjects were adults, who spoke Spanish or English, reported alcohol, heroin or cocaine as their first or second drug of choice, resided in proximity to the primary care clinic to which they would be referred or were homeless. Patients with established primary care relationships they planned to continue, significant dementia, specific plans to leave the Boston area that would prevent research participation, failure to provide contact information for tracking purposes, or pregnancy were excluded.
Subjects were interviewed at baseline during their detoxification stay and follow-up interviews were undertaken every 6 months for 2 years. A variety of continuous, count, discrete, and survival time predictors and outcomes were collected at each of these five occasions.
This data set is a subset of the HELPmiss data set restricted to
the 453 subjects who were fully observed on the
age, cesd, d1,
female, sex, g1b, homeless,
i1, i2, indtot, mcs, pcs, pss_fr,
racegrp, satreat, substance, treat,
and sexrisk variables. (There is some missingness in the other variables.)
HELPmiss contains 17 additional subjects with
partially observed data on some of these baseline variables. This is
also a subset of the HELPfull data which includes 5 timepoints and
many additional variables.
Note
The \code{HELPrct} data set was originally named \code{HELP} but has
been renamed to avoid confusion with the \code{help} function.
Source
https://nhorton.people.amherst.edu/help/
References
Samet JH, Larson MJ, Horton NJ, Doyle K, Winter M, and Saitz R. Linking alcohol and drug-dependent adults to primary medical care: A randomized controlled trial of a multi-disciplinary health intervention in a detoxification unit. Addiction, 2003; 98(4):509-516.
See Also
Examples
data(HELPrct)
Data from a heat exchanger laboratory
Description
These data were collected by engineering students at Calvin College. The apparatus consists of concentric pipes insulated from the environment so that as nearly as can be managed the only heat exchange is between the hot and cold water.
Usage
data(HeatX)
Format
A data frame with 6 observations on the following variables.
trialtrial number
T.cold.intemperature (C) of the cold water as it enters the apparatus
T.cold.outtemperature (C) of the cold water as it leaves the apparatus
m.coldflow rate (L/min) of the cold water
T.hot.intemperature (C) of the hot water as it enters the apparatus
T.hot.outtemperature (C) of the hot water as it leaves the apparatus
m.hotflow rate (L/min) of the hot water
Examples
# We can test for heat exchange with the environment by checking to see if the
# heat gained by the cold water matches the heat lost by the hot water.
C_p <- 4.182 / 60 # / 60 because measuring m in L/min
HeatX2 <-
dplyr::mutate(HeatX,
Q.cold = m.cold * C_p * (T.cold.out - T.cold.in),
Q.hot = m.hot * C_p * (T.hot.out- T.hot.in),
Q.env = Q.cold + Q.hot
)
if (require(ggformula)) {
gf_jitter( "" ~ Q.env, data = HeatX2, alpha = 0.6, size = 4,
width = 0, height = 0.1, seed = 123) |>
gf_labs(y = "")
}
if (require(mosaic)) {
t.test( ~Q.env, data = HeatX2 )
}
Foot measurements in children
Description
These data were collected by a statistician, Mary C. Meyer, in a fourth grade classroom in Ann Arbor, MI, in October 1997. They are a convenience sample — the kids who were in the fourth grade.
Usage
data(KidsFeet)
Format
A data frame with 39 observations on the following variables.
namea factor with levels corresponding to the name of each child
birthmonththe month of birth
birthyearthe year of birth
lengthlength of longer foot (in cm)
widthwidth of longer foot (in cm)
sexa factor with levels
BGbiggerfoota factor with levels
LRdomhanda factor with levels
LR
Details
Quoted from the source: "From a very young age, shoes for boys tend to be wider than shoes for girls. Is this because boys have wider feet, or because it is assumed that girls, even in elementary school, are willing to sacrifice comfort for fashion? To assess the former, a statistician measures kids' feet."
References
Mary C. Meyer (2006) "Wider Shoes for Wider Feet?" Journal of Statistics Education 14(1), http://jse.amstat.org/v14n1/datasets.meyer.html.
Examples
data(KidsFeet)
Marriage records
Description
Marriage records from the Mobile County, Alabama, probate court.
Usage
data(Marriage)
Format
A data frame with 98 observations on the following variables.
bookpageIDa factor with levels for each book and page (unique identifier)
appdatedate on which the application was filed
ceremonydatedate of the ceremony
delaynumber of days between the application and the ceremony
officialTitlea factor with levels
BISHOPCATHOLIC PRIESTCHIEF CLERKCIRCUIT JUDGEELDERMARRIAGE OFFICIALMINISTERPASTORREVERENDpersona factor with levels
BrideGroomdoba factor with levels corresponding to the date of birth of the person
ageage of the person (in years)
racea factor with levels
American IndianBlackHispanicWhiteprevcountthe number of previous marriages of the person, as listed on the application
prevconcthe way the last marriage ended, as listed on the application
hsthe number of years of high school education, as listed on the application
collegethe number of years College education, as listed on the application. Where no number was listed, this field was left blank, unless less than 12 years High School was reported, in which case it was entered as 0.
dayOfBirththe day of birth, as a number from 1 to 365 counting from January 1
signthe astrological sign, with levels
AquariusAriesCancerCapricornGeminiLeoLibraPiscesSagittariusScorpioTaurusVirgo
Details
The calculation of the astrological sign may not correctly sort people directly on the borders between signs. This variable is not part of the original record.
Source
The records were collected through http://www.mobilecounty.org/probatecourt/recordssearch.htm
Examples
data(Marriage)
Mites and Wilt Disease
Description
Data from an experiment to test whether exposure to mites protects against Wilt Disease in cotton plants.
Usage
data(Mites)
Format
A data frame with 47 observations on the following variables.
treatmenta factor with levels
mitesandno mitesoutcomea factor with levels
wiltandno wilt
Details
Researchers suspected that attack of a plant by one organism induced resistance to subsequent attack by a different organism. Individually potted cotton plants were randomly allocated to two groups: infestation by spider mites or no infestation. After two weeks the mites were dutifully removed by a conscientious research assistant, and both groups were inoculated with Verticillium, a fungus that causes Wilt disease. More information can be found at https://www.causeweb.org/cause/webinar/activity/2010-01/.
Source
Statistics for the Life Sciences, Third Edition; Myra Samuels & Jeffrey Witmer (2003), page 409.
Examples
data(Mites)
if (require(mosaic)) {
tally(~ treatment + outcome, data=Mites)
tally(~ outcome | treatment, format="percent", data=Mites)
}
Volume of Users of a Rail Trail
Description
The Pioneer Valley Planning Commission (PVPC) collected data north of Chestnut Street in Florence, MA for ninety days from April 5, 2005 to November 15, 2005. Data collectors set up a laser sensor, with breaks in the laser beam recording when a rail-trail user passed the data collection station.
Usage
data(RailTrail)
Format
A data frame with 90 observations on the following variables.
hightempdaily high temperature (in degrees Fahrenheit)
lowtempdaily low temperature (in degrees Fahrenheit)
avgtempaverage of daily low and daily high temperature (in degrees Fahrenheit)
springindicator of whether the season was Spring
summerindicator of whether the season was Summer
fallindicator of whether the season was Fall
cloudcovermeasure of cloud cover (in oktas)
precipmeasure of precipitation (in inches)
volumeestimated number of trail users that day (number of breaks recorded)
weekdaylogical indicator of whether the day was a non-holiday weekday
dayTypeone of "weekday" or "weekend"
Details
There is a potential for error when two users trigger the infrared beam at exactly the same time since the counter would only logs one of the crossings. The collectors left the motion detector out during the winter, but because the counter drops data when the temperature falls below 14 degrees Fahrenheit, there is no data for the cold winter months.
Source
Pioneer Valley Planning Commission
References
http://www.fvgreenway.org/pdfs/Northampton-Bikepath-Volume-Counts%20_05_LTA.pdf
Examples
data(RailTrail)
Volume of Users of a Massachusetts Rail Trail
Description
The Pioneer Valley Planning Commission (PVPC) collected data north of Chestnut Street in Florence, MA for ninety days from April 5, 2005 to November 15, 2005. Data collectors set up a laser sensor, with breaks in the laser beam recording when a rail-trail user passed the data collection station.
Usage
data(Riders)
Format
A data frame with 90 observations on the following 12 variables.
datedate of data collection (POSIXct)
daya factor with levels
Monday,Tuesday,Wednesday,Thursday,Friday,Saturday, andSunday.highThigh temperature for the day (in degrees Fahrenheit)
lowTlow temperature for the day (in degrees Fahrenheit)
hishorter name for
highTloshorter name for
lowTprecipinches of precipitation
cloudsmeasure of cloud cover (in oktas)
ridersestimated number of trail crossings that day (number of breaks recorded)
ctshorter name for
ridersweekdaytype of day: a factor with levels
N(weekend or holiday)Y(non-holiday weekday)wdayshorter name for
weekday
Details
There is a potential for error when two users trigger the infrared beam at exactly the same time since the counter would only logs one of the crossings. The collectors left the motion detector out during the winter, but because the counter drops data when the temperature falls below 14 degrees Fahrenheit, there are no data for the coldest winter months.
Source
Pioneer Valley Planning Commission, http://www.fvgreenway.org/pdfs/Northampton-Bikepath-Volume-Counts%20_05_LTA.pdf
References
"Rail trails and property values: Is there an association?", Nicholas J. Horton and Ella Hartenian (Journal of Statistics Education, 2015), http://www.amstat.org/publications/jse/v23n2/horton.pdf
Examples
data(Riders)
str(Riders)
State by State SAT data
Description
SAT data assembled for a statistics education journal article on the link between SAT scores and measures of educational expenditures
Usage
data(SAT)
Format
A data frame with 50 observations on the following variables.
statea factor with names of each state
expendexpenditure per pupil in average daily attendance in public elementary and secondary schools, 1994-95 (in thousands of US dollars)
ratioaverage pupil/teacher ratio in public elementary and secondary schools, Fall 1994
salaryestimated average annual salary of teachers in public elementary and secondary schools, 1994-95 (in thousands of US dollars)
fracpercentage of all eligible students taking the SAT, 1994-95
verbalaverage verbal SAT score, 1994-95
mathaverage math SAT score, 1994-95
sataverage total SAT score, 1994-95
Source
http://www.amstat.org/publications/jse/secure/v7n2/datasets.guber.cfm
References
Deborah Lynn Guber, "Getting what you pay for: the debate over equity in public school expenditures" (1999), Journal of Statistics Education 7(2).
Examples
data(SAT)
if (require(ggformula)) {
gf_point(sat ~ expend, data = SAT, color = "blue", alpha = 0.5) |>
gf_lm()
gf_text(sat ~ expend, data = SAT, label = ~ abbreviate(SAT$state, 3),
inherit = FALSE)
}
Houses in Saratoga County (2006)
Description
Data on houses in Saratoga County, New York, USA in 2006
Usage
data(SaratogaHouses)
Format
A data frame with 1728 observations on the following 16 variables.
priceprice (US dollars)
lotSizesize of lot (acres)
ageage of house (years)
landValuevalue of land (US dollars)
livingArealiving are (square feet)
pctCollegepercent of neighborhood that graduated college
bedroomsnumber of bedrooms
fireplacesnumber of fireplaces
bathroomsnumber of bathrooms (half bathrooms have no shower or tub)
roomsnumber of rooms
heatingtype of heating system
fuelfuel used for heating
sewertype of sewer system
waterfrontwhether property includes waterfront
newConstructionwhether the property is a new construction
centralAirwhether the house has central air
Source
Data collected by Candice Corvetti and used in the "Stat 101" case study "How much is a Fireplace Worth". See also https://www.saratogacountyny.gov/departments/real-property-tax-service-agency/
Snowfall data for Grand Rapids, MI
Description
Official snowfall data by month and season for Grand Rapids, MI, going back to 1893.
Usage
data(SnowGR)
Format
A data frame with 119 observations of the following variables.
SeasonStartYear in which season started (July is start of season)
SeasonEndYear in which season ended (June is end of season)
JulInches of snow in July
AugInches of snow in August
SepInches of snow in September
OctInches of snow in October
NovInches of snow in November
DecInches of snow in December
JanInches of snow in January
FebInches of snow in February
MarInches of snow in March
AprInches of snow in April
MayInches of snow in May
JunInches of snow in June
TotalInches of snow for entire season (July-June)
Source
These data were compiled by Laura Kapitula from data available from NOAA. The original URL used (http://www.crh.noaa.gov/grr/climate/data/grr/snowfall/) is no longer in service.
Examples
data(SnowGR)
if (require(ggformula)) {
df_stats(~ Total, data = SnowGR)
gf_histogram( ~ Total, data = SnowGR)
gf_point(Total ~ SeasonStart, data = SnowGR) |>
gf_smooth()
if (require(tidyr) && require(dplyr)) {
Snow2 <-
SnowGR |>
pivot_longer(Jul:Total, names_to = "month", values_to = "snowfall") |>
filter(month != "Total") |>
mutate(month = factor(month, levels = unique(month)))
gf_violin(snowfall ~ month, data = Snow2, scale = "width")
}
}
100 m Swimming World Records
Description
World records for men and women over time from 1905 through 2004.
Usage
data(SwimRecords)
Format
A data frame with 62 observations of the following variables.
timetime (in seconds) of the world record
yearYear in which the record was set
sexa factor with levels
MandF
Examples
data(SwimRecords)
if (require(ggformula)) {
gf_point(time ~ year, data = SwimRecords, color = ~ sex)
}
Cherry Blossom Race
Description
The Cherry Blossom 10 Mile Run is a road race held in Washington, D.C. in April each year. (The name comes from the famous cherry trees that are in bloom in April in Washington.) The results of this race are published. This data frame contains the results from the 2005 race.
Usage
data(TenMileRace)
Format
A data frame with 8636 observations on the following variables.
stateState of residence of runner.
timeOfficial time from starting gun to finish line.
netThe recorded time (in seconds) from when the runner crossed the starting line to when the runner crossed the finish line. This is generally less than the official time because of the large number of runners in the race: it takes time to reach the starting line after the gun has gone off.
ageAge of runner in years.
sexA factor with levels
FM.
Examples
data(TenMileRace)
if (require(ggformula)) {
gf_point(net ~ age | sex, data = TenMileRace, color = ~sex, alpha = 0.1) |>
gf_density2d(color = "gray40")
lm(net ~ age + sex, data = TenMileRace)
}
Utility bills
Description
Data from utility bills at a residence.
Utilities2 is a similar data set with some additional variables.
Usage
data(Utilities)
Format
A data frame containing 117 observations for the following variables.
monthmonth (coded as a number)
dayday of month on which bill was calculated
yearyear of bill
tempaverage temperature (F) for billing period
kwhelectricity usage (kwh)
ccfgas usage (ccf)
thermsPerDaya numeric vector
billingDaysnumber of billing days in billing period
totalbilltotal bill (in dollars)
gasbillgas bill (in dollars)
elecbillelectric bill (in dollars)
notesnotes about the billing period
Source
Daniel T. Kaplan, Statistical modeling: A fresh approach, 2009.
See Also
Examples
data(Utilities)
if (require(ggformula)) {
gf_point(gasbill ~ temp, data = Utilities)
}
Utility bills
Description
Data from utility bills at a private residence. This is an augmented version
of Utilities.
Usage
data(Utilities2)
Format
A data frame containing 117 observations for the following variables.
monthmonth (coded as a number)
dayday of month on which bill was calculated
yearyear of bill
tempaverage temperature (F) for billing period
kwhelectricity usage (kwh)
ccfgas usage (ccf)
thermsPerDaya numeric vector
billingDaysnumber of billing days in billing period
totalbilltotal bill (in dollars)
gasbillgas bill (in dollars)
elecbillelectric bill (in dollars)
notesnotes about the billing period
ccfpdayaverage gas usage per day (
Utilities2only)kwhpdayaverage electric usage per day (
Utilities2only)gasbillpdaygas bill divided by billing days (
Utilities2only)elecbillpdayelectric bill divided by billing days a numeric vector (
Utilities2only)totalbillpdaytotal bill divided by billing days a numeric vector (
Utilities2only)thermsthermsPerDay * billingDays(Utilities2only)monthsSinceY2Kmonths since 2000 (
Utilities2only)
Source
Daniel T. Kaplan, Statistical modeling: A fresh approach, 2009.
See Also
Examples
data(Utilities2)
if (require(ggformula)) {
gf_point(gasbillpday ~ temp, data = Utilities2)
}
Weather
Description
2016-17 weather in several cities
Usage
data(Weather)
Format
A data frame with weather-related variables for several world cities.
- city
City name.
- date
Date.
- year
Numeric year.
- month
Numeric month.
- day
Numeric day.
- high_temp, avg_temp, low_temp
High, average, and low temperature for the day in degrees F.
- high_dewpt, avg_dewpt, low_dewpt
High, average, and low dew point for the day in degrees F.
- high_humidity, avg_humidity, low_humidity
High, average, and low relative humidity.
- high_hg, avg_hg, low_hg
High, average, and low sea level pressure in inches of mercury.
- high_vis, avg_vis, low_vis
High, average, and low visability for the day in miles.
- high_wind, avg_wind, low_wind
High, average, and low wind speed for the day in mph.
- precip
Precipitation for the day – a character vale;
Tmeans "trace amount".- events
Character string naming weather events on the day (Rain, Fog, Snow, etc.)
Source
These data were downloaded from WeatherUnderground in January 2018.
Examples
if (require(dplyr)) {
Weather |>
group_by(city, year) |>
summarise(
min_temp = min(low_temp),
max_temp = max(high_temp)
)
}
if (require(ggformula)) {
Weather |>
gf_linerange(low_temp + high_temp ~ date | city ~ .,
color = ~ (high_temp + low_temp) / 2, show.legend = FALSE) |>
gf_refine(scale_color_gradientn(colors = rev(rainbow(5))))
}
Data from the Whickham survey
Description
Data on age, smoking, and mortality from a one-in-six survey of the electoral roll in Whickham, a mixed urban and rural district near Newcastle upon Tyne, in the UK. The survey was conducted in 1972-1974 to study heart disease and thyroid disease. A follow-up on those in the survey was conducted twenty years later.
Usage
data(Whickham)
Format
A data frame with 1314 observations on women for the following variables.
outcomesurvival status after 20 years: a factor with levels
AliveDeadsmokersmoking status at baseline: a factor with levels
NoYesageage (in years) at the time of the first survey
Details
This dataset contains a subset of the survey sample: women who were classified as current smokers or as never having smoked. The data were synthesized from the summary description tables given in the Appleton et al al paper.
References
DR Appleton, JM French, MPJ Vanderpump. "Ignoring a covariate: an example of Simpson's paradox". (1996) American Statistician, 50(4):340-341.
Examples
data(Whickham)