*NURSING > EXAM > WGU C955 - Module 6: Correlation & Regression Questions And Answers 2022/2023 (All)
A researcher wants to know if there was a relationship between student age and desire to complete a college degree. To answer this question, the researcher used a local community college as the sampli... ng frame and then used stratified sampling to get a sample of students from 18 to 80 years old. Based on the information given, is there any potential bias in this study? a Yes, because the sampling frame does not match the intended population of the question. b Yes, because the sampling method will not give a representative sample. c Yes, because a voluntary sample should have been used. d No, there will likely be no bias in this study. Yes, because the sampling frame does not match the intended population of the question. A researcher conducts an observational study and finds a correlation between managers' income and the number of college credits earned. The correlation coefficient was r=.85 with a regression equation of y=515x+12000. What can you say about the relationship between these variables? a There is an association between these variables b There is a causation between these variables. c There is no relationship between these variables. d There is not enough information to determine the relationship between the variables. There is an association between these variables What technique is used to estimate the profit margin for a production level of 25 thousand units, if a line of best fit is created to estimate profit margin for production levels between 20 to 39 thousand units? a Interpolation b Linearization c Extrapolation d Internalization Interpolation What is the most appropriate definition of a scatterplot? a A graph that uses dots to demonstrate relationships between two categorical variables. b A graph where lines are shown to represent positive or negative trends. c A graph where the explanatory and response variables are plotted as ordered pairs. d A graph where a positive causation is always represented with dots forming a straight line. A graph where the explanatory and response variables are plotted as ordered pairs. What does a strong positive correlation between two variables suggest? a The explanatory variable is increasing and the response variable is decreasing b There is an association between the variables c There is a causation between the variables d Cannot determine There is an association between the variables {{ Scatterplot of distance from city center vs. rent. The linear equation for this scatterplot is y equals negative one hundred five ten thousandths times x plus ten and three hundred eighty one thousandths. }} Using the scatterplot Distance from City Center vs. Monthly Rent, how far can we expect an apartment to be from the city center if the monthly rent is $ 980 ? Round your answer to the nearest mile. a Around 54 miles b Around 55 miles c Around 56 miles d Around 57 miles Around 55 miles Which correlation coefficient suggests the weakest correlation? a −0.9 b −0.3 c 0.1 d 1 0.1 What does it mean for a result or relationship to be statistically significant? a The relationship is not caused by mere chance. b The relationship is caused by chance. c Your hypothesis test has failed. d Your significance level is not high enough. The relationship is not caused by mere chance. What is the process used to create the equation for the line of best fit? a Completing the square b Least squares estimation c Fitting the line to the curve d Linear approximation Least squares estimation A relationship between two or more variables is known as a(n) ________. a Causation b Association c Cause and effect d Correlation Association Using a line of best fit in slope-intercept form, y=mx+b , what must be true if there is a positive correlation? a m must be >0 . b b must be >0 . c Both m and b must be >0 . d Both m and b must have the same sign. m must be >0. Consider the following equation, y=−3.2x+2.8 . What is the y -intercept for this equation? a −3.2 b 3.2 c 2.8 d −2.8 2.8 When linear regression is used to show there is a linear association between two variables, we know the relationship is: a A causation b A correlation c Neither A nor B d Both A and B A correlation A restaurant owner wants to see if she can use low temperatures to boost soup sales at her restaurant. To study a possible relationship between temperature and soup sales, she collects data throughout the year on the temperature of a given day (ranging between 20 degrees F and 90 degrees F) and the amount of soup sold that day. She performs a linear regression and comes up with a least squares regression line of y=−1.64x+176.6 with r=−0.89 where x is the temperature (in degrees F) and y is the number of daily soup sales. How much soup should she expect to sell on a day that is 50 degrees F? Round to the nearest integer. a 177 orders of soup b 95 orders of soup c 259 orders of soup d 101 orders of soup 95 orders of soup The line of best fit is also known as: a The extrapolation line b The regression line c The interpolation line d None of the above The regression line If a trend appears in a large sample of data, the trend may not be replicated if the sample is broken up into smaller subsets. What is this effect known as? a Foster's Theorem b Bayes' Theorem c Simpson's Paradox d Consistency Construct Simpson's Paradox A variable not included in the study that is related to the measured variables in a study is called a ____________. a Independent variable b Confounding variable c Dependent variable d Lurking variable Lurking variable Using the scatterplot below, what type of correlation is suggested? Scatterplot displaying data points distributed from lower left to upper right. Points are relatively close to one another. a No correlation b Weak negative positive c Moderate positive d Strong negative Moderate positive What must be true about the dots on a scatterplot if there is no correlation? a The dots are far away from the line of best fit. b The dots form no recognizable linear pattern. c The dots are evenly spaced around the graph. d The dots form a non-linear pattern. The dots form no recognizable linear pattern. What is (are) a potential problem(s) that can occur when attempting to use regression analysis? a Extrapolation b Lurking variables c Inappropriate sampling d A, B, and C A, B, and C A researcher conducts an experimental study and finds a correlation between salary and levels of experience. The correlation coefficient was r=.75 with a regression equation of y=515x+17500 . What can you say about the relationship between these variables? a There is an association between these variables b There is a causation between these variables. c There is no relationship between these variables d There is not enough information to determine the relationship between the variables. There is a causation between these variables A researcher wants to know if there was a relationship between executive general managers' income and the number of college credits earned. To answer this question, the researcher used a cluster sample to randomly choose 10 states across the United States; then 10 random counties were chosen; then within those 100 total counties, 4 businesses were randomly chosen; all executive general managers working within these 400 total businesses, were then invited to participate in the study. Based on the information given, is there any potential bias in this study? a Yes, because the sampling frame does not match the intended population of the question b Yes, because the sampling method will not give a representative sample. c Yes, because a voluntary sample should have been used. d No, there will likely be no bias in this study. No, there will likely be no bias in this study. Which of the following statements is always true? a If there is causation, there must also be association. b If there is association, there must also be causation. c Association is a stronger relationship than causation. d Cause and effect can never exist where there is an association. If there is causation, there must also be association. Inappropriate sampling can occur during regression analysis. What is (are) an example(s) of inappropriate sampling? a Non-proportional sampling b Small sample sizes c Non-random exclusion of a population subset in the sample d A, B, and C A, B, and C Using the scatterplot below, what type of correlation is suggested? {{ Scatterplot displaying data points distributed along a line from upper left to lower right. The points fall relatively close to the line. }} a No correlation b Strong positive c Moderate negative d Moderate positive Moderate negative The following regression equation estimates total profit ($, measured in 1000s) based on x units produced (in 1000s) with data that was gathered from x=5 thousand units to x=35 thousand units . y=28.07+6.49x Determine the total profit (round to the nearest thousand) for a production level ( x ) of 25 thousand units. Round your answer to the nearest whole number. a $190,000 b $864,000 c $708,000 d $59,000 $190,000 What does a weak, negative correlation look like on a scatterplot? a The points follow closely along a line that moves down and to the right. b The points loosely follow a line that moves down and to the right. c The points follow closely along a line that moves up and to the right. d None of the above. The points loosely follow a line that moves down and to the right. Which of the following correlation coefficients describes a strong, positive correlation? a −0.99 b 0.69 c 0.95 d 0.21 0.95 What should be true about the dots on a scatterplot once a line of best fit is drawn on the graph? a The y should form a perfect line. b The distance each dot is from the line is equal. c The dot furthest from the line is always an outlier. d There should be approximately the same number of dots above and below the line. There should be approximately the same number of dots above and below the line. Consider the following equation, y+8x=−2 . What is the y -intercept of a line with this equation? a −2 b 2 c 8 d −8 -2 When constructing a line of best fit, what must be minimized? a The vertical distances between that line and the data points. b The length of the line. c The correlation coefficient. d The p-value The vertical distances between that line and the data points. Which correlation coefficient suggests the strongest correlation? a −0.9 b −0.3 c 0.1 d 0.8 -0.9 The following table shows the performance of two airlines in two different cities. Is there a Simpson's Paradox occurring? Airline A Delayed Flights Airline A % Delayed Airline B Delayed Flights Airline B % Delayed Total Delayed Total % Delayed Los Angeles 62/559 11.10% 460/3450 13.30% 522/4009 13% San Diego 46/396 11.60% 30/221 13.60% 76/617 12.30% a No, because an equal number of flights departed from each city. b No, because it's clear from the data that Los Angeles has a higher rate of delayed flights. c Yes, because the delayed flight rates were different for each city. d Yes, because while Los Angeles has a greater overal rate of delayed flights, San Diego has a greater rate of delayed flights when looking at the individual airlines. Yes, because while Los Angeles has a greater overal rate of delayed flights, San Diego has a greater rate of delayed flights when looking at the individual airlines. A restaurant owner wants to see if she can use low temperatures to boost soup sales at her restaurant. To study a possible relationship between temperature and soup sales, she collects data throughout the year on the temperature of a given day (ranging between 20 degrees F and 90 degrees F) and the amount of soup sold that day. She performs a linear regression and comes up with a least squares regression line of y=−1.64x+176.6 with r=−0.89 where x is the temperature (in degrees F) and y is the number of daily soup sales. What is the correct interpretation of the slope of the regression line? a For every increase in one degree Fahrenheit, there is a corresponding increase of 1.64 sales of soup. b For every increase of one degree Fahrenheit, there is a corresponding increase of 176.6 sales of soup. c For every increase in one degree Fahrenheit, there is a corresponding decrease of 1.64 sales of soup. d For every increase of one degree Fahrenheit, there is a corresponding decrease of 176.6 sales of soup. For every increase in one degree Fahrenheit, there is a corresponding decrease of 1.64 sales of soup. A study is done to determine whether or not age determines salary level. The subject also records their years of experience and level of education. What is the response variable in this study? a Age b Education level c Years of experience d Salary Salary {{ Scatterplot of advertising dollars spent vs. percent increase in sales. The linear equation for this scatterplot is y equals zero point four five four six times x plus sixty-four and one hundred thirty-three thousandths. }} Using the scatterplot above, what can we expect the percent increase in sales to be for an advertising expenditure of $ 165 thousand dollars? Round your answer to the nearest whole number. a Around 121 % b Around 139 % c Around 161 % d Around 198 % Around 139 % Using a line of best fit in slope-intercept form, y=mx+b , what must be true if there is a negative correlation? a m must be >0 . b m must be <0 . c Both m and b must be <0 . d Both m and b must have the opposite signs. m must be <0 Using the scatterplot below, estimate a possible correlation coefficient. {{ Scatterplot displaying data points distributed along a line from lower right to upper left. The points fall very close to the line. }} a 0.9 b 0.6 c −0.85 d −1 -0.85 A study is done relating computer programming aptitude to typing speed. In this case, what type of variable would the amount of computer programming experience be considered, if it was not measured in the study? a Independent variable b Confounding variable c Dependent variable d Lurking variable Lurking variable Which of the following statements is not a causal relationship? a The higher the temperature in the oven, the faster the food will cook. b The more miles a car is driven, the more fuel is consumed. c The time of day determines when the sun will rise. d The faster a runner goes, the shorter time it will take to complete the race. The time of day determines when the sun will rise. {{ Percentage of people age 12 and older who watched a movie in the past month after viewing an ad for it targeted to their age group. For all ages, seven point five percent of the population watched a movie, for men, five point five percent watched a movie, nine point five percent of women watched a movie. For ages twelve to seventeen, five point five percent of the population watched a movie, four percent of men watched a movie and seven point five percent of women watched a movie. For ages eighteen to thirty-nine, seven point five percent of the population, five point five percent of men were depressed and nine percent of women watched a movie. For people between the ages of forty and fifty-nine, nine point five percent of the population watched a movie. Seven percent of men were depressed and twelve percent of women in that age bracket watched a movie. Of the people who were sixty years or over, five percent of the population watched a movie, three and a half percent of men watched a movie and seven percent of women watched a movie. }} You work in marketing at an independent movie studio and are measuring the effect of targeted ads by gender and age. Based on the results shown, what do you think the relationship is between age and ad effectiveness? a) It is positively correlated b) It is negatively correlated c) No correlation d) Can't tell from this display Can't tell from this display A car company wanted to study the relationship between the weight of the car and the car's average gas mileage, so they collected data from several of their cars and wrote down their weights and average gas mileage and plotted this data on the scatterplot below. The least squares regression line for the data is y=−0.0084x+48.8 , where x is the weight of the car in pounds and y is the average gas mileage (in miles per gallon). What is the predicted gas mileage for a car that weighs 2500 pounds? {{ Scatterplot illustrating Average Miles per Gallon and Weight of car in pounds. }} a) 29.18 miles per gallon b) 26.4 miles per gallon c) 27.8 miles per gallon d) 69.8 miles per gallon 27.8 miles per gallon Which of the following can help prevent Simpson's Paradox from occurring? a) Having the greatest number of subjects in the lowest performing trial. b) Having an equal number of subjects exposed to each of the treatments in each trial. c) Having the greatest number of subjects in the highest performing trial. d) Having each subject be exposed to each treatment in the trial. Having an equal number of subjects exposed to each of the treatments in each trial. {{ Scatterplot showing two data points, one at open parens two, three point seven five close parens and open parens four, six point five close parens. }} Due to budget cuts, a computer scientist had her funding reduced for a research project. Her support only enabled her to collect 2 data points, (2,3.26) and (4,6.52) . The data points are plotted above. What is your estimate of the correlation coefficient? a) −1 b) Zero c) 1 d) Cannot be determined 1 {{ Scatterplot showing the relationship between productivity and sick days. The points move loosely up and to the right. }} To better understand employee burn-out, Kinetic Inc. is looking at the relationship between the productivity of its sales force (measured in the number of cold calls per work day, averaged across the work year) and the number of sick days taken in a year. The scatterplot shows the results of their data gathered from company records. What trend do you see? a) Employees who make more phone calls per day tend to take more sick days. b) There is no relationship between the average number of phone calls per day and sick days. c) As the number of sick days increases, the employee averages a lower number of phone calls. d) All of the above Employees who make more phone calls per day tend to take more sick days. The b in y=mx+b is what? a) The point at which the line crosses the y-axis b) The value of y when x=0. c) Neither A nor B d) Both A and B Both A and B A college wants to study if there is a relationship between the health of students enrolled at the university and the number of credit hours they are enrolled in. They pull the student numbers of all students who used the gym in the last month and randomly selected 200 . They then asked them how many hours he/she exercised in the last week and how many credit hours he/she is enrolled in and based on the data announced that the fewer credit hours a student is enrolled in at the university, the more hours per week the student exercises. Is this a valid conclusion? a) No, the sample is biased because the sample was too small. b) No, the sample is biased because only students at one university were questioned. c) No, the sampling frame in this study introduced bias because it is not representative of the population. d) Yes, the study was conducted in a fundamentally random way. No, the sampling frame in this study introduced bias because it is not representative of the population. [Show More]
Last updated: 2 years ago
Preview 1 out of 16 pages
Buy this document to get the full access instantly
Instant Download Access after purchase
Buy NowInstant download
We Accept:
Can't find what you want? Try our AI powered Search
Connected school, study & course
About the document
Uploaded On
Nov 27, 2022
Number of pages
16
Written in
This document has been written for:
Uploaded
Nov 27, 2022
Downloads
0
Views
74
In Scholarfriends, a student can earn by offering help to other student. Students can help other students with materials by upploading their notes and earn money.
We're available through e-mail, Twitter, Facebook, and live chat.
FAQ
Questions? Leave a message!
Copyright © Scholarfriends · High quality services·