MGT 226 ONLINE BUSINESS ANALYTICS 线上商业分析代写 Question 1 (2 points) Descriptive analytics encompasses summarization and visualization of historical data using tables, charts, Qu...View details
统计数据分析作业代写 Background: Exoplanets are planets which orbit other stars, like the Earth orbits the Sun. Exoplanet discovery is currently an exciting and
Exoplanets are planets which orbit other stars, like the Earth orbits the Sun. Exoplanet discovery is currently an exciting and growing area of research in astronomy.
The Royal Astronomical Society (RAS) is an organisation which is interested in data about exoplanets. They have asked some astronomers to do observational research to discover exoplanets orbiting distant stars.
Previous theoretical research suggests that a regular, periodic decrease in light intensity detected from a distant star suggests there might be a planet orbiting the star (an exoplanet).
These distant stars are classified as either: 统计数据分析作业代写
- ‘sun-like’ which are stars with similar properties (size, mass, gravity etc) to our Sun.
- ‘super-massive’ which are stars which are far greater in properties (size, mass, gravity, etc.) than our Sun.
Exoplanet research suggests these three theories:
- A star being either sun-like or super-massive might affect the existence of orbiting exoplanets.
- Stars which are close to the centre of the galaxy have a different proportion of exoplanets compared to those near the edge of the galaxy.
- Stars which are in small clusters (groups of 3 or more stars close to each other) have been identified with proportionally more exoplanets compared to non-cluster stars.
This study is observational, where telescope and satellite data has been collected. The goal is to compare the real, observational data with the three theoretical predictions mentioned above.
The data, presented in Tables 1a and 1b, includes: 统计数据分析作业代写
- the distance each star is from the centre of their galaxy (in units of 1000’s of light years).
- the periodic percentage decrease in light intensity of each star.
- whether the star is part of a cluster of 3 or more stars.
The researcher reported that 1 or 2 of the measurements were taken before a fault was detected (and resolved) in the imaging processing. Your analysis must include all possible anomalies and state their possible impact on any conclusions.
The aims of this study include the identification of any statistical links between the variables for both super-massive and sun-like stars, and to link this data with the theoretical predictions in order to provide a focus for future research into finding and identifying exoplanets.
NOTE: theoretical research means research involving mathematical equations and laws (e.g. model behaviour). Observational research means using telescopes to measure (or observe) a behaviour. The two do not always agree.
Any calculations must only use the methods and techniques described to you on this course.
The goal is to present your recommendations clearly so that the Director of the RAS can easily interpret your report. You MUST assume that the researcher and the Director of the RAS has no prior statistical knowledge, therefore care should be taken to fully explain the reasoning behind any analysis.
You should consider categorizing data in some new way that reveals patterns, and consider re-processing the data, perhaps creating a new variable or new tables. Clearly summarize any findings.
Microsoft Excel must be used for the statistical analysis you will perform below. Use of other software is not permitted.
The following must be completed below:
a) Quality Check 统计数据分析作业代写
You must include a full numerical summary of the data, which includes:
- The 5-number summary.
- Measures of central tendency.
- Measures of variation.
- The skew and shape of the data.
You must include a critique and explanation of:
- the factors,
- any lurking variables,
which may be present in the study. You must highlight any possible errors in measurement, and any outliers. You must describe their effect on your conclusions.
b) Graphical Summary 统计数据分析作业代写
You must produce 1 graphical representation (with a maximum of two graphs) of the given data from the following options:
- Box Plot
- Modified Box Plot
- Cumulative Relative Frequency Curve
[For example, you may choose to have a maximum of 2 boxplots, but any other graphical summaries after this will not be marked.]
Note: More sophisticated graphs will receive more marks.
c) Regression Analysis 统计数据分析作业代写
You must produce:
I. (no more than) two scatter plots
II. (no more than) two least squares regression lines
III. (no more than) two correlation coefficients, and
IV. (no more than) two residual plots.
You must include:
- comments on the validity of the regression model, using all the scatterplot(s), regression line(s), correlation coefficient(s), and residual plots(s) to do this.
- comments on how the collection method of the data and the quality of the data effects the validity of the models.
d) 2 Sample Confidence Interval 统计数据分析作业代写
You must include:
I. How you are categorising the data and what 2-sample confidence interval you will use. That is:
- You must consider whether you are comparing tables 1a) and 1b) OR comparing data within either tables 1a) or 1b).
- Once you have decided, you must think about how you are going to compare the data sets.
- You must pick one of either difference of means or difference of proportions.
II. You must pick two confidence levels, stating clearly what confidence level you will use and why.
III. Full calculations must be included for each interval.
IV. A discussion on the physical interpretation of the two intervals
V. Comments on how the collection method of the data and the quality of the data effects the validity of the intervals.
A template will be provided to help you organise your recommendations, calculations, and critique and comments. 统计数据分析作业代写
End of scenario, the study data is below:
Table 1a: Data Recorded for Sun-Like Stars
|Distance from Centre of Galaxy (000’s Lt Yrs.)||Decrease inIntensity (%)||Clusterpresent? (Y/N)|
Table 1b: Data Recorded for Super-Massive Stars
|Distance fromCentre of Galaxy (000’s Lt Yrs.)||Decrease inIntensity (%)||Clusterpresent? (Y/N)|
Statistics in Education Policy 统计课业代写 There are 12 question groupings. You are expected to address each question in the grouping. Each question “grouping” is worth 8 points and ...View details