In your initial post, you will solve a problem that involves conditional probability, first using “no replacement” and second using “with replacement”. You will be using the Packaging variable from your LEGO dataset.

Instructions – Initial Post
Use StatKey (https://www.lock5stat.com/StatKey/index.html) in any browser and click on One Categorical Variable. Click UPLOAD FILE and upload the CSV file you created last week. Choose the Packaging variable and display the Bar Graph and Summary Statistics.
Video: https://youtu.be/LDIng0_H7PM

Determine the relative frequency (proportion) of LEGO sets in your sample that are contained in a “Box”. For example, in this sample, 32 out of 50 sets are contained in a Box (32/50, 0.64, or 64%).
Be sure to show us your Summary Statistics.

Case 1: No Replacement. Suppose you pick THREE sets at random from your sample WITHOUT replacement. Compute the probability that you will pick three sets in Boxes. (Show your work.)

Case 2: With Replacement. Suppose you pick THREE sets at random from your sample WITH replacement. Compute the probability that you will pick three sets in Boxes. (Show your work.)

Compare and contrast the two scenarios. Which one is more likely? Is it a big difference?
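
As a minimal sketch of the two calculations, the Python snippet below uses the example figures quoted in the prompt (32 Box sets out of 50). The function names are illustrative assumptions, and the counts should be replaced with those from your own sample.

```python
from fractions import Fraction


def p_all_boxes_without_replacement(boxes, total, picks=3):
    """Multiply conditional probabilities: each pick removes one Box set."""
    p = Fraction(1)
    for i in range(picks):
        p *= Fraction(boxes - i, total - i)
    return p


def p_all_boxes_with_replacement(boxes, total, picks=3):
    """Picks are independent, so the same proportion is multiplied each time."""
    return Fraction(boxes, total) ** picks


# Example counts from the prompt: 32 of 50 sets are in a Box.
without = p_all_boxes_without_replacement(32, 50)
with_repl = p_all_boxes_with_replacement(32, 50)

print(f"Without replacement: {without} = {float(without):.4f}")
print(f"With replacement:    {with_repl} = {float(with_repl):.4f}")
```

With these example counts, the without-replacement product is (32/50)(31/49)(30/48), roughly 0.2531, and the with-replacement result is 0.64 cubed, roughly 0.2621, so the two cases differ by less than one percentage point.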

This assignment requires a statistics-minded person to answer questions 1-3.
Message from the professor:
Could you revisit this post?
The paired samples t-test looks at the difference between the means. Both of the p-values are significant. Meeting these assumptions is important for validity, but I do not see how the output describes what you are saying. Here is a link with a brief explanation of the paired samples t-test: https://www.ibm.com/docs/en/spss-statistics/saas?topic=tests-paired-samples-t-test

Important note: don’t use AI. I will provide the PDF with all the questions. Please provide the answers in order so I can copy and paste them into the workbook that I will submit as my final submission. There will be some code cells; try to do as much of the work as you can manually, and use the code cells only if necessary. Also provide the code in the correct order so I can copy and paste it into the workbook, and label each answer with its question so I know which is which.
The assignment is in the workbook. Make sure to follow all the instructions:
All answers must be typed directly into the provided spaces.
Only long or complex handwritten equations/graphs can be submitted as a PDF attachment.
Python code must be written in the cells provided on the Forum. Submissions of Python code in PDF format will not be accepted.
This problem set is intended to be done fully solo – that is, without consulting classmates. This is because we want to develop our individual capacities for grappling with hard problems to bring more to the table when collaborating with our peers later. This will also allow instructors to give more personalized feedback on your solutions.
As you work the problems and write up your solutions, try to use the correct vocabulary and notation for your ideas. Ensure the submitted PDF is readable and properly formatted.
Assignment Information
Weight: 15%
Learning Outcomes Added
Distributions: Identify different types of distributions and make inferences based on samples from distributions appropriately.
Probability: Apply and interpret fundamental concepts of probability, including conditional and Bayesian probabilities.
ModelSelection: Apply appropriate statistical theory and methods to determine which model generated observed data.
ParameterEstimation: Apply appropriate statistical theory and methods to determine which parameter values generated observed data.
CompTools: Use appropriate computational tools to solve problems in Probability and Statistics.
MathTools: Use appropriate mathematical notation (including DAGs) and tools to solve problems in Probability and Statistics.
ProfessionalWorkProduct: Follows the established guidelines for the task and academic conventions in writing and presentations.

LEVELS OF DATA/MEASUREMENT
Please tell me the LEVEL OF DATA/MEASUREMENT of EACH VARIABLE ON THE ATTACHED SURVEY.
- Write the “level” NEXT to the variable, save, then UPLOAD IT.
- I AM NOT ASKING YOU TO FILL IT OUT!!

Instructions
a. UPLOAD the SURVEY – indicating the LEVELS OF DATA/MEASUREMENT of each variable
Survey
INTERVIEW PACKAGE: Recruiting Letter and Scales

ALL CORRECTIONS/EDITS ARE APPRECIATED

RECRUITING STATEMENT

Social Statistics
Profile of Dillard University Students Evaluative Survey/Questionnaire

Dr. Steve A. Buddington and his students in the spring 2019 Social Statistics Class ask that you respond to this survey. This is a PILOT STUDY assessing the profile of a purposive sample of students on the variables delineated. It will only be used internally and will not be published.

Please complete the following questionnaires or surveys if you are a Dillard University student. This survey is confidential. Your answers will be summarized with those of the others who completed the survey/questionnaire and USED ONLY FOR TEACHING AND LEARNING OF DATA ANALYSIS. Do not write any identifying information on this survey – anonymity and confidentiality are maintained.

***********************************************
Please read the following important information before you answer the surveys.

1. Answer the questions as honestly as possible. There are no right or wrong answers.

2. Don’t spend too much time thinking about the answer. Give the first answer that comes to mind.

3. Please fill out all questions as completely as possible. Don’t skip any questions, and make sure you answer every statement with only one response.

4. Although some questions may seem much like others, no two statements are precisely alike.

Thank you for your cooperation!!

**************************************************
PROFILE OF DILLARD STUDENTS

Social Statistics Data Collection/Code Sheet; Spring 2015

** If you are NOT a Dillard University Student, please do not complete this survey
________________

Age: ____

Gender: Male ____  Female ____

Height: _____ (in inches)

Weight: _____ (in lbs)

Grade Point Average: _____

Annual Income: __________

Study Time (in minutes): ___________

Time spent with Advisor (in minutes): ___________

ACT Score: ____________

Marital Status: Single _____  Married _____  Separated _____  Divorced _____

Classification: Freshman ______  Sophomore ______  Junior ______  Senior ______

Q1) A manufacturing company regularly conducts quality control checks at specified periods on the products it manufactures. Historically, the failure rate for LED light bulbs that the company manufactures is 5%. Suppose a random sample of 10 LED light bulbs is selected. What is the probability that
a. none of the LED light bulbs are defective?
b. exactly one of the LED light bulbs is defective?
c. two or fewer of the LED light bulbs are defective?
d. three or more of the LED light bulbs are defective?
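
One way to check parts a through d is to treat the number of defective bulbs as a binomial variable with n = 10 and p = 0.05, which assumes the bulbs fail independently of one another. The sketch below uses only the Python standard library; `binomial_pmf` is an illustrative helper name, not something defined in the problem.

```python
from math import comb


def binomial_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


n, p = 10, 0.05  # sample size and historical failure rate from the problem

p_none = binomial_pmf(0, n, p)                                # part a
p_exactly_one = binomial_pmf(1, n, p)                         # part b
p_two_or_fewer = sum(binomial_pmf(k, n, p) for k in range(3)) # part c
p_three_or_more = 1 - p_two_or_fewer                          # part d (complement of c)

for label, value in [
    ("a. none defective", p_none),
    ("b. exactly one defective", p_exactly_one),
    ("c. two or fewer defective", p_two_or_fewer),
    ("d. three or more defective", p_three_or_more),
]:
    print(f"{label}: {value:.4f}")
```

Part d is computed as the complement of part c, which avoids summing the eight terms from k = 3 to k = 10.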
Q2) Assume that the number of new visitors to a website in one minute is distributed as a Poisson variable. The mean number of new visitors to the website is 4.0 per minute. What is the probability that in any given minute
a. zero new visitors will arrive at the website?
b. exactly one new visitor will arrive at the website?
c. two or more new visitors will arrive at the website?
d. fewer than three new visitors will arrive at the website?
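
The Poisson parts can be checked the same way. This is a minimal sketch using the stated mean of 4.0 visitors per minute; `poisson_pmf` is an illustrative helper name, and only the Python standard library is used.

```python
from math import exp, factorial


def poisson_pmf(k, lam):
    """P(X = k) for X ~ Poisson(lam)."""
    return exp(-lam) * lam**k / factorial(k)


lam = 4.0  # mean number of new visitors per minute from the problem

p_zero = poisson_pmf(0, lam)                                      # part a
p_one_visitor = poisson_pmf(1, lam)                               # part b
p_two_or_more = 1 - (p_zero + p_one_visitor)                      # part c (complement of 0 or 1)
p_fewer_than_three = sum(poisson_pmf(k, lam) for k in range(3))   # part d: k = 0, 1, 2

for label, value in [
    ("a. zero visitors", p_zero),
    ("b. exactly one visitor", p_one_visitor),
    ("c. two or more visitors", p_two_or_more),
    ("d. fewer than three visitors", p_fewer_than_three),
]:
    print(f"{label}: {value:.4f}")
```

Note that "fewer than three" covers 0, 1, and 2 visitors, so part d is not the complement of part c.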

In real-life applications, statistics helps us analyze data to extract information about a population. In this module discussion, you will take on the role of Susan, a high school principal. She is planning on having a large movie night for the high school. She has received a lot of feedback on which movie to show and sees differences in movie preferences by gender and also by grade level.
She knows if the wrong movie is shown, it could reduce event turnout by 50%. She would like to maximize the number of students who attend and would like to select a PG-rated movie based on the overall student population’s movie preferences. Each student is assigned a classroom with other students in their grade. She has a spreadsheet that lists the names of each student, their classroom, and their grade. Susan knows a simple random sample would provide a good representation of the population of students at their high school, but wonders if a different method would be better.
You can review the student demographics here: Module One Discussion Data PDF.
In your initial discussion post, specifically address the following:
Introduce yourself and describe a time when you used data in a personal or professional decision. This could be anything from analyzing sales data on the job to making an informed purchasing decision about a home or car.
Describe to Susan how to take a sample of the student population that would not represent the population well.
Describe to Susan how to take a sample of the student population that would represent the population well.
Finally, describe the relationship of a sample to a population and classify your two samples as random, systematic, cluster, stratified, or convenience.
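
To make the difference between sampling methods concrete, here is a hedged sketch of a stratified sample drawn by grade level. The roster is entirely hypothetical (four grades of 100 placeholder students each), since the actual demographics live in the Module One Discussion Data PDF, and `stratified_sample` is an illustrative helper rather than a prescribed method.

```python
import random
from collections import Counter, defaultdict


def stratified_sample(roster, key, fraction, seed=0):
    """Draw the same fraction of students at random from each stratum."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    strata = defaultdict(list)
    for student in roster:
        strata[student[key]].append(student)
    sample = []
    for members in strata.values():
        k = max(1, round(len(members) * fraction))
        sample.extend(rng.sample(members, k))
    return sample


# Hypothetical roster: placeholder names, 100 students in each of grades 9-12.
roster = [
    {"name": f"student_{grade}_{i}", "grade": grade}
    for grade in (9, 10, 11, 12)
    for i in range(100)
]

sample = stratified_sample(roster, key="grade", fraction=0.10)
per_grade = Counter(student["grade"] for student in sample)
print(dict(per_grade))
```

Because every grade contributes the same fraction, no grade level can be over- or under-represented by chance, which is the property a convenience sample (for example, surveying only one classroom) lacks.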

In this assignment, you will use data related to a topic of your choice to create different data visualizations. After understanding the trends and patterns in your data visualizations and studying your chosen subject, you will ask a research question that arises while observing the data. You will then formulate a hypothesis that aims to answer the research question.
In my case, I chose:
Subject:
Public transportation usage vs. private car usage
Research question:
What is the environmental impact of public transportation and private car usage in terms of greenhouse gas emissions and air pollution?

Learning Outcomes
● #Visualizations: Interpret, analyze, and create data visualizations.
● #HypothesisDevelopment: Evaluate the link between hypothesis-driven research and the theories or observations that motivate it.
● #EvidenceBased: Identify and appropriately structure the information needed to support an argument effectively.
● #Professionalworkproduct

Assignment Structure:
The written report (1000 words +/- 10%) should include the following elements.
(See the ‘Guidelines’ section for more detailed information)
Introduction – Provide an introduction of your chosen topic, identify key terms and discuss why it is important
Labeled visualizations (2 self-created + 1 ready-made)
Hypothesis – with discussion of plausibility, testability, relevance, target population
Explanation of how the data visualizations are related and how they contribute to the development of your hypothesis
Discussion of preliminary research that relates to your hypothesis (2-3 articles)
Appendix – Screenshots of the raw data used for each of your visualizations

Guidelines
Creating Visualizations for the Assignment
You need to provide a minimum of 2 self-created visualizations AND 1 ready-made visualization.
There should be variety in the type of visualizations you create.
All data visualizations MUST support (lead to) your hypothesis.
Your visualizations should complement each other and help you develop your hypothesis.
Ensure that your visualizations have a descriptive title and are well-labeled, appropriately scaled, and visually appealing.
Include figure captions for each visualization. These captions should feature essential elements within the figure and summarize the primary patterns or insights revealed by the visualization.
Packages to create visualizations – You may use the software package of your choice. Links for some options are provided:
● CODAP, website link
● Python, [matplotlib] https://onecompiler.com/python/3vq38pmgw
● Excel: Microsoft Office Tutorials. (2015). Create a chart from start to finish.
● Google sheets

Appendix – Screenshots of the raw data used for each of your visualizations are to be included in the Appendix. Please submit clear screenshots representing your data to ensure you maintain a good grade.
No raw data screenshot is requested for your “ready-made” visualization, but you must acknowledge the source.

Datasets that may be used for the assignment:
You may use data from one or several of the provided dataset websites: Our World in Data (https://ourworldindata.org/), Gapminder (https://www.gapminder.org/), and World Bank (https://data.worldbank.org/).
(Note: you will need to get the approval of your instructor before using any dataset not provided in the guidelines)
Datasets can be for ANY topic.
More than one dataset can be used, but they must complement each other and support the hypothesis you will be developing.
Introduce the dataset(s) you use – provide context about its source, relevance, and key attributes.

Creating a Hypothesis related to the data visualizations
Provide a clear, specific, and testable hypothesis
Your hypothesis should be 1-2 sentences (not longer).
A hypothesis is a statement, not a question.
Explain how your hypothesis is derived from your 3 (minimum) data visualizations.
Explain the research question or idea you aim to explore through your hypothesis.
Discuss any relevant readings or prior research that influenced the development of your hypothesis.
Explain how your hypothesis can be tested.
Clarify the data type, test, or experiments required to support or refute your hypothesis.
Discuss why your hypothesis is plausible based on your understanding of the dataset, the context, and prior knowledge. (Refer to the Forum lesson plans on plausibility and testability of hypotheses)
Highlight any logical reasoning or assumptions that support the validity of your hypothesis.
