tag:blogger.com,1999:blog-31188735195662492762017-09-05T22:47:30.775-06:00Probability for ScientistsProbability for Scientists is a hands-on, interdisciplinary introduction to probability theory that emphasizes big-picture thinking. Problems are introduced via in-class exploration of physical objects such as coins, dice, and cards. Learn what randomness really looks like, how to read a histogram, and why the normal distribution is so common.Muninnoreply@blogger.comBlogger37125tag:blogger.com,1999:blog-3118873519566249276.post-5120013332299364912013-12-10T19:05:00.000-07:002013-12-10T19:05:30.651-07:00Final Course SurveyThanks so much for your participation in Probability for Scientists. As instructors, we enjoyed this class and learned a lot from teaching it. Please help us improve by answering the following anonymous survey. <br /><br />The final survey is available <a href="http://survey.x14n.org/index.php/328326/lang-en">here.</a> You have a week to complete it.<br /><br /><h2>Administrative notes</h2><ul><li>Due to low interest, Drew will *not* be available this Thurs 12 Dec.</li><li>We will be emailing you feedback on your written report. Please email the class website if you would like to arrange to pick up the physical copy.<br /></ul>xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-26534780815574104152013-12-10T19:03:00.001-07:002013-12-10T19:03:36.293-07:00Final Project PostersPosters from the final project are now available as pdfs <a href="https://probability-for-scientists.googlecode.com/git/posters/">here.</a> In the event that you have any updates, email a new pdf to the class address and we will post them.xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-33993766858043943422013-11-23T19:22:00.000-07:002013-11-23T19:22:08.880-07:00Final Project DetailsThe final poster session will be on Thursday, the 5th of December. Feel free to invite friends/family to this class.<br /><br />Key points to keep in mind as you work on the written projects:<br /><br /><ul> <li>The written report is *separate* from your poster. We strongly encourage you to complete your written reports early so you have ample time to focus on your poster.<br /> <li>Mark corresponding author, include email address.</li><br /> <li>Your report should be between 500 and 1,000 words total (including captions). Use simple, declarative statements - avoid "flowery" language like "we intend to", "we hope to", "it might be the case that", etc.</li><br /> <li>When there are multiple authors, use "We" rather than "I".</li><br /> <li>Aim for 3-5 final figures. Each figure should be numbered, have sensible axis labels w/units, and a 2-3 sentence caption.</li><br /> <li>Always spell check.</li><br /> <li>Bibliography: you can use any accepted style. An online tool like <a href="http://myt4l.com/index.php?v=pl&page_ac=view&type=tools&tool=bibliographymaker">this</a> can be helpful.</li><br /></ul> Poster details: <ul> <li>We will be printing posters for you. Thus, we need a final version by 5pm Tues 3 Dec.</li> <li>To submit your poster, export your finished poster from Powerpoint as a pdf and email it to the class address.</li> <li>I strongly suggest that each group reads <a href="http://colinpurrington.com/tips/academic/posterdesign">this post</a> about poster design. It includes several templates that you can use to start your poster. The final poster size should be 36" wide by 24" tall.</li></ul>xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-966648180453930142013-11-13T15:03:00.000-07:002013-11-14T12:24:17.881-07:00Week 13: Working with Rstudio and Data For class this Thursday, each group should have a laptop with Rstudio installed, and a dataset that's ready to be imported.<br /><br /><h2>Rstudio</h2><ul><li>First, you need to install R from <a href="http://cran.rstudio.com/">here</a>.</li><li>Next, install Rstudio from <a href="http://www.rstudio.com/ide/download/desktop">here</a>.</li><li>There's very good documentation on Rstudio <a href="http://www.rstudio.com/ide/docs/">here</a>. Learning a few shortcuts and understanding syntax highlighting can make your life much easier.</li></ul><br /><br /><h2>Importing Data</h2><ul><li>First, make sure you can open your data in a spreadsheet program like Excel or OpenOffice.</li><li>If there are multiple tables or "sheets", identify the most important ones, and think about how you might combine several tables together into a single table.</li><li>From your spreadsheet program, select "Save as" and save the spreadsheet as a CSV (comma-separated value) file.</li><li>Finally, load the CSV file using Rstudio ( nice instructions <a href=" http://isites.harvard.edu/fs/docs/icb.topic1345189.files/Import%20Data%20to%20R.pdf">here</a>).</li></ul><br />For class, please download this <a href="https://probability-for-scientists.googlecode.com/git/class/wk13/heads-switches.R">R script</a> and <a href="https://probability-for-scientists.googlecode.com/git/class/wk13/heads-switches.csv">data file</a> to the same directory.xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-117440175209889222013-11-08T18:08:00.001-07:002013-11-08T18:08:10.083-07:00The Monty Hall ProblemThere are several different valid specifications of the Monty Hall problem. All use Bayes' theorem to incorporate the initial choice of the contestant and the information that the host "gives away" to reach the same conclusion.<br /><br /><a href="http://formalisedthinking.wordpress.com/2010/10/05/bayes-theorem-and-the-monty-hall-problem/">This link</a> provides a concise explanation of the set-up and the math.<br /><br />The <a href="http://en.wikipedia.org/wiki/Monty_Hall_problem">Wikipedia article</a> for the problem is long, but contains a very good introduction, as well as some interesting history on the problem, and a detailed list of solutions.<br /><br />Finally, there's a New York Times interview with Monty Hall himself <a href="http://www.nytimes.com/1991/07/21/us/behind-monty-hall-s-doors-puzzle-debate-and-answer.html?src=pm">here</a> that describes some of the intricacies of the actual game show.xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-90499758319197992302013-11-08T17:27:00.000-07:002013-11-08T17:27:17.424-07:00Annotated Outline InstructionsThe annotated outline (Due Thurs 14 Nov) is an expanded version of your proposal, and will form the outline of your poster. This should be typed, and written to address a non-technical audience (can a high school senior understand it?). As noted in class, probability and statistics Wikipedia pages will be considered valid sources for this project.<br /><br />The format for the annotated outline is as follows:<br /><br /><ul><li>Introduction (200 words max, at least 2 sources) </li> <ul> <li>Background of the research system</li> <li>Problem statement</li> <li>Significance of problem</li></ul><li>Methods (200 words max, at least 2 sources) </li> <ul> <li>Describe data, including source, number of samples, variables, whether data collection is complete. </li> <li>Analysis: What will you do to answer the question, and how did you choose these methods? How will you interpret the results? Describe any (nontrivial) assumptions you'll be making.</li></ul><li>Discussion (200 words max) </li> <ul> <li>Expected outcome and reasoning.</li> <li>Significance of results.</li></ul></ul>xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-28691129943979969232013-11-08T17:15:00.001-07:002013-11-08T17:15:18.161-07:00Proposal Feedback and RevisionsWe've reviewed all of your final projects and provided feedback. <br />We need a few things from you:<br /><br /><ul><li>Please choose one corresponding author per group. We will email feedback to the corresponding author ASAP.</li><li>For Tuesday 12 November, please revise your proposals, taking our comments into account.</li></ul><br />Revised proposals should be typed, and addressed to a non-technical audience (would a high school senior understand it without further explanation?). When revising your proposals, please use the following format:<br /><br /><ul><li>Title</li><li>List of authors (mark corresponding author and provide preferred email)</li><li>Proposal text. Text should be less than 200 words. It should explain the research question and significance. Briefly describe the data that will be used, and whether it has been collected already. Describe the methods you will use, and how they answer the research question. Finally, include a brief description of your expected results and their practical significance. </li></ul>xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-5976628577257382172013-11-06T17:31:00.000-07:002013-11-06T17:31:17.111-07:00Final Project ProposalFinal project proposals are due Thurs 7 Nov (one per group, hard copy).<br />Consider the following critera when choosing a project:<br /><br />Projects should be:<br /><ul> <li>Interesting (to group members, and hopefully to others)</li> <li>Have real-world significance</li> <li>And be tractable (you can make progress during the course of the semester).</li></ul><br />The proposal should include the group members, a title, and a 3-5 sentence abstract. The abstract should explain the problem, and the proposed method for solving or exploring the problem.xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-15825982323426026732013-10-29T15:13:00.000-06:002013-10-29T15:14:46.923-06:00Quiz make-upAs I mentioned last week, you can make up a single quiz (Due Tues 26 Nov) by doing the following:<br /><br />1. Find a probability/statistics article on Wikipedia.<br />2. Read as much of it as you can. I expect you to spend somewhere between 20 and 40 minutes reading the article. Use links in the article to look up some of the terms you're unfamiliar with.<br />3. Write down 5 things you learned or found interesting, one sentence each.<br /><br /><br /><br />The following 2 links are Wikipedia "Outline" articles. They show the organization of all the content related to that topic on Wikipedia, and can be a good place to start.<br /><br /><a href="http://en.wikipedia.org/wiki/Outline_of_probability">http://en.wikipedia.org/wiki/Outline_of_probability</a><br /><br /><a href="http://en.wikipedia.org/wiki/Outline_of_statistics">http://en.wikipedia.org/wiki/Outline_of_statistics</a>xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-55295621975789893252013-10-29T15:05:00.000-06:002013-10-29T15:05:12.433-06:00Data entry: Number of Heads and SwitchesYou can find the survey to enter data from today's activity <a href="http://survey.x14n.org/index.php/767678/lang-en">here</a>. xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-87435762041793327072013-10-14T23:39:00.003-06:002013-10-14T23:39:56.022-06:00Week 9: Confidence IntervalsIn class this week we are looking at confidence intervals, starting with the proportion of successes (p) of a binomial process. p̂ is the sample estimate of p. The following figures illustrate what we did for a range of values.<br /><br /><div class="image"><a href="https://probability-for-scientists.googlecode.com/git/class/figs/wk09-se.png" imageanchor="1" ><img border="0" src="https://probability-for-scientists.googlecode.com/git/class/figs/wk09-se.png" /></a><br /><div>This figure shows how the standard error (SE) of p̂ changes with respect to p̂ and the sample size. </div></div> <br /><br /><div class="image"><a href="https://probability-for-scientists.googlecode.com/git/class/figs/wk09-ci_width.png" imageanchor="1" ><img border="0" src="https://probability-for-scientists.googlecode.com/git/class/figs/wk09-ci_width.png" /></a><br /><div>This figure shows how the width of the confidence interval changes with respect to p̂, the sample size, and α. The panel headings show the "confidence", i.e. (1-α). </div></div>xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-35490546355174494432013-10-08T00:54:00.001-06:002013-10-08T00:54:29.387-06:00Change in Readings The schedule has been updated to switch the reading for the next two weeks. Next week's quiz is on CGS Ch 8, and the week after is CGS Ch 7.xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-75973190084828675622013-10-03T16:55:00.000-06:002013-10-03T16:55:40.880-06:00Lab 4: Discrete and Continuous DistributionsLab 4 is available <a href="https://probability-for-scientists.googlecode.com/git/hw/lab4.pdf">here</a>. It's due at the beginning of class on Tues, 22 Oct 2013. xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-52312939289323076052013-10-03T00:58:00.000-06:002013-10-03T02:42:05.597-06:00Week 7: Hypothesis testing, class videosThis week we're looking at hypothesis testing. We started out using the <a href="http://en.wikipedia.org/wiki/Mann%E2%80%93Whitney_U">Wilcoxon rank-sum test</a> (also known as the Mann-Whitney U test) to test whether samples were drawn from different populations.<br /><br />The world is full of statistical (hypothesis) tests. Each one generates a test statistic. The key to understanding a test is understanding what the distribution of the test statistic would be if the null hypothesis was true. <br /><br />The test statistic of the rank sum test is U: the sum of the ranks minus a sample size correction factor.<br />For the rank sum test, the null hypothesis is (approximately) that two samples are drawn from populations with the same mean. The following figures show the distribution of U, assuming the null hypothesis is true. The area of the shaded region sums to alpha. The vertical red lines show our critical values of U. Values of U that are more extreme than these critical values are unlikely due to chance <em>if the null hypothesis is true</em>. Thus, if we observe U values this extreme, we can <em>reject</em> the null hypothesis.<br /><br /><div class="separator" style="clear: both; text-align: center;"><a href="https://probability-for-scientists.googlecode.com/git/class/fig-wk7-wilcox-n10-alpha0.05.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://probability-for-scientists.googlecode.com/git/class/fig-wk7-wilcox-n10-alpha0.05.png" width="100%" height="100%" /></a></div><br />If we lower alpha, we see the area in the tails get smaller.<br /><br /><div class="separator" style="clear: both; text-align: center;"><a href="https://probability-for-scientists.googlecode.com/git/class/fig-wk7-wilcox-n10-alpha0.01.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://probability-for-scientists.googlecode.com/git/class/fig-wk7-wilcox-n10-alpha0.01.png" width="100%" height="100%" /></a></div><br />For larger sample size, we see the value of U gets much larger, but the same pattern holds.<br /><br /><div class="separator" style="clear: both; text-align: center;"><a href="https://probability-for-scientists.googlecode.com/git/class/fig-wk7-wilcox-n20-alpha0.05.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://probability-for-scientists.googlecode.com/git/class/fig-wk7-wilcox-n20-alpha0.05.png" width="100%" height="100%" /></a></div><br /><div class="separator" style="clear: both; text-align: center;"><a href="https://probability-for-scientists.googlecode.com/git/class/fig-wk7-wilcox-n20-alpha0.01.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://probability-for-scientists.googlecode.com/git/class/fig-wk7-wilcox-n20-alpha0.01.png" width="100%" height="100%" /></a></div><br />For a devil's advocate view of what p-values <em>mean</em>, we turn to the internet:<br /><a href="http://www.youtube.com/watch?v=ax0tDcFkPic">What the p-value</a> <br /><br />The Mann-Whitney U test (also known as the Wilcoxon rank sum test) is a <em>non-parametric</em> test: it makes no assumptions about the distribution of the data. Most common statistical tests are parametric, and usually assume the data (or something about the data) is normally distributed. The <a href="http://en.wikipedia.org/wiki/Student%27s_t-test">t-test</a> is the parametric sibling of the rank sum test. It assumes the data is normally distributed. <br /><br />This video describes hypothesis tests in general, and walks through the t-test.<br /><a href="http://www.youtube.com/watch?v=0Pd3dc1GcHc">What is a t-test?</a><br /><br />By the end of this week, this comic should make sense. <br /><a href="http://xkcd.com/882/">XKCD: Significant</a>xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-71036356420334563652013-10-02T22:35:00.001-06:002013-10-02T22:42:29.146-06:00Week 6 survey results: reaction timesHere's a familiar histogram of each person's reaction times (in distance), along with everyone combined together (heading "All").<br /><br /><div class="separator" style="clear: both; text-align: center;"><a href="https://probability-for-scientists.googlecode.com/git/surveys/s06/reaction-hist.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://probability-for-scientists.googlecode.com/git/surveys/s06/reaction-hist.png" width="100%" height="100%" /></a></div><br />Overall, the whole class (heading "All") appears approximately normally distributed, though somewhat right-skewed. Why might we expect this distribution to be right (upwards) skewed?<br /><br />With this 3-column layout, it is difficult to compare individual performances. We could use one column and 15 rows, but that would make a very long figure. The following figure condenses each person's data, easing comparisons. This is called a box-and-whisker plot, or boxplot. The black dot shows the median, and the box shows the interquartile range (which measures the variability, similar to standard deviation). The individual points are considered outliers. For more information, see <a href="http://en.wikipedia.org/wiki/Box_plot">the boxplot wikipedia page</a>. <br /><br /><div class="separator" style="clear: both; text-align: center;"><a href="https://probability-for-scientists.googlecode.com/git/surveys/s06/reaction-boxplot.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://probability-for-scientists.googlecode.com/git/surveys/s06/reaction-boxplot.png" width="100%" height="100%" /></a></div><br />From this, I can easily see some people's responses vary quite a bit, while others are much more consistent. I also notice that 2 people appear faster (lower distance) than the average, and one person appears slower. How might we test if these are significantly different from the rest of the class?<br /><br />xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-6239970535240932022013-09-26T12:50:00.004-06:002013-09-26T12:50:58.750-06:00Drew's Central Limit Theorem DemonstrationHere's the demo application that we used in class. It simulates adding up dice rolls similar to what we did in class today.<br /><a href="http://cs.unm.edu/~drew/probsci/">Link</a>xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-5384298384259678752013-09-24T15:00:00.003-06:002013-09-24T15:05:27.906-06:00Data entry survey: reaction timesThe reaction times survey is <a href="http://survey.x14n.org/index.php/611686/lang-en">here</a>.<br />It is worth 5 points.xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-80949277545031388822013-09-24T01:29:00.000-06:002013-09-24T15:23:57.735-06:00Week 5 survey results: number of heads (binomial distribution)I've made a movie out of the coin-flip results. Each frame of the movie adds a single coin-flip.<br /><br />I plotted 2 histograms: the top one shows the total number of each result, and the bottom shows the proportion or density of each result. You can see the Y-axis of the top histogram change as we add flips.<br /><br />What would this movie look like if we added another 1,000 flips?<br /><br /><a href="https://probability-for-scientists.googlecode.com/git/surveys/s04/movie-s04-coinflips-nheads.avi">Movie Link</a>xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-21480150256318755392013-09-20T00:59:00.001-06:002013-09-24T01:18:46.455-06:00Lab 2The Lab 2 answer key is available <a href="https://probability-for-scientists.googlecode.com/git/hw/sol2.pdf">here</a>.<br /><br />A lab rewrite can earn you up to 50% of missed points. Rewrite questions are marked with @@.<br /><br />Identify errors/typos in the key for extra credit. See answer key for details.<br /><br />Note that Lab 2 is worth 95 points (it says 100 on the assignment - typo), with a possible 5 points of extra credit.xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-73898610974691708442013-09-19T02:19:00.002-06:002013-09-24T15:06:22.771-06:00Class Feedback SurveyThe class feedback survey is now available <a href="http://survey.x14n.org/index.php/974868/lang-en">here</a>.<br />As usual, you need to register with a gmail address, but your responses will be anonymous.<br /><br />This survey is worth 5 points. <br /><br />xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-44691221151263813662013-09-17T15:44:00.001-06:002013-09-17T15:44:15.872-06:00Week 5 Survey: Coin TossesThe data entry form for today's coin-toss activity is <a href="http://survey.x14n.org/index.php/765525/lang-en">here</a>.xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-70052670865862482272013-09-15T19:27:00.001-06:002013-09-15T19:28:05.953-06:00Cannon of Proportion<div class="separator" style="clear: both; text-align: center;"> <a href="http://lh5.ggpht.com/-QtL1mnkOWx8/UjZeo5cLfbI/AAAAAAAAFZE/gM6yvLGqtvk/s1600/IMG_20130912_135233.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"> <img border="0" src="http://lh5.ggpht.com/-QtL1mnkOWx8/UjZeo5cLfbI/AAAAAAAAFZE/gM6yvLGqtvk/s640/IMG_20130912_135233.jpg"> </a> </div>Muninnoreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-20361138108774140752013-09-10T19:55:00.001-06:002013-09-12T22:14:17.239-06:00Lab 3Lab 3 is here:<br /><a href="https://probability-for-scientists.googlecode.com/git/hw/lab3.pdf">https://probability-for-scientists.googlecode.com/git/hw/lab3.pdf</a>.<br /><br />Due date: Beginning of class Oct 1.Muninnoreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-75168119960247174572013-09-10T18:17:00.000-06:002013-09-12T22:14:49.669-06:00Mathematics: Chance encounters in the life of Andrei Kolmogorov<a href="http://nautil.us/issue/4/the-unlikely/the-man-who-invented-modern-probability">http://nautil.us/issue/4/the-unlikely/the-man-who-invented-modern-probability</a><br /><br />A fun little read about a weird character in the history of probability.<br /><br /><br />Muninnoreply@blogger.com0tag:blogger.com,1999:blog-3118873519566249276.post-2988426775307109472013-09-05T03:58:00.002-06:002013-09-06T19:39:16.957-06:00Lab 1 Answer Key (Updated)The answer key to Lab 1 is now available <a href="https://probability-for-scientists.googlecode.com/git/hw/sol1.pdf">here</a>.<br /><br />Edit: The link is now working.xianhttp://www.blogger.com/profile/01991290472959345882noreply@blogger.com2