{"id":396,"date":"2021-06-23T16:30:06","date_gmt":"2021-06-23T16:30:06","guid":{"rendered":"https:\/\/openbooks.library.unt.edu\/quantitativeanalysisinexerciseandsportscience\/?post_type=chapter&#038;p=396"},"modified":"2021-12-06T15:38:37","modified_gmt":"2021-12-06T15:38:37","slug":"practical-significance-and-other-statistical-concerns","status":"publish","type":"chapter","link":"https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/chapter\/practical-significance-and-other-statistical-concerns\/","title":{"raw":"Practical Significance and Effect Sizes","rendered":"Practical Significance and Effect Sizes"},"content":{"raw":"[caption id=\"attachment_678\" align=\"aligncenter\" width=\"1895\"]<img class=\"wp-image-678 size-full\" src=\"https:\/\/openbooks.library.unt.edu\/quantitativeanalysisinexerciseandsportscience\/wp-content\/uploads\/sites\/3\/2021\/06\/5.1.png\" alt=\"Screenshot from Thesaurus.com showng synonyms of the word significant\" width=\"1895\" height=\"733\" \/> Figure 5.1. Screenshot of synonyms of the word \"significant\" from www.thesaurus.com[\/caption]\r\n\r\nWe have heard the phrase statistical significance used quite a bit already when we are discussing a statistical finding that has a <em>p<\/em> value of less than or equal to 0.05. Often in research and academic journal articles, the phrase statistical significance is shortened to just state that it was a \"significant finding.\" This is problematic for many who don\u2019t understand that the word significant in a statistical perspective may be different than the more straightforward and widely used definition for significance that most people use. For example, consider a study that demonstrates a statistically significant difference between weight loss outcomes in two groups. One that took a weight loss supplement and one that did not. This may be described as a \u201dsignificant\u201d difference in outcomes between the groups because the p value was less than 0.05. Think about how the supplement company might use that for marketing purposes. They might say something like \u201close significantly more weight with our product\u201d or any of the other synonyms shown above that comes from <a href=\"https:\/\/www.thesaurus.com\/browse\/significant\">thesaurus.com<\/a>. But that isn\u2019t exactly what statistical significance means. If we have a large enough sample, we can find a statistically significant difference even when that difference is a small one (demonstrated at the end of the last chapter). So, we should also be concerned with practical differences.\r\n\r\nAnother way to describe practical difference is meaningfulness.\u00a0 How meaningful is the statistical difference we found? This information is what is probably most important to the consumer or end user, but it might not be the easiest to spin for marketing. Whenever we perform statistical analyses, whether they are looking for similarity (i.e. correlation) or difference (i.e. t test or ANOVA), we should also include a measure of meaningfulness.\r\n\r\nMeaningfulness can be quantified by effect size estimates. We will discuss two main types of effect sizes, one of which you are actually already familiar with.\r\n<div class=\"textbox textbox--learning-objectives\"><header class=\"textbox__header\">\r\n<h2 class=\"textbox__title\"><span style=\"color: #ffffff\">Chapter Learning Objectives<\/span><\/h2>\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n<ul>\r\n \t<li>\r\n<div>Discuss meaningfulness as it applies to quantitative analysis<\/div><\/li>\r\n \t<li>\r\n<div>Learn how to quantify and interpret effect size estimates<\/div><\/li>\r\n<\/ul>\r\n<\/div>\r\n<\/div>\r\n<h1>Effect Sizes<\/h1>\r\nEffect sizes, or more appropriately effect size estimates, help us statistically quantify practical significance. Effect sizes have also been described as a measure of \"meaningfulness.\"[footnote]Ellis, P. (2010). The Essential Guide to Effect Sizes: Statistical Power, Meta-Analysis, and Interpretation of Results. Cambridge, NY. Cambridge University Press.[\/footnote] From a purely practical and applied perspective, the effect size should be the primary outcome of research inquiry.[footnote]Cohen J. (1990). Things I have learned (so far) Am Psychol. 45:1304\u20131312.[\/footnote] No matter what we are measuring, we are likely most interested in the size of the difference or relatedness between variables. The reliability of our finding or how confident we are in the finding is important also, but it shouldn\u2019t be the only product.\r\n\r\nThis has become such a problem in many fields of research that more than 800 researchers recently published a paper condemning the usage of <em>p<\/em> values since many rely too heavily on it and not enough on practical measures of significance that demonstrate realistic effects.[footnote]Nature 567:305-307 (2019). <a href=\"https:\/\/www.nature.com\/articles\/d41586-019-00857-9?fbclid=IwAR1jzbGpWu9wsHIwBdOu3byOielCLEQxPZMvHJ-3X4GW2gvy4eD98a7a9EU\">https:\/\/www.nature.com\/articles\/d41586-019-00857-9?fbclid=IwAR1jzbGpWu9wsHIwBdOu3byOielCLEQxPZMvHJ-3X4GW2gvy4eD98a7a9EU<\/a>[\/footnote] Completely abandoning measures of statistical significance like <em>p<\/em> values is probably a little too extreme, but their point is well taken. We should be publishing measures of both statistical and practical significance or meaningfulness.\r\n\r\nSo, let\u2019s define effect size. An[pb_glossary id=\"682\"] effect size[\/pb_glossary] is a measure that describes the magnitude or size of the difference or relatedness between the variables we are measuring. This means that it is describing the practical\/meaningful significance. As we will see in this chapter, it really all boils down to how much the data overlaps between our groups or variables. If we are comparing groups for differences, we would likely not expect to see much overlap in the data. If we were evaluating our data for similarity or association, we are looking for more overlap.\r\n<h1>Types of Effect Sizes<\/h1>\r\nAs mentioned above, effect sizes can be created for both relatedness and difference tests. This essentially means that there are 2 \u201cfamilies\u201d of effect sizes; <em>r<\/em> for relatedness and <em>d<\/em> for difference. Conversions can also be completed between the two if it was necessary to make comparisons across different study types. That won't be demonstrated in this chapter, but a quick internet search for \"<em>r<\/em> to <em>d<\/em> effect size conversion,\" or vice versa, will bring up several online calculators and formulas.\r\n<h2>Cohen's <em>d<\/em> Effect Size Estimates<\/h2>\r\nLooking on the difference side of effect sizes (ES), Cohen\u2019s <em>d<\/em> effect size estimates are most commonly seen. They are named after Jacob Cohen. They can be quickly calculated when both means and the pooled standard deviation is known. The means can be from independent or paired groups. The pooled standard deviation is the standard deviation calculated from both groups as one.\r\n<p style=\"text-align: center\">[asciimath]\"Cohen's d ES\" = (M_1 - M_2)\/\"sd\"_\"pooled\"[\/asciimath]<\/p>\r\n<p style=\"text-align: center\">[asciimath]M = mean, sd = \"standard deviation\"[\/asciimath]<\/p>\r\nWith this data, we simply subtract one mean from the other and divide by the pooled standard deviation. If the value is negative, that simply means that the second mean or the one that was subtracted from the other was larger than the original mean. This is dependent on how the dataset was ordered or how they were called into the formula. So, the direction doesn\u2019t really matter here since the magnitude is the largest concern. For this reason, the negative sign is usually not included. Effect size estimates can be interpreted with the scale provided. Anything less than 0.2 is considered a trivial difference, 0.2 to 0.49 is a small difference, 0.5 to 0.79 is a medium difference, and anything \u2265 0.8 is a large difference.\r\n\r\n[caption id=\"attachment_686\" align=\"aligncenter\" width=\"446\"]<img class=\"wp-image-686 size-full\" src=\"https:\/\/openbooks.library.unt.edu\/quantitativeanalysisinexerciseandsportscience\/wp-content\/uploads\/sites\/3\/2021\/06\/Screen-Shot-2021-07-15-at-2.35.52-PM.png\" alt=\"Figure 5.2 T-shirt size description for Cohen's d effect size estimate interpretation\" width=\"446\" height=\"190\" \/> Figure 5.2 T-shirt size description for Cohen's d effect size estimate interpretation[\/caption]\r\n\r\nAs mentioned previously, effect sizes really all boil down to the amount of overlap in the data. Figure 5.3 demonstrates this with 2 examples. The first one (on top) is a comparison of 2 groups. One distribution has a mean of 50, the other has a mean of 55, and the pooled standard deviation is 10. Solving for the Cohen\u2019s <em>d<\/em> effect size, we find that there is a medium difference between these 2 groups with a value of 0.5 (<em>d<\/em>=(55-50)\/10). The second example has a distribution with a mean of 50, a distribution with a mean of 110, and a pooled standard deviation of 60. Here we find a large difference with a Cohen\u2019s <em>d<\/em> effect size of 1.0 (<em>d<\/em>=(110-50)\/60).\r\n\r\nTake a look at the amount of overlap between the distributions in each example. Can you see why the Cohen\u2019s <em>d<\/em> effect size estimate is larger in the 2nd example? It has a lot less overlap between the distributions. What if the data distributions were completely separate with no amount of overlap? What would happen to the effect size then? It would increase quite a bit. What if nearly 100% of the data were overlapping? The effect size would shrink quite a bit and likely be trivial. This is of course if we are still using Cohen\u2019s <em>d<\/em> as our measure of effect size, since it is looking for difference. But what if we wanted to look for similarity?\r\n\r\n[caption id=\"attachment_688\" align=\"aligncenter\" width=\"1214\"]<img class=\"wp-image-688 size-full\" src=\"https:\/\/openbooks.library.unt.edu\/quantitativeanalysisinexerciseandsportscience\/wp-content\/uploads\/sites\/3\/2021\/06\/5.3.png\" alt=\"Figure depicting two density plots. The top one shows 2 distributions with a moderate amount of overlap and the bottom one shows two distributions with quite a bit less overlap.\" width=\"1214\" height=\"783\" \/> Figure 5.3 Two density plots demonstrating effect size magnitude with the amount of overlap between distributions. The top plot demonstrates an effect size (d) of 0.5 and the bottom demonstrates 1.0.[\/caption]\r\n<h2><em>r Effect Size Estimates<\/em><\/h2>\r\nWhen looking for similarity, the PPM <em>r <\/em>value should be used as an effect size. The fact that <em>r <\/em>is an effect size is a good thing, since we already know how to quantify it and use it from Chapter 3. Recall that <em>r<\/em> values can range from -1 to 0 to +1 and we have already seen how to interpret them in Chapter 3.\r\n\r\n[caption id=\"attachment_551\" align=\"aligncenter\" width=\"750\"]<img class=\"wp-image-551 size-full\" src=\"https:\/\/openbooks.library.unt.edu\/quantitativeanalysisinexerciseandsportscience\/wp-content\/uploads\/sites\/3\/2021\/06\/3.12.png\" alt=\"Venn diagram depicting a correlation\" width=\"750\" height=\"441\" \/> Figure 5.4 A reproduction of the Venn diagram representation of a correlation seen in Chapter 3[\/caption]\r\n\r\nWhen interpreting effect sizes for relationships or associations, we are still concerned with the amount of overlap in the data, and we\u2019ve seen this before in Figure 3.12 (now 5.4 in this chapter). When utilizing a bivariate correlation or simple linear regression with only 2 variables (which are represented by the ovals above), we are mostly concerned with the area where they have shared variance. That is represented as the area of overlap in this figure. Remember the coefficient of determination <em>r<sup>2<\/sup><\/em> helps us explain the amount of shared variance. Please review Table 3.4 (reproduced below) and go back to Chapter 3 if you need a refresher.\r\n<table class=\"aligncenter\" style=\"height: 120px;width: 90%\"><caption>Table 3.4 Coefficient of determination (<em>r<\/em><sup>2<\/sup>) interpretation<\/caption>\r\n<tbody>\r\n<tr style=\"height: 15px\">\r\n<th style=\"height: 15px;width: 91.2833px\" scope=\"col\"><em>r<\/em> value<\/th>\r\n<th style=\"height: 15px;width: 158.017px\" scope=\"col\">interpretation<\/th>\r\n<th style=\"height: 15px;width: 127.317px\" scope=\"col\"><em>r<\/em><sup>2<\/sup><\/th>\r\n<th style=\"height: 15px;width: 152.517px\" scope=\"col\">% of variation<\/th>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"height: 15px;width: 91.2833px\">0.0-0.09<\/td>\r\n<td style=\"height: 15px;width: 158.017px\">Trivial<\/td>\r\n<td style=\"height: 15px;width: 127.317px\">0.00-0.0081<\/td>\r\n<td style=\"height: 15px;width: 152.517px\">0-0.81%<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"height: 15px;width: 91.2833px\">0.1-0.29<\/td>\r\n<td style=\"height: 15px;width: 158.017px\">Small<\/td>\r\n<td style=\"height: 15px;width: 127.317px\">0.01-0.0841<\/td>\r\n<td style=\"height: 15px;width: 152.517px\">1-8.41%<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"height: 15px;width: 91.2833px\">0.3-0.49<\/td>\r\n<td style=\"height: 15px;width: 158.017px\">Moderate<\/td>\r\n<td style=\"height: 15px;width: 127.317px\">0.09-0.2401<\/td>\r\n<td style=\"height: 15px;width: 152.517px\">9-24.01%<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"height: 15px;width: 91.2833px\">0.5-0.69<\/td>\r\n<td style=\"height: 15px;width: 158.017px\">Large<\/td>\r\n<td style=\"height: 15px;width: 127.317px\">0.25-0.4761<\/td>\r\n<td style=\"height: 15px;width: 152.517px\">25-47.61%<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"height: 15px;width: 91.2833px\">0.7-0.89<\/td>\r\n<td style=\"height: 15px;width: 158.017px\">Very Large<\/td>\r\n<td style=\"height: 15px;width: 127.317px\">0.49-0.7921<\/td>\r\n<td style=\"height: 15px;width: 152.517px\">49-79.21%<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"height: 15px;width: 91.2833px\">0.9-0.99<\/td>\r\n<td style=\"height: 15px;width: 158.017px\">Nearly Perfect<\/td>\r\n<td style=\"height: 15px;width: 127.317px\">0.81-0.9801<\/td>\r\n<td style=\"height: 15px;width: 152.517px\">81-98.01%<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"height: 15px;width: 91.2833px\">1<\/td>\r\n<td style=\"height: 15px;width: 158.017px\">Perfect<\/td>\r\n<td style=\"height: 15px;width: 127.317px\">1<\/td>\r\n<td style=\"height: 15px;width: 152.517px\">100%<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nGoing back to our previous example with Cohen's <em>d<\/em> in Figure 5.3, we see the first example that produced a Cohen\u2019s <em>d<\/em> effect size estimate of 0.5 which is interpreted as a medium amount of difference between the variables. You might be thinking that you still see quite a bit of overlap in the data distributions. If there is a medium amount of difference between the variables, how much similarity is there? Intuitively, you might think that there is a medium amount of similarity, but that is incorrect as you can see in the scatterplot prodcued below the distributions (in Figure 5.5 below). In fact, it has a PPM <em>r<\/em> value of 0.053 which is considered a trivial amount of similarity. This demonstrates that difference and similarity are not necessarily on the same continuum. Just because there isn\u2019t a large difference between two variables, they aren\u2019t necessarily related.\r\n\r\n[caption id=\"attachment_692\" align=\"aligncenter\" width=\"970\"]<img class=\"wp-image-692 size-full\" src=\"https:\/\/openbooks.library.unt.edu\/quantitativeanalysisinexerciseandsportscience\/wp-content\/uploads\/sites\/3\/2021\/06\/5.5.png\" alt=\"Image of the top density plot from Figure 5.3 coupled with a scatter plot of the same data showing no relationship between the pre and post test data.\" width=\"970\" height=\"814\" \/> Figure 5.5 The top density plot from Figure 5.3 showing a Cohen's d of 0.5 along with the same data depicted as a scatter plot below (r = 0.053) demonstrating no relationship.[\/caption]\r\n\r\nThis also demonstrates why we use specific data visualization techniques with each statistical test. Whenever you are testing for difference, you should likely use a box and whisker plot, density plot, or distribution plot. Whenever you are testing for similarity, you should use a scatterplot. If the wrong plot is used, you may confuse or mislead your audience.\r\n<h1>Calculating Effect Sizes<\/h1>\r\nFortunately, our <em>r<\/em> values are already effect size measurements, so we don\u2019t need to do anything else to them. However, whenever testing for difference, we still need to calculate Cohen\u2019s <em>d<\/em> effect size estimates. These can be produced relatively easy in most statistical programs. Some are as simple as checking a box, but others may require the formula be typed in. As always, you can <a href=\"https:\/\/drive.google.com\/file\/d\/1AnfTz9nUfJFmX86JqVJ7PYfqOo977Zxl\/view?usp=sharing\">download the dataset<\/a> if you would like to follow along with the tutorials below. This dataset includes performance data from jump testing, 30 yard sprints, and 1RM bench press. The sample was grouped based on bench press strength with those above 200 lbs in the \"stronger\" group and those below in the \"weaker\" group. This puts 10 into each group. An independent samples <em>t<\/em> test on the 30 yd sprint time variable produces a <em>p<\/em> value of 0.018, so we know there is a statistical difference between the 2 stronger and weaker groups. Calculating a Cohen's <em>d<\/em> effect size will help understand how meaningful this difference is.\r\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\r\n<h3 class=\"textbox__title\"><span style=\"color: #ffffff\">Calculating Cohen's <em>d<\/em> in MS Excel<\/span><\/h3>\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\nUnfortunately, there is no built-in function in Excel or option in the Data Analysis Toolpak for Cohen's <em>d<\/em>. So it must be calculated. If we go back to the equation (<em>d<\/em>=(M<sub>1<\/sub>-M<sub>2<\/sub>)\/sd<sub>pooled<\/sub>), we know that we first need to calculate the means for each group and then the pooled standard deviation. MS Excel does have functions for those, so you can run each of those individually, or if you are looking for an added challenge, you can calculate them inside of the Cohen's <em>d<\/em> equation. Either way, make sure you use parentheses so that your order of operations does not alter your answer.\r\n\r\nThe mean of the weaker group should be 4.15 s and the mean of the stronger group should be 3.86 s. So we know that the stronger group also seems to be faster. The pooled standard deviation should be something close to 0.2837252 (using the =STDEV.S function). Remember do not round prior to calculating the final answer. This may alter your result. Instead of typing these values in, you should click each cell where the value is calculated. For example, my mean for the weaker group was calculated in F4, the mean for the stronger group was calculated in F5, and the pooled standard deviation in F6. So my formula was:\r\n<p style=\"text-align: center\"><em>=(F4-F5)\/F6<\/em><\/p>\r\nand this produced a Cohen's <em>d<\/em> effect size of 1.02 (rounded). If you wanted to do all the calculations in one cell the formula should read:\r\n<p style=\"text-align: center\"><em> =(AVERAGE(E2:E11)-AVERAGE(E12:E21))\/STDEV.S(E2:E21)<\/em><\/p>\r\n<p style=\"text-align: left\">Either way should work, but there is more opportunity for error when combining multiple steps, so only attempt that if you are comfortable.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\r\n<h3 class=\"textbox__title\"><span style=\"color: #ffffff\">Calculating Cohen's <em>d<\/em> in JASP<\/span><\/h3>\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\nA Cohen's <em>d <\/em>effect size (or several other types of effect size) can be added to any means comparison test by checking a box to do so. For <em>t<\/em> tests, this can be found under \"Additional Statistics.\" After checking the Effect Size box, Cohen's\u00a0<em>d<\/em> is already selected by default. Once checked, it appears in the tabular results output.\r\n<div style=\"padding: 0px 7.2px 0px 0px;text-align: start;float: none;margin: 1.2px 7.2px 7.2px 0px\">\r\n<div>\r\n<div>\r\n<table class=\"aligncenter\" style=\"border-collapse: collapse;float: none;border: 0px none #808080;width: 90%\"><caption>Table 5.1 JASP <em>t<\/em>-test output with Cohen's <em>d<\/em><\/caption>\r\n<thead>\r\n<tr>\r\n<th style=\"border-collapse: collapse;border-color: #000000;border-style: none none solid;float: none;text-align: left;border-width: 0px 0px 1px 0px\" colspan=\"10\">\r\n<div>\r\n<div>\r\n\r\nIndependent Samples T-Test\r\n<div><\/div>\r\n<div><\/div>\r\n<\/div>\r\n<\/div><\/th>\r\n<\/tr>\r\n<tr>\r\n<th style=\"border-collapse: collapse;border-color: #000000;border-style: none none solid;float: none;text-align: center;border-width: 0px 0px 1px\" colspan=\"2\" scope=\"col\"><\/th>\r\n<th style=\"border-collapse: collapse;border-color: #000000;border-style: none none solid;float: none;text-align: center;border-width: 0px 0px 1px\" colspan=\"2\" scope=\"col\">t<\/th>\r\n<th style=\"border-collapse: collapse;border-color: #000000;border-style: none none solid;float: none;text-align: center;border-width: 0px 0px 1px\" colspan=\"2\" scope=\"col\">df<\/th>\r\n<th style=\"border-collapse: collapse;border-color: #000000;border-style: none none solid;float: none;text-align: center;border-width: 0px 0px 1px\" colspan=\"2\" scope=\"col\">p<\/th>\r\n<th style=\"border-collapse: collapse;border-color: #000000;border-style: none none solid;float: none;text-align: center;border-width: 0px 0px 1px\" colspan=\"2\" scope=\"col\">Cohen's d<\/th>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td style=\"border-collapse: collapse;float: none;text-align: left;border: 0px none #000000\">V30yd<\/td>\r\n<td style=\"border-collapse: collapse;float: none;text-align: left;border: 0px none #000000\"><\/td>\r\n<td style=\"border-collapse: collapse;float: none;text-align: right;border: 0px none #000000\">-2.612<\/td>\r\n<td style=\"border-collapse: collapse;float: none;text-align: left;border: 0px none #000000\"><\/td>\r\n<td style=\"border-collapse: collapse;float: none;text-align: right;border: 0px none #000000\">18<\/td>\r\n<td style=\"border-collapse: collapse;float: none;text-align: left;border: 0px none #000000\"><\/td>\r\n<td style=\"border-collapse: collapse;float: none;text-align: right;border: 0px none #000000\">0.018<\/td>\r\n<td style=\"border-collapse: collapse;float: none;text-align: left;border: 0px none #000000\"><\/td>\r\n<td style=\"border-collapse: collapse;float: none;text-align: right;border: 0px none #000000\">-1.168<\/td>\r\n<td style=\"border-collapse: collapse;float: none;text-align: left;border: 0px none #000000\"><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<tfoot>\r\n<tr>\r\n<td style=\"border-collapse: collapse;float: none;text-align: left;border: 0px none #000000\" colspan=\"10\"><em> Note. <\/em> \u00a0Student's t-test.<\/td>\r\n<\/tr>\r\n<\/tfoot>\r\n<\/table>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\nIf you also followed along with the solution in MS Excel, you may notice that the value is different. First, it's negative. This is because JASP used the stronger group as M<sub>1<\/sub> instead of the weaker group as was done above. This simply means the sign is flipped and it can be dropped. The other difference between these values comes from how the pooled standard deviation was calculated. In Excel, it was a true pooled standard deviation since all the values were highlighted in the formula. Another way to calculate the sd<sub>pooled<\/sub> is by using an estimate equation where the square of each group's sd is added together, divided by 2, and then the square root of that value represents the sd<sub>pooled<\/sub>. This is how JASP calculates the sd<sub>pooled<\/sub>. This estimation is helpful for calculating a sd<sub>pooled<\/sub> when you don't have access to raw data, but do have access to group summary statistics as is often the case when reading a published article.\r\n<p style=\"text-align: center\"><em>sd<sub>pooled<\/sub> = \u221a((sd<sub>1<\/sub><sup>2<\/sup> +sd<sub>2<\/sub><sup>2<\/sup>)\/2)<\/em><\/p>\r\nCalculation of a Cohen's <em>d<\/em> for an ANOVA in JASP, is completed in the post-hoc stage. Again, the Effect Size box must be checked and it will be added to the post-hoc results output.\r\n\r\n<\/div>\r\n<\/div>\r\n<h6>Other Types of Effect Sizes<\/h6>\r\nThere are quite a few other forms of effect sizes that can be used and should be with specific scenarios. Cohen's <em>d<\/em> and the PPM <em>r<\/em> value are the most common and they are the most widely understood, so it is easy to see why many prefer them. No matter what effect size is used, it is important to remember that they are estimates of practical significance and should be reported along side\u00a0<em>p<\/em> values that provide a measure of statistical significance. Relying too much on one of the values may lead to a misinterpretation of the findings.\r\n\r\n&nbsp;","rendered":"<figure id=\"attachment_678\" aria-describedby=\"caption-attachment-678\" style=\"width: 1895px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-678 size-full\" src=\"https:\/\/openbooks.library.unt.edu\/quantitativeanalysisinexerciseandsportscience\/wp-content\/uploads\/sites\/3\/2021\/06\/5.1.png\" alt=\"Screenshot from Thesaurus.com showng synonyms of the word significant\" width=\"1895\" height=\"733\" srcset=\"https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/5.1.png 1895w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/5.1-300x116.png 300w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/5.1-1024x396.png 1024w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/5.1-768x297.png 768w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/5.1-1536x594.png 1536w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/5.1-65x25.png 65w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/5.1-225x87.png 225w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/5.1-350x135.png 350w\" sizes=\"auto, (max-width: 1895px) 100vw, 1895px\" \/><figcaption id=\"caption-attachment-678\" class=\"wp-caption-text\">Figure 5.1. Screenshot of synonyms of the word \"significant\" from www.thesaurus.com<\/figcaption><\/figure>\n<p>We have heard the phrase statistical significance used quite a bit already when we are discussing a statistical finding that has a <em>p<\/em> value of less than or equal to 0.05. Often in research and academic journal articles, the phrase statistical significance is shortened to just state that it was a \"significant finding.\" This is problematic for many who don\u2019t understand that the word significant in a statistical perspective may be different than the more straightforward and widely used definition for significance that most people use. For example, consider a study that demonstrates a statistically significant difference between weight loss outcomes in two groups. One that took a weight loss supplement and one that did not. This may be described as a \u201dsignificant\u201d difference in outcomes between the groups because the p value was less than 0.05. Think about how the supplement company might use that for marketing purposes. They might say something like \u201close significantly more weight with our product\u201d or any of the other synonyms shown above that comes from <a href=\"https:\/\/www.thesaurus.com\/browse\/significant\">thesaurus.com<\/a>. But that isn\u2019t exactly what statistical significance means. If we have a large enough sample, we can find a statistically significant difference even when that difference is a small one (demonstrated at the end of the last chapter). So, we should also be concerned with practical differences.<\/p>\n<p>Another way to describe practical difference is meaningfulness.\u00a0 How meaningful is the statistical difference we found? This information is what is probably most important to the consumer or end user, but it might not be the easiest to spin for marketing. Whenever we perform statistical analyses, whether they are looking for similarity (i.e. correlation) or difference (i.e. t test or ANOVA), we should also include a measure of meaningfulness.<\/p>\n<p>Meaningfulness can be quantified by effect size estimates. We will discuss two main types of effect sizes, one of which you are actually already familiar with.<\/p>\n<div class=\"textbox textbox--learning-objectives\">\n<header class=\"textbox__header\">\n<h2 class=\"textbox__title\"><span style=\"color: #ffffff\">Chapter Learning Objectives<\/span><\/h2>\n<\/header>\n<div class=\"textbox__content\">\n<ul>\n<li>\n<div>Discuss meaningfulness as it applies to quantitative analysis<\/div>\n<\/li>\n<li>\n<div>Learn how to quantify and interpret effect size estimates<\/div>\n<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<h1>Effect Sizes<\/h1>\n<p>Effect sizes, or more appropriately effect size estimates, help us statistically quantify practical significance. Effect sizes have also been described as a measure of \"meaningfulness.\"<a class=\"footnote\" title=\"Ellis, P. (2010). The Essential Guide to Effect Sizes: Statistical Power, Meta-Analysis, and Interpretation of Results. Cambridge, NY. Cambridge University Press.\" id=\"return-footnote-396-1\" href=\"#footnote-396-1\" aria-label=\"Footnote 1\"><sup class=\"footnote\">[1]<\/sup><\/a> From a purely practical and applied perspective, the effect size should be the primary outcome of research inquiry.<a class=\"footnote\" title=\"Cohen J. (1990). Things I have learned (so far) Am Psychol. 45:1304\u20131312.\" id=\"return-footnote-396-2\" href=\"#footnote-396-2\" aria-label=\"Footnote 2\"><sup class=\"footnote\">[2]<\/sup><\/a> No matter what we are measuring, we are likely most interested in the size of the difference or relatedness between variables. The reliability of our finding or how confident we are in the finding is important also, but it shouldn\u2019t be the only product.<\/p>\n<p>This has become such a problem in many fields of research that more than 800 researchers recently published a paper condemning the usage of <em>p<\/em> values since many rely too heavily on it and not enough on practical measures of significance that demonstrate realistic effects.<a class=\"footnote\" title=\"Nature 567:305-307 (2019). https:\/\/www.nature.com\/articles\/d41586-019-00857-9?fbclid=IwAR1jzbGpWu9wsHIwBdOu3byOielCLEQxPZMvHJ-3X4GW2gvy4eD98a7a9EU\" id=\"return-footnote-396-3\" href=\"#footnote-396-3\" aria-label=\"Footnote 3\"><sup class=\"footnote\">[3]<\/sup><\/a> Completely abandoning measures of statistical significance like <em>p<\/em> values is probably a little too extreme, but their point is well taken. We should be publishing measures of both statistical and practical significance or meaningfulness.<\/p>\n<p>So, let\u2019s define effect size. An<a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_396_682\"> effect size<\/a> is a measure that describes the magnitude or size of the difference or relatedness between the variables we are measuring. This means that it is describing the practical\/meaningful significance. As we will see in this chapter, it really all boils down to how much the data overlaps between our groups or variables. If we are comparing groups for differences, we would likely not expect to see much overlap in the data. If we were evaluating our data for similarity or association, we are looking for more overlap.<\/p>\n<h1>Types of Effect Sizes<\/h1>\n<p>As mentioned above, effect sizes can be created for both relatedness and difference tests. This essentially means that there are 2 \u201cfamilies\u201d of effect sizes; <em>r<\/em> for relatedness and <em>d<\/em> for difference. Conversions can also be completed between the two if it was necessary to make comparisons across different study types. That won't be demonstrated in this chapter, but a quick internet search for \"<em>r<\/em> to <em>d<\/em> effect size conversion,\" or vice versa, will bring up several online calculators and formulas.<\/p>\n<h2>Cohen's <em>d<\/em> Effect Size Estimates<\/h2>\n<p>Looking on the difference side of effect sizes (ES), Cohen\u2019s <em>d<\/em> effect size estimates are most commonly seen. They are named after Jacob Cohen. They can be quickly calculated when both means and the pooled standard deviation is known. The means can be from independent or paired groups. The pooled standard deviation is the standard deviation calculated from both groups as one.<\/p>\n<p style=\"text-align: center\">[asciimath]\"Cohen's d ES\" = (M_1 - M_2)\/\"sd\"_\"pooled\"[\/asciimath]<\/p>\n<p style=\"text-align: center\">[asciimath]M = mean, sd = \"standard deviation\"[\/asciimath]<\/p>\n<p>With this data, we simply subtract one mean from the other and divide by the pooled standard deviation. If the value is negative, that simply means that the second mean or the one that was subtracted from the other was larger than the original mean. This is dependent on how the dataset was ordered or how they were called into the formula. So, the direction doesn\u2019t really matter here since the magnitude is the largest concern. For this reason, the negative sign is usually not included. Effect size estimates can be interpreted with the scale provided. Anything less than 0.2 is considered a trivial difference, 0.2 to 0.49 is a small difference, 0.5 to 0.79 is a medium difference, and anything \u2265 0.8 is a large difference.<\/p>\n<figure id=\"attachment_686\" aria-describedby=\"caption-attachment-686\" style=\"width: 446px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-686 size-full\" src=\"https:\/\/openbooks.library.unt.edu\/quantitativeanalysisinexerciseandsportscience\/wp-content\/uploads\/sites\/3\/2021\/06\/Screen-Shot-2021-07-15-at-2.35.52-PM.png\" alt=\"Figure 5.2 T-shirt size description for Cohen's d effect size estimate interpretation\" width=\"446\" height=\"190\" srcset=\"https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/Screen-Shot-2021-07-15-at-2.35.52-PM.png 446w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/Screen-Shot-2021-07-15-at-2.35.52-PM-300x128.png 300w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/Screen-Shot-2021-07-15-at-2.35.52-PM-65x28.png 65w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/Screen-Shot-2021-07-15-at-2.35.52-PM-225x96.png 225w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/Screen-Shot-2021-07-15-at-2.35.52-PM-350x149.png 350w\" sizes=\"auto, (max-width: 446px) 100vw, 446px\" \/><figcaption id=\"caption-attachment-686\" class=\"wp-caption-text\">Figure 5.2 T-shirt size description for Cohen's d effect size estimate interpretation<\/figcaption><\/figure>\n<p>As mentioned previously, effect sizes really all boil down to the amount of overlap in the data. Figure 5.3 demonstrates this with 2 examples. The first one (on top) is a comparison of 2 groups. One distribution has a mean of 50, the other has a mean of 55, and the pooled standard deviation is 10. Solving for the Cohen\u2019s <em>d<\/em> effect size, we find that there is a medium difference between these 2 groups with a value of 0.5 (<em>d<\/em>=(55-50)\/10). The second example has a distribution with a mean of 50, a distribution with a mean of 110, and a pooled standard deviation of 60. Here we find a large difference with a Cohen\u2019s <em>d<\/em> effect size of 1.0 (<em>d<\/em>=(110-50)\/60).<\/p>\n<p>Take a look at the amount of overlap between the distributions in each example. Can you see why the Cohen\u2019s <em>d<\/em> effect size estimate is larger in the 2nd example? It has a lot less overlap between the distributions. What if the data distributions were completely separate with no amount of overlap? What would happen to the effect size then? It would increase quite a bit. What if nearly 100% of the data were overlapping? The effect size would shrink quite a bit and likely be trivial. This is of course if we are still using Cohen\u2019s <em>d<\/em> as our measure of effect size, since it is looking for difference. But what if we wanted to look for similarity?<\/p>\n<figure id=\"attachment_688\" aria-describedby=\"caption-attachment-688\" style=\"width: 1214px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-688 size-full\" src=\"https:\/\/openbooks.library.unt.edu\/quantitativeanalysisinexerciseandsportscience\/wp-content\/uploads\/sites\/3\/2021\/06\/5.3.png\" alt=\"Figure depicting two density plots. The top one shows 2 distributions with a moderate amount of overlap and the bottom one shows two distributions with quite a bit less overlap.\" width=\"1214\" height=\"783\" srcset=\"https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/5.3.png 1214w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/5.3-300x193.png 300w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/5.3-1024x660.png 1024w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/5.3-768x495.png 768w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/5.3-65x42.png 65w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/5.3-225x145.png 225w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/5.3-350x226.png 350w\" sizes=\"auto, (max-width: 1214px) 100vw, 1214px\" \/><figcaption id=\"caption-attachment-688\" class=\"wp-caption-text\">Figure 5.3 Two density plots demonstrating effect size magnitude with the amount of overlap between distributions. The top plot demonstrates an effect size (d) of 0.5 and the bottom demonstrates 1.0.<\/figcaption><\/figure>\n<h2><em>r Effect Size Estimates<\/em><\/h2>\n<p>When looking for similarity, the PPM <em>r <\/em>value should be used as an effect size. The fact that <em>r <\/em>is an effect size is a good thing, since we already know how to quantify it and use it from Chapter 3. Recall that <em>r<\/em> values can range from -1 to 0 to +1 and we have already seen how to interpret them in Chapter 3.<\/p>\n<figure id=\"attachment_551\" aria-describedby=\"caption-attachment-551\" style=\"width: 750px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-551 size-full\" src=\"https:\/\/openbooks.library.unt.edu\/quantitativeanalysisinexerciseandsportscience\/wp-content\/uploads\/sites\/3\/2021\/06\/3.12.png\" alt=\"Venn diagram depicting a correlation\" width=\"750\" height=\"441\" srcset=\"https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/3.12.png 750w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/3.12-300x176.png 300w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/3.12-65x38.png 65w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/3.12-225x132.png 225w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/3.12-350x206.png 350w\" sizes=\"auto, (max-width: 750px) 100vw, 750px\" \/><figcaption id=\"caption-attachment-551\" class=\"wp-caption-text\">Figure 5.4 A reproduction of the Venn diagram representation of a correlation seen in Chapter 3<\/figcaption><\/figure>\n<p>When interpreting effect sizes for relationships or associations, we are still concerned with the amount of overlap in the data, and we\u2019ve seen this before in Figure 3.12 (now 5.4 in this chapter). When utilizing a bivariate correlation or simple linear regression with only 2 variables (which are represented by the ovals above), we are mostly concerned with the area where they have shared variance. That is represented as the area of overlap in this figure. Remember the coefficient of determination <em>r<sup>2<\/sup><\/em> helps us explain the amount of shared variance. Please review Table 3.4 (reproduced below) and go back to Chapter 3 if you need a refresher.<\/p>\n<table class=\"aligncenter\" style=\"height: 120px;width: 90%\">\n<caption>Table 3.4 Coefficient of determination (<em>r<\/em><sup>2<\/sup>) interpretation<\/caption>\n<tbody>\n<tr style=\"height: 15px\">\n<th style=\"height: 15px;width: 91.2833px\" scope=\"col\"><em>r<\/em> value<\/th>\n<th style=\"height: 15px;width: 158.017px\" scope=\"col\">interpretation<\/th>\n<th style=\"height: 15px;width: 127.317px\" scope=\"col\"><em>r<\/em><sup>2<\/sup><\/th>\n<th style=\"height: 15px;width: 152.517px\" scope=\"col\">% of variation<\/th>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"height: 15px;width: 91.2833px\">0.0-0.09<\/td>\n<td style=\"height: 15px;width: 158.017px\">Trivial<\/td>\n<td style=\"height: 15px;width: 127.317px\">0.00-0.0081<\/td>\n<td style=\"height: 15px;width: 152.517px\">0-0.81%<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"height: 15px;width: 91.2833px\">0.1-0.29<\/td>\n<td style=\"height: 15px;width: 158.017px\">Small<\/td>\n<td style=\"height: 15px;width: 127.317px\">0.01-0.0841<\/td>\n<td style=\"height: 15px;width: 152.517px\">1-8.41%<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"height: 15px;width: 91.2833px\">0.3-0.49<\/td>\n<td style=\"height: 15px;width: 158.017px\">Moderate<\/td>\n<td style=\"height: 15px;width: 127.317px\">0.09-0.2401<\/td>\n<td style=\"height: 15px;width: 152.517px\">9-24.01%<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"height: 15px;width: 91.2833px\">0.5-0.69<\/td>\n<td style=\"height: 15px;width: 158.017px\">Large<\/td>\n<td style=\"height: 15px;width: 127.317px\">0.25-0.4761<\/td>\n<td style=\"height: 15px;width: 152.517px\">25-47.61%<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"height: 15px;width: 91.2833px\">0.7-0.89<\/td>\n<td style=\"height: 15px;width: 158.017px\">Very Large<\/td>\n<td style=\"height: 15px;width: 127.317px\">0.49-0.7921<\/td>\n<td style=\"height: 15px;width: 152.517px\">49-79.21%<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"height: 15px;width: 91.2833px\">0.9-0.99<\/td>\n<td style=\"height: 15px;width: 158.017px\">Nearly Perfect<\/td>\n<td style=\"height: 15px;width: 127.317px\">0.81-0.9801<\/td>\n<td style=\"height: 15px;width: 152.517px\">81-98.01%<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"height: 15px;width: 91.2833px\">1<\/td>\n<td style=\"height: 15px;width: 158.017px\">Perfect<\/td>\n<td style=\"height: 15px;width: 127.317px\">1<\/td>\n<td style=\"height: 15px;width: 152.517px\">100%<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Going back to our previous example with Cohen's <em>d<\/em> in Figure 5.3, we see the first example that produced a Cohen\u2019s <em>d<\/em> effect size estimate of 0.5 which is interpreted as a medium amount of difference between the variables. You might be thinking that you still see quite a bit of overlap in the data distributions. If there is a medium amount of difference between the variables, how much similarity is there? Intuitively, you might think that there is a medium amount of similarity, but that is incorrect as you can see in the scatterplot prodcued below the distributions (in Figure 5.5 below). In fact, it has a PPM <em>r<\/em> value of 0.053 which is considered a trivial amount of similarity. This demonstrates that difference and similarity are not necessarily on the same continuum. Just because there isn\u2019t a large difference between two variables, they aren\u2019t necessarily related.<\/p>\n<figure id=\"attachment_692\" aria-describedby=\"caption-attachment-692\" style=\"width: 970px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-692 size-full\" src=\"https:\/\/openbooks.library.unt.edu\/quantitativeanalysisinexerciseandsportscience\/wp-content\/uploads\/sites\/3\/2021\/06\/5.5.png\" alt=\"Image of the top density plot from Figure 5.3 coupled with a scatter plot of the same data showing no relationship between the pre and post test data.\" width=\"970\" height=\"814\" srcset=\"https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/5.5.png 970w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/5.5-300x252.png 300w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/5.5-768x644.png 768w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/5.5-65x55.png 65w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/5.5-225x189.png 225w, https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-content\/uploads\/sites\/3\/2021\/06\/5.5-350x294.png 350w\" sizes=\"auto, (max-width: 970px) 100vw, 970px\" \/><figcaption id=\"caption-attachment-692\" class=\"wp-caption-text\">Figure 5.5 The top density plot from Figure 5.3 showing a Cohen's d of 0.5 along with the same data depicted as a scatter plot below (r = 0.053) demonstrating no relationship.<\/figcaption><\/figure>\n<p>This also demonstrates why we use specific data visualization techniques with each statistical test. Whenever you are testing for difference, you should likely use a box and whisker plot, density plot, or distribution plot. Whenever you are testing for similarity, you should use a scatterplot. If the wrong plot is used, you may confuse or mislead your audience.<\/p>\n<h1>Calculating Effect Sizes<\/h1>\n<p>Fortunately, our <em>r<\/em> values are already effect size measurements, so we don\u2019t need to do anything else to them. However, whenever testing for difference, we still need to calculate Cohen\u2019s <em>d<\/em> effect size estimates. These can be produced relatively easy in most statistical programs. Some are as simple as checking a box, but others may require the formula be typed in. As always, you can <a href=\"https:\/\/drive.google.com\/file\/d\/1AnfTz9nUfJFmX86JqVJ7PYfqOo977Zxl\/view?usp=sharing\">download the dataset<\/a> if you would like to follow along with the tutorials below. This dataset includes performance data from jump testing, 30 yard sprints, and 1RM bench press. The sample was grouped based on bench press strength with those above 200 lbs in the \"stronger\" group and those below in the \"weaker\" group. This puts 10 into each group. An independent samples <em>t<\/em> test on the 30 yd sprint time variable produces a <em>p<\/em> value of 0.018, so we know there is a statistical difference between the 2 stronger and weaker groups. Calculating a Cohen's <em>d<\/em> effect size will help understand how meaningful this difference is.<\/p>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<h3 class=\"textbox__title\"><span style=\"color: #ffffff\">Calculating Cohen's <em>d<\/em> in MS Excel<\/span><\/h3>\n<\/header>\n<div class=\"textbox__content\">\n<p>Unfortunately, there is no built-in function in Excel or option in the Data Analysis Toolpak for Cohen's <em>d<\/em>. So it must be calculated. If we go back to the equation (<em>d<\/em>=(M<sub>1<\/sub>-M<sub>2<\/sub>)\/sd<sub>pooled<\/sub>), we know that we first need to calculate the means for each group and then the pooled standard deviation. MS Excel does have functions for those, so you can run each of those individually, or if you are looking for an added challenge, you can calculate them inside of the Cohen's <em>d<\/em> equation. Either way, make sure you use parentheses so that your order of operations does not alter your answer.<\/p>\n<p>The mean of the weaker group should be 4.15 s and the mean of the stronger group should be 3.86 s. So we know that the stronger group also seems to be faster. The pooled standard deviation should be something close to 0.2837252 (using the =STDEV.S function). Remember do not round prior to calculating the final answer. This may alter your result. Instead of typing these values in, you should click each cell where the value is calculated. For example, my mean for the weaker group was calculated in F4, the mean for the stronger group was calculated in F5, and the pooled standard deviation in F6. So my formula was:<\/p>\n<p style=\"text-align: center\"><em>=(F4-F5)\/F6<\/em><\/p>\n<p>and this produced a Cohen's <em>d<\/em> effect size of 1.02 (rounded). If you wanted to do all the calculations in one cell the formula should read:<\/p>\n<p style=\"text-align: center\"><em> =(AVERAGE(E2:E11)-AVERAGE(E12:E21))\/STDEV.S(E2:E21)<\/em><\/p>\n<p style=\"text-align: left\">Either way should work, but there is more opportunity for error when combining multiple steps, so only attempt that if you are comfortable.<\/p>\n<\/div>\n<\/div>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<h3 class=\"textbox__title\"><span style=\"color: #ffffff\">Calculating Cohen's <em>d<\/em> in JASP<\/span><\/h3>\n<\/header>\n<div class=\"textbox__content\">\n<p>A Cohen's <em>d <\/em>effect size (or several other types of effect size) can be added to any means comparison test by checking a box to do so. For <em>t<\/em> tests, this can be found under \"Additional Statistics.\" After checking the Effect Size box, Cohen's\u00a0<em>d<\/em> is already selected by default. Once checked, it appears in the tabular results output.<\/p>\n<div style=\"padding: 0px 7.2px 0px 0px;text-align: start;float: none;margin: 1.2px 7.2px 7.2px 0px\">\n<div>\n<div>\n<table class=\"aligncenter\" style=\"border-collapse: collapse;float: none;border: 0px none #808080;width: 90%\">\n<caption>Table 5.1 JASP <em>t<\/em>-test output with Cohen's <em>d<\/em><\/caption>\n<thead>\n<tr>\n<th style=\"border-collapse: collapse;border-color: #000000;border-style: none none solid;float: none;text-align: left;border-width: 0px 0px 1px 0px\" colspan=\"10\">\n<div>\n<div>\n<p>Independent Samples T-Test<\/p>\n<div><\/div>\n<div><\/div>\n<\/div>\n<\/div>\n<\/th>\n<\/tr>\n<tr>\n<th style=\"border-collapse: collapse;border-color: #000000;border-style: none none solid;float: none;text-align: center;border-width: 0px 0px 1px\" colspan=\"2\" scope=\"col\"><\/th>\n<th style=\"border-collapse: collapse;border-color: #000000;border-style: none none solid;float: none;text-align: center;border-width: 0px 0px 1px\" colspan=\"2\" scope=\"col\">t<\/th>\n<th style=\"border-collapse: collapse;border-color: #000000;border-style: none none solid;float: none;text-align: center;border-width: 0px 0px 1px\" colspan=\"2\" scope=\"col\">df<\/th>\n<th style=\"border-collapse: collapse;border-color: #000000;border-style: none none solid;float: none;text-align: center;border-width: 0px 0px 1px\" colspan=\"2\" scope=\"col\">p<\/th>\n<th style=\"border-collapse: collapse;border-color: #000000;border-style: none none solid;float: none;text-align: center;border-width: 0px 0px 1px\" colspan=\"2\" scope=\"col\">Cohen's d<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"border-collapse: collapse;float: none;text-align: left;border: 0px none #000000\">V30yd<\/td>\n<td style=\"border-collapse: collapse;float: none;text-align: left;border: 0px none #000000\"><\/td>\n<td style=\"border-collapse: collapse;float: none;text-align: right;border: 0px none #000000\">-2.612<\/td>\n<td style=\"border-collapse: collapse;float: none;text-align: left;border: 0px none #000000\"><\/td>\n<td style=\"border-collapse: collapse;float: none;text-align: right;border: 0px none #000000\">18<\/td>\n<td style=\"border-collapse: collapse;float: none;text-align: left;border: 0px none #000000\"><\/td>\n<td style=\"border-collapse: collapse;float: none;text-align: right;border: 0px none #000000\">0.018<\/td>\n<td style=\"border-collapse: collapse;float: none;text-align: left;border: 0px none #000000\"><\/td>\n<td style=\"border-collapse: collapse;float: none;text-align: right;border: 0px none #000000\">-1.168<\/td>\n<td style=\"border-collapse: collapse;float: none;text-align: left;border: 0px none #000000\"><\/td>\n<\/tr>\n<\/tbody>\n<tfoot>\n<tr>\n<td style=\"border-collapse: collapse;float: none;text-align: left;border: 0px none #000000\" colspan=\"10\"><em> Note. <\/em> \u00a0Student's t-test.<\/td>\n<\/tr>\n<\/tfoot>\n<\/table>\n<\/div>\n<\/div>\n<\/div>\n<p>If you also followed along with the solution in MS Excel, you may notice that the value is different. First, it's negative. This is because JASP used the stronger group as M<sub>1<\/sub> instead of the weaker group as was done above. This simply means the sign is flipped and it can be dropped. The other difference between these values comes from how the pooled standard deviation was calculated. In Excel, it was a true pooled standard deviation since all the values were highlighted in the formula. Another way to calculate the sd<sub>pooled<\/sub> is by using an estimate equation where the square of each group's sd is added together, divided by 2, and then the square root of that value represents the sd<sub>pooled<\/sub>. This is how JASP calculates the sd<sub>pooled<\/sub>. This estimation is helpful for calculating a sd<sub>pooled<\/sub> when you don't have access to raw data, but do have access to group summary statistics as is often the case when reading a published article.<\/p>\n<p style=\"text-align: center\"><em>sd<sub>pooled<\/sub> = \u221a((sd<sub>1<\/sub><sup>2<\/sup> +sd<sub>2<\/sub><sup>2<\/sup>)\/2)<\/em><\/p>\n<p>Calculation of a Cohen's <em>d<\/em> for an ANOVA in JASP, is completed in the post-hoc stage. Again, the Effect Size box must be checked and it will be added to the post-hoc results output.<\/p>\n<\/div>\n<\/div>\n<h6>Other Types of Effect Sizes<\/h6>\n<p>There are quite a few other forms of effect sizes that can be used and should be with specific scenarios. Cohen's <em>d<\/em> and the PPM <em>r<\/em> value are the most common and they are the most widely understood, so it is easy to see why many prefer them. No matter what effect size is used, it is important to remember that they are estimates of practical significance and should be reported along side\u00a0<em>p<\/em> values that provide a measure of statistical significance. Relying too much on one of the values may lead to a misinterpretation of the findings.<\/p>\n<p>&nbsp;<\/p>\n<hr class=\"before-footnotes clear\" \/><div class=\"footnotes\"><ol><li id=\"footnote-396-1\">Ellis, P. (2010). The Essential Guide to Effect Sizes: Statistical Power, Meta-Analysis, and Interpretation of Results. Cambridge, NY. Cambridge University Press. <a href=\"#return-footnote-396-1\" class=\"return-footnote\" aria-label=\"Return to footnote 1\">&crarr;<\/a><\/li><li id=\"footnote-396-2\">Cohen J. (1990). Things I have learned (so far) Am Psychol. 45:1304\u20131312. <a href=\"#return-footnote-396-2\" class=\"return-footnote\" aria-label=\"Return to footnote 2\">&crarr;<\/a><\/li><li id=\"footnote-396-3\">Nature 567:305-307 (2019). <a href=\"https:\/\/www.nature.com\/articles\/d41586-019-00857-9?fbclid=IwAR1jzbGpWu9wsHIwBdOu3byOielCLEQxPZMvHJ-3X4GW2gvy4eD98a7a9EU\">https:\/\/www.nature.com\/articles\/d41586-019-00857-9?fbclid=IwAR1jzbGpWu9wsHIwBdOu3byOielCLEQxPZMvHJ-3X4GW2gvy4eD98a7a9EU<\/a> <a href=\"#return-footnote-396-3\" class=\"return-footnote\" aria-label=\"Return to footnote 3\">&crarr;<\/a><\/li><\/ol><\/div><div class=\"glossary\"><span class=\"screen-reader-text\" id=\"definition\">definition<\/span><template id=\"term_396_682\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_396_682\"><div tabindex=\"-1\"><p>a measure the describes the magnitude or size of the difference or relatedness between the variables we are measuring<\/p>\n<\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><\/div>","protected":false},"author":15,"menu_order":5,"template":"","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":["cbailey"],"pb_section_license":""},"chapter-type":[],"contributor":[59],"license":[],"class_list":["post-396","chapter","type-chapter","status-publish","hentry","contributor-cbailey"],"part":3,"_links":{"self":[{"href":"https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-json\/pressbooks\/v2\/chapters\/396","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-json\/wp\/v2\/users\/15"}],"version-history":[{"count":43,"href":"https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-json\/pressbooks\/v2\/chapters\/396\/revisions"}],"predecessor-version":[{"id":1281,"href":"https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-json\/pressbooks\/v2\/chapters\/396\/revisions\/1281"}],"part":[{"href":"https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-json\/pressbooks\/v2\/parts\/3"}],"metadata":[{"href":"https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-json\/pressbooks\/v2\/chapters\/396\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-json\/wp\/v2\/media?parent=396"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-json\/pressbooks\/v2\/chapter-type?post=396"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-json\/wp\/v2\/contributor?post=396"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/openbooks.library.unt.edu\/quantitative-analysis-exss\/wp-json\/wp\/v2\/license?post=396"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}