Indiana University-Purdue University Indianapolis
The Testing Center
   
The Testing Center
 
 
 

Home : Report Link : 97 Validity Report


VALIDITY OF THE IUPUI PLACEMENT TEST SCORES FOR COURSE PLACEMENT: 1996-1997


Howard R. Mzumara, Ph.D.
Testing Center and School of Education

Mark D. Shermis, Ph.D.
Testing Center and Department of Psychology

Karyn Dossinger, B.Sc.,
Jennifer J. Olson, B.Sc.
Testing Center and Department of Psychology

Indiana University - Purdue University Indianapolis


620 Union Drive,
Suite G003
Indianapolis, IN 46202-5167
317-274-2620





October 1997




INDEX
Executive Summary Introduction Mathematics Examination Reading Placement Examination
English Placement Examination Changes Made in Past Two Years. Method
Research Design and Data Analysis Probability Of Success Results
Mathematics Table 1
Figure 1. Figure 2. Reading
English Figure 3. Figure 4.
Figure 5. Figure 6. Figure 7.
Results of Gender and Ethnic Analyses Discussion and Conclusion Recommendations
Overall Recommendation. General Remarks References
APPENDIX A APPENDIX B Table B.1
Table B.2 Table B.3 Table B.4
Table B.5 Table B.6 Table B.7
Executive Summary

This report addresses a variety of issues regarding the undergraduate placement tests that are mandated for all entering students at IUPUI. Several improvements to the placement testing process were implemented by the academic departments this year, including a shift to a computerized adaptive math test (Shermis & Chang, 1997), the creation of a computerized reading test (Shermis, Wolting, & Lombard, 1996), and a study of the potential impact of computerized input for the English written exam (Harrington, Shermis, & Rollins, 1997). Moreover, beginning with this report, new graphs have been incorporated that indicate the probability of success for a student who achieves a given placement test score (Noble & Sawyer, 1997). Our hope is that these new interpretational aids will provide some additional help to counselors and other academic advisors looking to use the placement tests as one source of information in guiding the student to an appropriate course.

The evaluation of the new computerized adaptive math test involved samples drawn from fall of 1996. An adaptive test is one that conforms to the ability level of the examinee. Pilot work has shown this type of test to be just as reliable as the older non-adaptive test, even though it averages to be 30-50% shorter. Our own exit surveys also suggest that students like the adaptive tests because they are neither too difficult or too easy. Moreover, the computerized adaptive test, which consists of an item bank of 168 items, addresses security concerns in that each student essentially takes a different form of the test. The placement validity coefficients for this test, calculated on the relationship between the placement test results and a score on a common final, averaged r = .30 which makes it a very useful predictor and a significant improvement over last year. The Department of Mathematical Sciences, working with the Testing Center, recently made recommendations to improve the predictive validity of the test, especially for higher level courses. These recommendations will be implemented in mid-fall term. Placement graphs for mathematics are included in this report.

The new Indiana-Purdue Computerized Reading Test is modeled after the Nelson-Denny Reading Test, a nationally recognized assessment that is used for placement purposes. This test has been evaluated on a number of psychometric dimensions and has been demonstrated to have good reliability and validity (Shermis, Wolting, & Lombard, 1996). The advantage of the computerized test is that it can be easily scored and interpreted. Because only those who score low on the test are required to take a reading course higher scorers are "exempt" from a reading requirement, the sample drawn from Fall 1996 is homogeneous and tends to underestimate the predictive validity coefficient. In spite of this statistical anomaly, the predictive validity coefficient, based on the placement test and score on the Nelson-Denny Form H), averaged about r = .22 and is considered to be useful. Plans are currently underway to make this test computerized adaptive as well. Placement graphs for reading are included in this report.

The English written assessment is a writing sample of approximately 500 words generated in response to a prompt. The response not only asks the examinee to address a topic, but also to defend the answer in a thoughtful way. The essay is typically scored by at least two raters from the English Department, and a placement recommendation is made. While the rating scale used by the department has sufficient variance for a good validity assessment, the fact that the outcome measure is based on grades tends to underestimate the true relationship between the two variables. The placement validity coefficient for a sample drawn from Fall of 1996 averaged in the mid-teens, but still useful for placement purposes. The department is currently investigating alternative measures that might be used as an outcome measure. For instance, the department is evaluating the possibility of using portfolios as an alternative for one writing sample. In the interim, the English department has approved the use of typing (rather than writing) exam responses as a way to accommodate students' preferences and writing habits (Harrington, Shermis, & Rollins, 1997).

A few other developments are worth mentioning. In the spring of 1998, the Registrar's Office will be testing out a new "barring" system that will permit departments to enforce course and placement testing pre-requisites. That is, students will not be permitted to enroll in courses for which they lack the minimum requirements without some sort of an override. The creation of this new technology is an outgrowth of compliance problems which began a few years ago when a counselor's signature was no longer required for registration.

Shermis and Mzumara (1996) obtained an SDC grant to implement the IUPUI placement tests at designated local area high schools, IUPUI distance learning locations, and Ivy Tech. The pilot work for the grant was completed this past summer in Pike Township, and full-scale implementation is scheduled for late fall. The main technological development has been the implementation of the tests over the world wide web (Shermis, Mzumara, Lillig, & Brown, 1997).

Finally, the Testing Center is currently "beta testing" a new version of the IU Test Reporting System (IUTS) that will accommodate a more flexible approach to the handling of student performance data. We eventually hope to be able to offer academic departments the ability to incorporate information from multiple predictors rather than just one test score as they formulate their decision models for academic placement.


Introduction
The present study was designed to assess the validity of placement test scores in making placement decisions at Indiana University Purdue University Indianapolis (IUPUI) for the Fall 1996 semester. Placing new students into appropriate first year courses has become an increasingly challenging task for colleges and universities. Also, the percentage of postsecondary institutions with some form of placement and developmental instruction has steadily increased in the past decade and is now about 90% (Sawyer, 1996). In a recent survey on Remedial Education in Higher Education Institutions, conducted by the National Center for Education Statistics (NCES), about three-quarters (78 percent) of higher education institutions that enrolled freshmen offered at least one developmental reading, writing, or mathematics course in Fall 1995 (NCES, October 1996).

A cursory review of the literature (Sawyer, 1996, and the references cited therein) offers two potential explanations for the increase in placement and developmental instruction. "One suggested explanation is that American high schools have become less effective in preparing students for college" (Carnegie Foundation for the Advancement of Teaching, 1988; National Commission on Excellence in Education, 1983; Singal, 1991, as cited in Sawyer, 1996, p. 271). A second explanation is that more students from disadvantaged backgrounds are attending higher education institutions (College Entrance Examination Board, 1977; Munday, 1976, as cited in Sawyer, 1996).

In placement decisions, the concern of the institution is to create learning environments in which all students will learn. Hills, Hirsch, and Subhiyah (1990) define placement as a process by which students are assigned to courses commensurate with their past achievements in order to facilitate expeditious further learning. "The underlying idea is that students differ. They may differ in their level of preparation, in their adeptness at learning, in their interests, in their ability to organize for themselves, and so on. As a result, for efficient instruction, one arranges for different approaches for the different students or groups of students. Ideally, a student is placed in the learning situation which is best for him" (Hills, Hirsch, & Subhiyah, 1990, p.5). Accordingly, IUPUI has implemented a mandatory placement testing program for all undergraduate students in order to facilitate the academic success of students at the University.

The IUPUI placement tests were developed for the purpose of course placement (i.e., matching students with instruction appropriate to their academic preparation) in English (writing), mathematics, and reading. Thus, like most other higher education institutions (NCES, October 1996), IUPUI provides developmental courses in reading, writing, and mathematics. Generally speaking, developmental courses are provided to those college students lacking in academic skills necessary to perform college-level work at the level required by the institution (NCES, October 1996). Although what constitutes developmental courses varies from institution to institution, often developmental courses do not carry credit toward satisfying degree requirements.

In general, the rationale for placement testing is threefold: First, students who enroll in appropriate university courses should have a more positive experience than those who enroll in courses that are either too difficult or too easy. They should be more satisfied with their university experience, and thus more likely to be retained. Second, because students are more likely to be retained in classes that are appropriate to their ability level, departmental administrators can more carefully plan how best to allocate faculty resources to respective class sections. Finally, the placement tests might serve as a basis for assessing the contributions of the University to the development of general educational skills. If placement tests perform the three vital functions well, then University funds spent on these assessments are wisely expended. A brief description of the major IUPUI placement tests follows.

Mathematics Placement Exam. The mathematics placement examination was designed to assess students' college-level mathematics readiness skills upon admission into IUPUI. The mathematics exam used in the present study was a computerized adaptive test which consists of 168 objective items that assess skills ranging from pre-algebra to introductory calculus. The total placement score on the mathematics test ranged from 6 (lowest score) to 40 (highest possible score), and represents a student's ability level in mathematics. For placement purposes, students who score 6 on the test are advised to take Mathematics M010 (Pre-Algebra), a developmental mathematics course. Students scoring between 7 and 14 on the mathematics placement test are advised to take Mathematics 001 (Introduction to Algebra). Students who score between 15 and 23 on the mathematics test receive a placement recommendation of Mathematics 111 (Algebra). Students scoring at 24 or higher are advised into a variety of course offerings, depending on their academic major. Based on this assessment, therefore, placement is made into an appropriate mathematics class. Mathematics placement test scores are valid for one year from the test date.

Reading Placement Exam. The computerized reading placement exam used in the present study is an objective reading assessment consisting of five parts: reading rate, comprehension, and three different types of vocabulary tests (Word Attack, Words in Context, and Words in Isolation). The purpose of this test was to assess students' vocabulary and reading comprehension skills upon entry into IUPUI. Based on this test, a student's eligibility for university reading requirements or the need for reading improvement was determined. The total reading score ranged from 0 to 170, and indicates a weighted raw score composite. Students who scored between 0 and 52 were placed into the gateway program. Students who scored between 53 and 68 were advised to enroll in Education X150, and a score of 68 to 79 resulted in a placement recommendation for Education X152. Students who obtained a score equal to or greater than 80 were exempt from further requirements in reading skill development. Thus, students who read at college level were exempt from taking college reading classes. Reading placement test scores are valid for one year from the test date.

English Placement Exam. The English placement exam is a one-hour exam that asks students to write an essay that explains and supports their opinion on a current social issue. The test provides a brief explanation of the issue or the context in which the issue is posed. Students are also asked to evaluate their answer and explain what changes they might make, had they the time to do so. When readers assess the English placement tests, they look for presence or absence of organization, support, development, and the student's position on the issue presented. Students who need extra help focusing their essays around a major theme, or students who need extra help understanding the relationship between assertion and support, are placed into an appropriate developmental course.

The purpose of the English placement exam is to assess students' ability to write an essay that explains and supports one's viewpoint or opinion on a given issue. Examinees have a choice of two questions, each of which allows the students to use their personal experiences and/or observations in writing the essay. It is important that the test conveys not only the examinee's viewpoint on the selected topic, but also the reasons for taking a particular position. The test, however, does not require any specialized knowledge or research, only an ability to discuss an individual's opinion and reasons. Within the one hour time allotted to the English placement test, students are expected to (a) think seriously about the topic selected, (b) state an opinion clearly, (c) present examples or details that support an opinion, and (d) organize the essay clearly. The English test score is valid for two years from the test date. Students are required to register for the respective courses into which they are placed [i.e., E010 - Access to Writing (a developmental course for students judged not yet ready for W001); W001 - Fundamentals of English; W131 - Elementary Composition; or W140 - an honors version of W131].

Changes Made in the Past Year With continued efforts in daily and quality improvements, several changes were implemented based on the recommendations outlined in the previous placement validity reports (see Mzumara, Shermis, & Wimer, 1996). Based on the recommendations outlined in the previous reports, both mathematics and reading currently use alternate outcome variables. For instance, the traditional variable of "grade" was problematical because mathematics employs a truncated scale that excludes the grade of "D". Partly because of this, correlation coefficients in mathematics were in the low .20s; and reading correlations were close to zero when course grades of "pass/fail" were employed as outcome measures. Because mathematics uses a common final exam, this criterion provided an appropriate supplement to course grades. Moreover, since reading instructors administer Form H (or previously Form F ) of the Nelson-Denny Reading Testat the end of each semester, this reading score provided a convenient outcome replacement.

The second recommendation was that the Mathematics Department consider moving towards an adaptive test that was both configured to the department's curriculum and also drew from a larger item bank than the linearly computerized test. Consequently, full implementation of the computerized adaptive mathematics placement test took place in late October 1995. Further developments of the CAT mathematics placement test are underway with respect to increasing test items and/or revising the current item bank, development of testlets, improved reporting of mathematics test scores including subscores (for testlets), and revision of cutoff scores for course placement. Hopefully, these efforts will improve the predictive validity of the mathematics test, particularly for higher level courses.

A cursory review of the literature suggests that the adaptive procedure should produce a more precise estimate of ability, take a shorter time to complete, and be perceived as easier for most students. Preliminary results from the adaptive mathematics placement test were quite promising. Also, our own placement test exit surveys suggest that students like the computerized adaptive tests because they are neither too difficult nor too easy. Detailed results of the exit surveys are summarized in the Testing Center's 1996 annual report (see a copy of the 1996 annual report available on-line at the Testing Center's web site: http://assessment.iupui.edu/report/report.html). The results of the present study, therefore, should indicate how well the current adaptive mathematics test is working, particularly with respect to efficiency and precision of forecasts on student performance on the mathematics outcome measures.

The third recommendation was that the reading faculty consider developing an in-house computerized test that would eventually incorporate both adaptive and diagnostic features. While there was no widespread dissatisfaction among counselors with the Nelson-Denny Reading Test, near zero predictive validity coefficients coupled with the cumbersome test administration suggested that an assessment more closely linked to the curriculum was appropriate. The vision for the test was that it would perform as a placement test for all students, but would transform into a diagnostic procedure for those who were recommended for developmental coursework. Consequently, the linearly computerized reading placement test was implemented in late June 1996 (Shermis, Wolting, & Lombard, 1996). The adaptive version of the test should be completed early next year.

The final suggestion related to modifications to the scoring procedure for the English placement test. The problem with the previous procedure, from a statistical point of view, was that only one number (or rating) represented course placement in IUTS (the Indiana University test reporting system). This meant that there was no variance in the predictor and consequently no correlation coefficient could be calculated. The recommendation was that each class placement take on a range of predictor scores (ideally, 5 to 7 categories for each class). As reported in last year's placement validity report, the English Department developed a new scale during Fall 1994 and incorporated the new rating system in January, 1995. Note that the English placement test scores used in the present study were based on the scoring procedure that was implemented in January 1995. (The scoring system for the English placement test is described more fully in the Placement Handbook, IUPUI English Department and Susanmarie Harrington's memorandum of January 30, 1995, on the New Lookup Table.)


Method


Sample
The target population comprised all students who took either an English, mathematics, or reading placement test from January, 1996, through August, 1996, and enrolled in an English writing, mathematics or reading course during the Fall 1996 semester at IUPUI's Indianapolis campus. (Note that students with incomplete and/or missing course grades were excluded from the present study.)

Procedure for Obtaining the Data
Students' raw data were obtained through a FOCUS query (and/or students' academic records provided by the respective course coordinators) from the Fall 1996 cohort of students who took both placement exams and subsequently enrolled in a mathematics, reading or English course during Fall 1996 or Spring 1997. Because the IUPUI placement tests are seen as advisory1 rather than prescriptive, in some cases a student enrolled in a course that was not recommended by the placement test score. Consequently, for the purpose of statistical analyses, students were divided into two categories, namely, "compliant" and "non-compliant." The compliant group comprised students who took the recommended courses based on the placement test scores. The non-compliant group consisted of students who chose (sometimes in consultation with their advisors) not to take the recommended course. The current extract resulted in a pool of 2776 students for mathematics, 882 students for (computerized) reading (note that students placed in the "gateway" program and those exempt from taking reading courses were excluded from the present study), and 2742 students for the English (written) essay. With respect to compliance, there were 1965 students who took the mathematics placement exam and then enrolled in a recommended mathematics M010, Math 001, Math 111 or a higher class during the Fall of 1996. Likewise, 139 students took the new computerized reading test and then enrolled appropriately in either Education X150 or Education X152. (Note that the sample for reading excluded students who took the Nelson-Denny Reading Tests.) With respect to English, there were 1629 students who took the English placement exam and then enrolled in one of the following recommended courses of interest: E010, W001, W131, or W140. Overall, the total compliance rates (based on the available data) were 98.8% for mathematics, 100% for reading, and 99.6% for English. The current compliance rates were much higher than those obtained for the past two years or so.



Research Design and Data Analysis
The present study employed some aspects of decision theory models (Sawyer, 1996; Noble & Sawyer, 1997) and logistic regression techniques (Hosmer & Lemeshow; 1989; Norusis/SPSS Inc., 1992) to provide validity evidence for course placement criteria. Also, simple correlation and regression analyses were used to demonstrate the relationship between predictor scores and outcome variables. It is noteworthy that most college placement exams have correlations that run between .20 to .40 (cf. Hills, Hirsch, & Subhiyah, 1990).

The predictor variables consisted of students' placement exam scores on mathematics, reading, and English, respectively. The outcome measures were the corresponding exam scores and/or course grades in the respective content areas. The rationale for the validation/research design was as follows. To the extent that the usefulness of a placement test depends on the existence of statistical relationships, such evidence is clearly essential to validation. Thus, by measuring the strength of this statistical relationship, we obtained evidence on the validity of the placement test scores for making course placement recommendations. (Note that the present study employed the logistic regression model in validating the IUPUI course placement tests.)

The primary criterion measure for mathematics comprised final examination scores and/or grades based on a common final exam. The secondary outcome variable for mathematics consisted of the course grade obtained by the student at the end of the Fall semester. The mathematics grades ranged from "A+" to "F". For purposes of correlation analyses, the letter grades were converted to a numeric scale ranging from 4.33 for an "A+" to 0.33 for an "F".

The primary outcome variable for reading consisted of the total reading score received by a student on Form H of the Nelson-Denny Reading Test. The predictor variable for reading comprised students' placement test scores based on the IUPUI computerized reading test. For reading courses, letter grades ranging from "A+" to "F" were obtained. For purposes of conducting correlation analyses, however, the letter grades were converted to numeric scores ranging from 4.33 for "A+" to 0.33 for "F". Note that students who withdrew from either Education X150 or X152 course were excluded from the correlation analyses. The outcome variable for English was the course grade obtained at the end of the fall semester. English grades ranged from "A+" to "F". For purposes of calculating correlation coefficients, the letter grades were converted to a numeric scale ranging from 4.33 for "A+" to 0.33 for "F". Students who withdrew from courses of interest were excluded only from the correlation analyses.

The present study attempted to provide two major types of validity information. First, probabilities of success were estimated from logistic regression and frequency distributions of scores on the placement measures to determine the effectiveness of the course placement criteria. Probability graphs were then developed to provide graphical illustrations of the relationship between placement test scores and predicted first-year college performance in English, mathematics, and reading, respectively. Second, simple correlation analyses were conducted to obtain coefficients among all the variables studied (i.e., predictor and outcome measures). The intercorrelations (validity coefficients) between the predictors and each outcome measure indicated how the predictors were working, but did not provide a means of making specific predictions for individuals. This was accomplished with the prediction equations, which were essentially the product of logistic regression analyses. In essence, the prediction equations used one variable (i.e., the respective placement test scores) to predict an outcome. This information, however, did not indicate how effective the predictions were. Thus, estimates of measurement error in prediction (i.e., standard errors of estimate) were computed. The results of fitting logistic regression models to the respective data are reported in Appendix B as Tables B.1 - B.7.

Probability of Success
The statistical relationship between students' outcomes (i.e., a course grade of, say, "C" or higher) and their placement test scores was estimated using logistic regression. (Details regarding logistic regression are presented in Appendix A.) The relationship was estimated from the data of students who actually took a placement test and subsequently enrolled and completed the respective course(s) during Fall 1996 semester. For each placement test score, a corresponding probability of success was estimated. The outcome variable used a 0/1 (unsuccessful/successful) criterion measure. (Note that for logistic regression purposes, "FX" grades and withdrawals ("Ws") were considered as unsuccessful outcomes, without necessarily converting them to "F" grades). For purposes of this study, the criterion variable was generally defined as a grade of "C" or higher (2.0 or higher). However, probabilities of success were also estimated for grades of "B" or higher and "A-" or higher, respectively.


Results
The basic course placement procedure applied at IUPUI is mostly dependent upon students' academic achievement, as measured by the placement tests in English, mathematics, and reading, respectively. The manner in which placement decisions are made has been described in the Introduction section. Thus, it is imperative that the effectiveness of the existing placement procedures be known. To provide some information, the probability of success estimated what would happen if a specific cutoff on a particular criterion measure was applied to a particular reference group. (In the context of course placement, the reference group may be thought to as the student pool that took placement tests.) Note that using correlations as the basis of comparing potential placement measures can be misleading, as the "compliant" group of students may differ substantially from the reference group. Also, the restriction of range problem in the outcome measures lowers the utility of correlation coefficients in validating placement criteria. Alternative methods for validating course placement criteria were warranted as correlation evidence per se has severe limitations (see Mzumara, Shermis, & Wimer, 1996; Noble & Sawyer, 1997).

Table 1 provides a summary of the descriptive statistics for the compliant and non-compliant groups based on the respective placement test scores. The results of logistic regression analyses including graphs showing the probability of success for the respective groups by course are provided subsequently as Tables B.1 to B.7 in Appendix B. Based on the tables and graphs, a summary of the results per course is presented in turn.

Mathematics,
The following is a summary of the descriptive statistics based on compliant and non-compliant groups for mathematics (see Table 1). The compliant group for mathematics M010 (n = 52) had a mean placement test score of 6.00 with no standard deviation as, currently, there is no range in placement test scores for M010. Also, there were no students in the non-compliant group for Mathematics M010. The compliant group for Mathematics 001 (n = 1589) obtained a mean mathematics placement test score of 8.80, with a standard deviation of 2.00. The corresponding non-compliant group for Mathematics 001 (n = 20) had a mean placement test score of 9.75 and a standard deviation


Table 1 Descriptive Statistics for the Compliant and Non-Compliant Groups Based on the Placement Exam Scores (PES) for Fall 1996

SUBJECTCOURSEGROUPPES MEANPES S DN
MathematicsM010Compliant6.00.0052
Non-compliantn/a*n/a0
001Compliant8.802.001589
Non-compliant9.752.1720
111Compliant17.052.07324
Non-compliant17.753.594
Reading2X150Compliant61.634.0054
Non-compliantn/an/a0
X152Compliant73.363.3085
Non-compliantn/an/a0
EnglishE010Compliant1.00.004
Non-compliantn/an/a0
W001Compliant8.30.70542
Non-compliantn/an/a0
W131Compliant14.73.711064
Non-compliant14.75.504
EnglishW140Compliant21.58.6119
Non-compliant21.001.412




of 2.17. For Mathematics 111, the number of compliant students was 324, with a mean placement score of 17.05 and a standard deviation of 2.07. In contrast, there were only 4 non-compliant students in Mathematics 111 course with a mean placement score of 17.75 and a standard deviation of 3.59.

Figures 1 and 2 present probability estimates based on specific cutoffs on the outcome measures when applied to particular mathematics compliant groups. (Tables B.1 and B.2 in Appendix B show the results of logistic regression analyses for mathematics data.) For instance, as Figure 1 shows, a student with a mathematics placement test score of 14 is associated with an estimated probability of a B or higher grade of about .67. The corresponding C or higher cutoff score is 7 (probability of success is about .68). (Note, however, when a grade of A- or higher is employed, scores between 7 and 14 are associated with estimated probabilities of success of less than .25.) Overall, Figure 1 shows that the estimated probability of success increases as placement test scores increase. In other words, the higher the placement test score, the greater the probability of success in mathematics.

It is worthwhile to mention that the advantages of using a logistic regression approach, rather than traditional correlation methods, are that we can observe curvilinear relationships and the approach does not require strong distributional assumptions. "When the outcome measure is considered as a dichotomy (pass/fail), rather than as a continuous variable, the focus is placed on addressing the appropriate question, that being whether a student will be successful or unsuccessful, and less so whether a student will receive an A average vs. a B average" (Noble & Sawyer, 1997, p. 3). Thus, the logistic regression approach helps to reduce the problem of restriction of range in course grades.

The placement validity coefficients for the computerized adaptive mathematics test, calculated on the relationship between the placement test scores and scores on a common final mathematics exam, averaged .30, which reflects a very useful predictor and a significant improvement over last year. Using course grades as the outcome measures, the




Figure 1. Estimated Probability of Success in Mathematics 001 A- or Higher, B or Higher, and C or Higher




Figure 2. Estimated Probability of Success in Mathematics 111 A- or Higher, B or Higher, and C or Higher


correlation coefficient for compliant students was approximately .26 for Math 001 (n = 954). A drop in the validity coefficient for course grades was expected because of the adverse effect of range restriction.



Reading
Descriptive statistics for reading courses are shown in Table 1. Tables B.3 and B.4 (see Appendix B) show the results of fitting a logistic regression model to data for the reading X150 and X152 compliant groups, respectively. The estimated probabilities of success in reading courses X150 and X152 are presented in Figures 3 and 4, respectively. As Figure 3 shows, a reading placement score of 66 is associated with a probability of a B or higher grade of approximately .52. The corresponding C or higher cutoff score is 55 (probability of success is approximately .50). Figure 4 indicates that a reading placement score of 75 is associated with a probability of a B or higher grade of about .50. The corresponding C or higher cutoff score is 68 (estimated probability of success = .51).

With respect to correlation analysis, the average correlation coefficient between the reading placement test scores and scores on the Nelson-Denny Reading Test Form H was about .22 for the reading compliant groups. This validity coefficient could be considered useful for course placement purposes.

English
The descriptive statistics for English are summarized in Table 1. The compliant students for W001 writing course (n = 542) obtained a mean English placement test score of 8.30 and a standard deviation of .70. The compliant group for W131 course comprised 1064 students who had a mean English placement score of 14.73 with a standard deviation of .71. For the W140 compliant group (n = 19), the mean placement test score was 21.58 with a standard deviation of .61. In most cases, there were so few (if any) non-compliant students for writing courses to warrant separate statistical analyses.


Figure 3. Estimated Probability of Success in Educ X150 A- or Higher, B or Higher, and C or Higher




Figure 4. Estimated Probability of Success in Educ X152 A- or Higher, B or Higher, and C or Higher


Tables B.5 to B.7 (in Appendix B) provide the results for logistic regression analyses for respective writing courses. The accompanying graphs with estimates of probability of success in writing courses are shown as Figures 5 - 7. As Figure 5 shows, an English placement score of 10 is associated with a probability of a B or higher grade of about .47. The corresponding C or higher cutoff score is 6 (probability of success = .47). Overall, as seen in Figures 5 - 7, the estimated probability of success in writing increases as placement scores increase. Note, however, a negligible negative relationship (for English W140) between English placement scores and probability of success, when a grade of A- or higher is considered (see Figure 7). This particular result is most likely due to sampling error, as English W140, an honors writing course, had a very small sample size (n = 19). Note that the estimated logistic regression coefficients and the corresponding Wald (W) statistics for English W140 data were not statistically significant at the .05 alpha level.


Figure 5. Estimated Probability of Success in English W001 A- or Higher, B or Higher, and C or Higher




Figure 6. Estimated Probability of Success in English W131 A- or Higher, B or Higher, and C or Higher


Figure 7. Estimated Probability of Success in English W140 A- or Higher*, B or Higher, and C or Higher








Results of Gender and Ethnic Analyses
The issue of gender differences in academic achievement was explored using the Student t-test for independent samples. Preliminary results based on gender analysis indicated that there were some statistically significant gender differences in achievement as reflected in performance on the respective placement tests and/or outcome measures. Specifically, statistically significant differences were obtained on CAT mathematics placement test scores [M(males) = 11.37, M(females) = 10.13; t(2440) = 7.22, p < .001] with males scoring higher. However, females tended to obtain better final exam scores in mathematics [M(males) = 28.76, M(females) = 30.20); t(1654) = -3.43, p < .001] . With respect to English, no statistically significant differences were observed between males and females on the English placement tests [M(males)= 13.16, M(females) = 13.17; t(2740) = -.04, p = .967] . However, a statistically significant gender-related difference was obtained for the English course grades [M(males) = 2.39, M(females) = 2.61; t(1766) = -3.77, p < .001] , with female students performing significantly better than male students. In contrast to last year's observation, there was a statistically significant gender difference in reading placement test scores [M(males) = 97.64, M(females) = 90.26; t(880) = 4.99, p < .001] ; but not on the reading post-test (N-D Reading Test Form H) scores [M(males) = 92.76, M(females) = 94.04; t(82) = -.31, p = .755]. Note, however, statistically significant gender-related differences were observed for both reading courses, namely EDUC X150 [M(males) = 1.48, M(females) = 2.15; t(265) = -3.53, p < .001] and EDUC X152 [M(males) = 2.01, M(females) = 2.50; t(320) = -2.68, p = .008]. The aforementioned results indicate a need to conduct a thorough investigation on gender-related differences in academic performance, as reflected in placement test scores and/or course grades at IUPUI. The issue on gender research is, therefore, left for further study.

The analyses based on the ethnic status of students did not yield statistically significant results, when sampling error due to disproportionate sample sizes was taken into account. This finding is similar to those obtained in previous validity studies at IUPUI. However, given the exploratory nature of the present investigations on gender and ethnic bias, it is important to investigate further the link between ethnicity and academic achievement at IUPUI.



Discussion and Conclusion
The primary purpose of the present study was to investigate the validity of the IUPUI placement exam scores for course placement in mathematics, reading, and English. In comparison with the last two years' findings, the overall validity coefficients for mathematics were higher, and almost the same for reading and English. The logistic regression approach, however, offered a more appropriate means of determining the effectiveness of the placement criteria. Therefore, graphs indicating estimates of probabilities of success were constructed for each placement test score. It is hoped that the new probability graphs will provide some additional help to counselors and other academic advisors seeking to use the placement tests as one source of information in guiding the student to an appropriate course.

Of course, the logistic regression techniques have limitations too, such as the influence of range restriction in extreme cases. For instance, if very few or no students are unsuccessful (e.g., below a grade of B) or, for course placement, the course is either very easy or very hard, it is difficult to estimate probabilities of success (Noble & Sawyer, 1997). Similarly, estimated probabilities of success are also influenced by sample size. Generally speaking, the sample sizes required to estimate the logistic regression weights are larger than those needed for linear regression. Thus, we can expect relatively large sampling error whenever small sample sizes (say, n < 100) are employed in logistic regression analyses (see Noble & Sawyer, 1997).

Some of the general factors for explaining the results of placement validity studies are outlined in last year's placement validity report (Mzumara, Shermis, & Wimer, 1996). As noted in last year's report, a modest increase in the validity coefficients is observed when the Nelson-Denny Reading Test Form H scores are employed as a primary criterion. A similar situation was observed this year. Probably due to the influence of score range restriction and small sample size on validity coefficients, the correlations obtained for English were low and mostly not statistically significant. Another important factor to consider for English is that a majority of high school graduates are poorly prepared in writing. Specifically, recent research indicates that most high school students neither write well nor write much in school or outside of school [see the Education Research Report titled What's Wrong with Writing and What Can We Do Right Now? by the Office of Educational Research and Improvement (OERI), April 1993].

Notwithstanding the methodological limitations of placement validity studies, several recommendations are outlined in the next section.




Recommendations
Based on this study and our experience, the following advice should be helpful for course placement and test construction. Recommendations are presented separately for each content domain under investigation.

Mathematics
To improve the validity coefficients for mathematics, a new or improved Mathematics placement exam, utilizing Computerized Adaptive Testing (CAT) procedures, was implemented to replace the current conventional mathematics test. The conversion to adaptive testing in mathematics was actually accomplished in October 1995. As mentioned earlier, revisions with respect to improving item bank characteristics, cutoff scores, better reporting of test results, and the development of testlets are currently underway to improve both the content and predictive validity of the computerized adaptive mathematics placement test. These recommendations will be implemented in mid-fall.

Since the switch-over to CAT, the Testing Center has been collecting information to evaluate the effectiveness of computerized adaptive testing in mathematics, and the preliminary findings have been very encouraging. For instance, we have obtained a positive validity coefficient as high as .31 (p < .001) between the computerized adaptive mathematics test scores and the final exam scores for the Math 001 compliant group (n = 971).

Mathematics placement cutoff scores need to be modified and should take into account the standard error of measurement and other practical issues. These cutoff points represent the standards upon which placement recommendations are made. Thus, cutoff scores should be chosen judiciously to ensure that a student enters only those courses for which he/she is prepared. Various methods for setting cutoff scores are discussed elsewhere in the literature (e.g., see Hills, Hirsch, & Subhiyah, 1990; Nitko, 1983).

In addition to modifying the current cutoff scores, the Testing Center staff (in conjunction with the mathematics faculty) should consider adding appropriate items to the current item bank or periodically revise the local mathematics placement test to improve its content validity or curricular relevance. Note that this recommendation is already in effect, as the Testing Center staff and faculty in the Department of Mathematical Sciences are currently engaged in the process to improve the CAT mathematics placement test.

Reading
Because of the low correlations between the Nelson-Denny Reading Test Form E and Form F, a new Indiana-Purdue Computerized Reading Placement Test has now replaced the Nelson-Denny Reading Test Form E (Shermis, Wolting, & Lombard, 1996). This test has been evaluated on several psychometric dimensions, and has been demonstrated to have good reliability and validity (Shermis, Wolting, & Banta, 1996). The non-adaptive version of the reading test was implemented in late June 1996. This linearly computerized reading test will soon be converted to an adaptive format. Previous pilot information on the reading test designed to work in a CAT environment has yielded promising results. Thus, we encourage all efforts to convert to a computerized adaptive reading test.

English
With respect to English, we recommend and/or encourage the English Department faculty to seek or adopt alternative methods of assessment (e.g., portfolios), in conjunction with traditional forms of assessment, to obtain a wider range of scores. Also, it should be worthwhile to consider expanding the current score range for the current English placement test so as to minimize the influence of range restriction.

Most recently, the English Department approved the use of typing (rather than writing) exam responses as a way to accelerate the turnaround time for reporting test scores. Exploration is also underway with student samples drawn from local area high schools to evaluate the possibility of computerized scoring of essays (Page, 1995).

Overall Recommendation. One of the continuing problems has to do with enforcing compliance in course placement. Although the current compliance rates in English, mathematics, and reading are much higher than for the past two years, overall compliance rates will most likely increase when the Registrar's Office implements an enforcement mechanism for class enrollment other than relying on post-registration audits. Thus, we encourage current efforts by the Registrar to create a "barring mechanism" akin to the financial bars that are already in place. The bar will prevent a student from enrolling in a class that is inappropriate for him/her based on placement test scores unless special consideration is granted by the respective academic departments. Such a "barring" system is currently being tested, and is scheduled for implementation in early spring.

General Remarks
Future placement validity studies should include an analysis of the nature and content of the outcome measures. It is likely that any one year's examination is different in its content coverage of the curriculum, standards or difficulty. Such year-to-year differences in skills tested may account for a considerable amount of any change in performance on the examinations. A content validity analysis would provide some useful and/or supplementary information that would help explain some of the observed inconsistency in the relationships between predictor and outcome scores.

It is noteworthy that placement tests are designed to be used in conjunction with counselors' recommendations, and provide one source of information about the student's current ability level. Decisions regarding course placement ought to include a student's previous academic record and not be made on the basis of placement test scores alone. Because the IUPUI placement decisions are advisory, the placement tests are designed to be used in conjunction with counselors' recommendations. Thus, the use of multiple predictors for course placement is desirable (see Sawyer, 1996). Rather than use placement test scores alone, additional predictors may include one or more of the following variables: high school (HS) overall grade point average (GPA), HS subject GPA, HS (percentile) rank, and HS courses completed. It seems likely that the more information counselors have about a student, the better the chance of their making an appropriate placement decision. (Note that data collection and management for research purposes will be facilitated with an improved IU Test Reporting System (IUTS), which is currently in beta testing.)

After several years of placement testing in English, mathematics and reading, the respective stakeholders should be fairly convinced that the IUPUI placement testing program provides an efficient, practical, and workable method of placing students in appropriate courses which give them the best chance for academic success. The IUPUI placement tests, however, are a guide, based on the past performances of other similar students, of potential success in specific sets of courses in English, mathematics, and reading. Although the placement tests cannot measure motivation or other affective variables in academic learning, they are quite useful in providing our students with a healthier and more valid learning experience in the respective courses. Placement testing, therefore, offers the University a practical way to accommodate the more diverse academic needs and talents of the entering students.




References

Akst, G., & Hirsch, L. (1991).Selected studies on mathematics placement. Review of Research in Developmental Education, 8(4), 1-3.

Allen, M. J., & Yen, W. M. (1979). Introduction to measurement theory. Monterey, California: Brooks/Cole.

Brown, J. I., Bennett, J. M., & Hanna, G. (1981). The Nelson-Denny Reading Test: Examiner's Manual Forms E and F. Chicago, IL: Riverside.

Coladarci, T. (1986). Accuracy of teacher judgments of student responses to standardized test items. Journal of Educational Psychology, 78, 141-146.

College Entrance Examination Board (1982). Guide to the College Board Validity Study Service. New York: Author.

Cronbach, L. J. (1970) Essentials of psychological testing (3rd ed.). New York: Harper & Row.

Cronbach, L .J. (1971). Test validation. In R. L. Thorndike (Ed.), Educational measurement (2nd ed.). Washington, DC: American Council on Education.

Ebel R. L. (1972). Essentials of educational measurement (2nd ed.). New Jersey: Prentice-Hall.

Glass, G. V., & Hopkins, K. D. (1984). Statistical methods in education and psychology (2nd ed.). Englewood Cliffs, N.J.: Prentice-Hall.

Harrington, S., Shermis, M. D., & Rollins, A. (1997). The influence of word processing on English placement test results. Indianapolis, IN: IUPUI Department of English (manuscript in preparation).

Hills, J. R., Hirsch, T. M., & Subhiyah, R. G. (1990). Issues in placement. Washington, DC: ERIC Clearinghouse on Tests, Measurement and Evaluation/American Institutes for Research.

Hosmer, D. W., & Lemeshow, S. (1989). Applied logistic regression. New York: John Wiley & Sons.

Kerlinger, F. N., & Pedhazur, E. J. (1973). Multiple regression in behavioral research. New York: Holt, Rinehart & Winston.

Kirk, R. E. (1990). Statistics: An introduction (3rd ed.). Fort Worth: Holt, Rinehart & Winston.

Mzumara, H. R., Shermis, M. D., & Wimer, D. (1996). Validity of the IUPUI placement test scores for course placement: 1995-96. (Annual placement validity report.)

National Center for Education Statistics (October 1996). Developmental education at higher education institutions in fall 1995. Statistical Analysis Report (NCES 97-584). (WWW: http://www.ed.gov/NCES/pubs/97584.html)

Nitko, A. J. (1983). Educational tests and measurement: An introduction. New York: Harcourt Brace Jovanovich.

Noble, J. & Sawyer, R. (1997, spring). Alternative methods for validating admissions and course placement criteria. The Association for Institutional Research for Management Research, Policy Analysis, and Planning (No. 63), 1-12.

Norusis, M. J. & SPSS Inc. (1992). SPSS for Windows: Advanced statistics, Release 5. Chicago, IL: SPSS Inc.

Office of Educational Research and Improvement (April 1993). What's wrong with writing and what can we do right now? OERI Research Report. Washington, DC: Author. (WWW: http://www.ed.gov/pubs/OR/ResearchRpts/writing.html)

Page, E. B., & Petersen, N. S. (1995). The computer moves into essay grading: Updating the ancient test. Phi Delta Kappan, 76(7), 561-565.

Sawyer, R. (1996). Decision theory models for validating course placement tests. Journal of Educational Measurement, 33, 271-290.

Shermis, M. D., & Chang, S. H. (1997). The use of IRT to investigate the hierarchical nature of a college mathematics curriculum. Unpublished manuscript, IUPUI Testing Center, Indianapolis, IN.

Shermis, M. D., & Mzumara, H. R. (1996).College placement testing through the World Wide Web: Preparing students for post-secondary education. Indianapolis, IN: IUPUI Testing Center.

Shermis, M. D., Mzumara, H. R., Lillig, C., & Brown, M. (1997).Computerized adaptive testing through the World Wide Web. Paper presented at the annual meeting of the American Psychological Association, Chicago, IL.

Shermis, M. D., Wolting, M., & Lombard, D. (1996). Computerized adaptive testing for reading placement and diagnostic assessment. Journal of Developmental Education, 20(2), 18-24.

Stiggins, R. J., & Bridgeford, N. J. (1985). The ecology of classroom assessment. Journal of Educational Measurement, 22, 271-286.

Wainer, H. (1993). Some practical considerations when converting a linearly administered test to an adaptive format. Educational Measurement: Issues and Practice, 12, 15-20.

Wiseman, S. (1967). The effect of restriction of range upon correlation coefficient. British Journal of Educational Psychology, 37, 248-252.




APPENDIX A
Logistic Regression and Probability of Success

Ordinarily, logistic regression estimates the relationship between a dichotomous outcome (i.e., a university course grade of "C" or higher) and one or more predictors (Hosmer & Lemeshow, 1989). In most respects, logistic regression employs same general principles as linear regression, but fits a non-linear model with a predicted outcome bounded by 0 and 1. The advantages and utility of logistic regression approach, compared to traditional correlation methods, in validating placement criteria are discussed elsewhere in the literature (e.g., see Noble & Sawyer, 1997; Sawyer, 1996; and Hosmer & Lemeshow, 1989).

Logistic regression produced a student's estimated probability of success, using the following formula:

wherea and b are regression coefficients estimated on the data,
x is the score the placement test,
K is the cutoff score on the placement test, and
e is the base of the natural logarithms, approximately 2.718
(Hosmer & Lemeshow, 1989; Noble & Sawyer, 1997).






APPENDIX B


Table B.1
Logistic Regression Statistics for Mathematics 001 Data (A- or higher, B or Higher, and C or Higher)


VariableEstimated CoefficientStandard ErrorWald Statisticdf p

A or Higher
Math Placement Score.222.04623.2731.001
Constant-4.249.47281.050 1.001
-2 Log-likelihood = 660.645

B or Higher
Math Placement Score .262.03361.7921.001
Constant-2.953.31985.4651.001
-2 Log-likelihood = 1280.697

C or Higher
Math Placement Score.365.05445.4941.001
Constant-1.808.46315.2631.001
-2 Log-likelihood = 945.634
<



Table B.2 Logistic Regression Statistics for Mathematics 111 Data (A- or higher, B or Higher, and C or Higher)

VariableEstimated CoefficientStandard ErrorWald Statisticdf p

A or Higher
Math Placement Score.064.074.7411.389
Constant-2.5061.2873.7901.052
-2 Log-likelihood = 242.399

B or Higher
Math Placement Score.004.061.0051.944
Constant-.1621.045.2411.877
-2 Log-likelihood = 339.148

C or Higher
Math Placement Score.347.2162.5821.108
Constant-2.6983.494.5961.440
-2 Log-likelihood = 89.588




Table B.3 Logistic Regression Statistics for Educ X150 Data (A- or higher, B or Higher, and C or Higher)

VariableEstimated CoefficientStandard ErrorWald Statisticdf p

A or Higher
Reading Placement Score .009.097.0091<.926
Constant-2.3086.021.1471.702
-2 Log-likelihood = 45.304

B or Higher
Reading Placement Score.088.0731.4441.230
Constant-5.70744.5201.5951.207
-2 Log-likelihood = 73.670

C or Higher
Reading Placement Score.084.0721.3621.243
Constant-4.6134.4081.0951.295
-2 Log-likelihood = 71.188




Table B.4 Logistic Regression Statistics for Educ X152 Data (A- or higher, B or Higher, and C or Higher)

VariableEstimated CoefficientStandard ErrorWald Statisticdf p

A or Higher
Reading Placement Score.106.0752.0071.1566
Constant-8.6885.5492.4511.117
-2 Log-likelihood = 102.284
B or Higher
Reading Placement Score.108.06862.4911.114
Constant-8.1365.0432.6021.107
-2 Log-likelihood = 115.686
C or Higher
Reading Placement Score.083.0691.4551.228
Constant-5.6205.0591.2341.267
-2 Log-likelihood = 111.641




Table B.5 Logistic Regression Statistics for the English W001 Data (A- or higher, B or Higher, and C or Higher)

VariableEstimated CoefficientStandard ErrorWald Statisticdf p

A or Higher
English Placement Score.175.1501.3591.001
Constant-2.9931.2595.6501.018
-2 Log-likelihood = 506.229

B or Higher
English Placement Score.240.1253.6771.055
Constant-2.5051.0445.7591.016
-2 Log-likelihood = 716.879

C or Higher
English Placement Score
.271.1363.9721.046
Constant-1.7401.1272.3831.123
-2 Log-likelihood = 717.896




Table B.6 Logistic Regression Statistics for the English W131 Data (A- or higher, B or Higher, and C or Higher)

VariableEstimated CoefficientStandard ErrorWald Statisticdf p

A or Higher
English Placement Score.491.12216.1871.001
Constant-8.5731.81522.3021.001
-2 Log-likelihood = 1098.052

B or Higher
English Placement Score.247.0917.3211.007
Constant-3.9091.3488.4081 .004
-2 Log-likelihood = 1449.256

C or Higher
English Placement Score.084.094.8141.367
Constant-.4311.377.0981.754
-2 Log-likelihood = 1307.567




Table B.7 Logistic Regression Statistics for the English W140 Data (A- or higher, B or Higher, and C or Higher)

VariableEstimated CoefficientStandard ErrorWald Statisticdf p

A or Higher
English Placement Score-.034.803.0021.966
Constant.19517.329.0001.991
-2 Log-likelihood = 25.008

B or Higher
English Placement Score.127.778.0271.870
Constant-2.64116.789.0251.875
-2 Log-likelihood = 26.287

C or Higher
English Placement Score.321.816.1551.694
Constant-6.15517.586.1221.726
-2 Log-likelihood = 23.699