Be Careful About What You Agree To In Joint Committee Work:

Sometime in the next two years your school district will formally convene the District’s PERA Joint Committee to discuss and agree upon the various elements required by law for performance based teacher evaluations. Administrators should be careful to not expand the duties, authority, and jurisdiction of the PERA Joint Committee beyond the law. Please check with your legal counsel before discussing teacher evaluation issues such as how to determine the relative importance of the various components within the framework or how to determine summative ratings.

Management retains certain rights under this law and you need to be familiar with these rights and exercise them accordingly. For example, there will be an equal number of representatives on the PERA Joint Committee and the union has no right to have input into who the management members of this committee are. You could obviously talk about this and agree on the members but this is a management decision to appoint the management members.

The evaluation result or rating is a management right. Be careful of any language related to an appeal process to the teacher rating. Again, consult with your legal counsel prior to making this decision.

The persons who will do the evaluating are a management right as is the training of these individuals. The PERA Joint Committee should not be discussing or agreeing to language that would limit this right. For example, the committee should not be discussing continued training and inter-rater reliability of the raters.

RV’s Recommendations for Weighting Components of Danielson’s Framework for Teaching

When I train school administrators on using the Danielson Frameworks for Teaching (FFT) I am often asked how to score (rate) the professional practice of teaching. A good place to start this discussion is the ISBE model process for summative rating. This model system has been created and published already and is on the ISBE/PEAC website at http://www.isbe.state.il.us/PEAC/default.htm The following is information copied from the website:

Subject

This document provides a description of excellent, proficient, needs improvement and unsatisfactory levels of teacher practice.

Model System Component

The Performance Evaluation Reform Act (PERA) requires that teachers “be rated using the 4 categories of ‘excellent’, ‘proficient’, ‘needs improvement’, or ‘unsatisfactory’” (PERA Sec. 2-3.25[b]).

Explanation

 In the Model Teacher Evaluation System, districts will provide teachers with an overall teacher practice rating and overall student growth rating. The overall teacher practice rating and the overall student growth rating will be combined to create an overall summative rating using the decision matrix (see Table 1).

Table 1. Overall Summative Rating Decision Matrix

Overall Teacher Practice Rating
Excellent Proficient Needs Improvement Unsatisfactory
Overall Student Growth Rating Excellent Excellent Excellent Proficient Additional evidence required—if confirmed, Proficient
Proficient Excellent Proficient Proficient Needs Improvement
Needs Improvement Proficient Proficient Needs Improvement Needs Improvement
Unsatisfactory Additional evidence required—if confirmed, Proficient Needs Improvement Needs Improvement Unsatisfactory

For the Model Teacher Evaluation System, Excellent, Proficient, Needs Improvement and Unsatisfactory performance will be defined as below.

  • Excellent: Teacher receives a rating of excellent in both practice and student growth; or is proficient in practice and excellent in student growth; or is excellent in practice and proficient in student growth.
  • Proficient: Teacher receives a rating of proficient in both practice and student growth; or is needs improvement in practice and excellent in student growth; or is excellent in practice and needs improvement in student growth; or is needs improvement in practice and proficient in student growth; or is proficient in practice and needs improvement in student growth. A teacher also may be proficient if rated excellent in practice and unsatisfactory in growth or unsatisfactory in practice and excellent in growth and after additional evidence is collected, those ratings are confirmed as being accurate.
  • Needs improvement: Teacher receives a rating of needs improvement in both practice and student growth; or is unsatisfactory in practice and proficient in student growth; or is proficient in practice and unsatisfactory in student growth; or is unsatisfactory in practice and needs improvement in student growth; or is needs improvement in practice and unsatisfactory in student growth.
  • Unsatisfactory: Teacher receives a rating of unsatisfactory in practice and student growth.

Frequently Asked Question

  1. Is our district required to adopt these definitions?

The only districts that are required to follow these steps are those implementing the Model Teacher Evaluation System, which occurs when the Joint Committee cannot come to a decision on this aspect of student growth within 180 days of the first meeting of the Joint Committee. See PERA Section 24A-4(b) for more information on the Model Teacher Evaluation System and deadlines for a Joint Committee. These definitions are not appropriate for districts not implementing the overall decision matrix included above as Table 1.

The information above does not give direction on how to actually calculate the Overall Teacher Practice Rating. Thus, this will become a decision that the Joint Committee in your school district will have to make.

Charlotte Danielson designed the Frameworks for Teaching to be used for professional development of teachers, not for rating of teachers. I do not believe that Danielson has ever put specific percentages on the various components. I have heard Danielson state and I have read in her books that she believes that Domain 3, Instruction, is the heart of the frameworks. She goes on to state that 3c, Engaged Learning, is the heart of the heart of the frameworks. Thus, I interpret this to mean that Domain 3 is most important with 3c being the component within Domain 3 with the most importance.

Danielson goes on to say that 2a, Respect & Rapport, is vital to the student learning and I interpret this to be the most important in Domain 2 with Domain 2 being second in overall importance.

Domains 3 and 4 follow domains 1 and 2 in importance. Component 1e, Designing Coherent Instruction, is the most important component in Domain 1 and Danielson states is the heart of planning and instruction. Component 4e, Reflecting on Teaching, is vitally important because of Danielson’s basic belief that “the learner has to be intellectually involved in order to learn.” This applies to teachers as well as students.

My personal interpretation of the weights of the various components is in the following breakdown. (Remember, these are my interpretations of the Danielson Frameworks for Teaching.) The Joint Committee could also decide to use words to calculate the Overall Teacher Practice Rating.

Domain 1: Planning and Preparation;

1a=5%, 1b=5%, 1c=3%, 1d=3%, 1e=3%, and 1f=3%.

Domain 2: The Classroom Environment;

2a=5%, 2b=3%, 2c=3%, 2d=3%, 2e=2%.

Domain 3: Instruction;

3a=6%, 3b=10%, 3c=15%, 3d=6%, 3e=2%.

Domain 4: Professional Responsibilities

4a=7%, 4b=2%, 4c=4%, 4d=4%, 4e=4%, 4f 2%.

For what it is worth, this would be my summary score range:

80 to 100% Excellent

65 to 79% Proficient

50 to 64% Needs Improvement

Below 50% Unsatisfactory

Engaged Learning = Technology Integration Done Correctly

Recently I had the privilege of visiting Gurnee Elementary School District #56. Dr. John Hutton is the superintendent and Dr. Hutton and the building principals took me on a tour of Viking Middle School and the newly constructed elementary school Prairie Trail School. Gurnee has implemented a full 1:1 iPad technology program for all of its K-8 students. This technology conversion may be one of a kind not only in Illinois but also in the United States. All 2,200 students and all faculty members have iPad’s. I plan on writing some articles and producing some podcasts on what I discovered at Gurnee but in short this visit was truly amazing.

I have been working with education administrators across the entire State of Illinois for almost three years on best practices for teacher evaluation. One of the most misunderstood concepts related to the Danielson Frameworks for Teaching is the emphasis that the frameworks put on component 3c, Engaged Learning. Danielson defines engaged learning as the student being intellectually involved in the work.

When I was visiting classrooms in Gurnee it seemed to me that 100% of the students were intellectually engaged in the work. This was an amazing experience and while the technology alone does not make for “engaged work,” the effective use of technology allows for student engagement. Congratulations Gurnee School District on doing great work! I would highly recommend school administrators and teachers from other Illinois school districts to visit Gurnee to see how to use technology to effectively engage students.

Model vs. Default:

In my opinion there is much confusion over the use of the words “model” and “default” in the State of Illinois new Performance Based Teacher Evaluation System.

Most school districts (the top performing 80%) will not have its Performance Based Teacher Evaluation system in place until the start of the 2016 school year. The lowest 20% districts will need the system in place by 2015 and some Title I grant receiving districts will be implementing sooner.

ISBE has declared that the Danielson Frameworks is the state model for practice based teacher evaluation. It is not the “default model,” it is a “model.” The following sentence was copied from the ISBE Website “The classroom observation portion of the state-designed optional evaluation system is being adapted from Charlotte Danielson’s Framework for Professional Practice, widely used for evaluations by districts across the country.”

School district joint committees are free to develop their own model but if they do they need to make sure they meet the requirements of the TITLE 23:  EDUCATION AND CULTURAL RESOURCES SUBTITLE A:  EDUCATION CHAPTER I:  STATE BOARD OF EDUCATIONSUBCHAPTER b:  PERSONNEL PART 50 EVALUATION OF CERTIFIED EMPLOYEES UNDER ARTICLE 24A OF THE SCHOOL CODE. SUBPART A:  GENERAL REQUIREMENTS. Several important concepts in the Part 50 rules that the Joint Committee will need to consider are the following:

  • “The evaluation plan shall contain a rubric to be used in rating professional practice that aligns to the instructional framework.”
  • “Instructional framework developed or adopted by the school district that is based upon research regarding effective instruction; addresses at least planning, instructional delivery, and classroom management; and aligns to the Illinois Professional Teaching Standards.”
  • “The evaluation plan shall consider the teacher’s attendance and competency in the subject matter taught, as well as specify the teacher’s strengths and weaknesses and the reasons for identifying the areas as such.”
  • The district shall quantify the relative importance of each portion of the framework to the final professional practice rating.

The state model plan that becomes the “default plan” is referred to in this following sentence from the Part 50 Rules: “In situations where a joint committee cannot reach agreement on one or more aspects of student growth within the timeline established under Section 24A-4(b) of the School Code, the school district shall adopt the State model plan contained in Subpart C of this Part with respect to those aspects of student growth upon which no agreement was reached.”

The PEAC is in the process of determining the “State Model Plan” for student growth. Updated information on this process can be accessed at the ISBE website http://www.isbe.net/PEAC/default.htm

Will Illinois School Districts Use Student Survey for Teacher Performance Evaluation?

In a previous blog article I summarized the findings of the MET Study Ensuring Fair and Reliable Measures of Effective Teaching. The MET project researched what measures could be used to identify effective teaching fairly and reliably. In the initial MET report in 2010 the researchers found that “a well-designed student perception survey can provide reliable feedback on aspects of teaching practices that are predictive of student learning.” The 2012 study Gathering Feedback for Teaching “presented similar results for classroom observations.”

Illinois districts have for the first time (outside of the City of Chicago that has been giving this student survey for many years) given students a survey titled “Illinois 5Essentials Survey.” This 5Essentials survey was given to students in 6th through 12th grades.  According to research completed on the 5Essentials Survey “Research shows that schools strong on the five essentials are ten times more likely to improve student learning than schools weak on the five essentials.” The five essentials are the following:

  • Effective leaders: The principal works with teachers to implement a clear and strategic vision for school success.
  • Collaborative teachers: The staff is committed to the school, receives strong professional development, and works together to improve the school.
  • Involved families: The entire school staff builds strong relationships with families and communities to support learning.
  • Supportive environment: The school is safe and orderly. Teachers have high expectations for students. Students are supported by their teachers and peers.
  • Ambitious instruction: Classes are academically demanding and engage students by emphasizing the application of knowledge.

The MET Project used the Tripod Survey. The Tripod survey identifies seven constructs—the 7 Cs—that are core to a student’s experience in his or her classroom. They are 1) care; 2) control; 3) clarify; 4) challenge; 5) captivate; 6) confer and 7) consolidate.

The study concluded, “The average student knows effective teaching when he or she experiences it.”

The MET Project recommends a score weighting of either 1) 33% for Student surveys, 33% for Observations and 33% for Achievement gains on state tests or 2) 50% on Observations, 25% for Student surveys and 25% for Achievement gains on state tests to achieve the highest reliability for teacher evaluation.

It will be interesting to see if the State of Illinois will take these MET Research findings into account when deciding on the final default model for Illinois Performance Based Teacher Evaluation. This is a topic Illinois school districts should be discussing with its Teacher Evaluation Joint Committee members in an informal setting prior to the formal announcement of the Joint Committee work.

MET Project Reveals Important Teacher Evaluation Research:

The Measures of Effective Teaching (MET) Project has three major findings; 1) student achievement gains are the direct result of more effective instruction; 2) trained evaluators using a research based evaluation instrument (Danielson Frameworks for Teaching) does result in proper summative practice teacher evaluation; and 3) student perception surveys are effective in determining effective teaching.

The most important finding of the (MET) is that research validated the theory that teaching ability affects student achievement taking into account that students have various learning characteristics. The student academic achievement scores of the most effective teachers were significantly higher than the student scores of the least effective teachers. In addition, the student scores of the least effective teachers actually decreased.

This study showed that more effective teachers are truly better than other teachers at improving student learning, not that they simply have better students. The study collected data and produced estimates of teaching effectiveness for each teacher. The study then adjusted the estimates to account for student differences in prior test scores, demographics, and other traits. The study then randomly assigned a classroom of students to each participating teacher.

The adjusted measures did identify teachers who produced higher (and lower) average student achievement gains following random assignment. The data showed that we can identify groups of teachers who are more effective in helping students learn. Moreover, the magnitude of the achievement gains that teachers generated was consistent with expectations.

The more effective teachers not only caused students to perform better on state tests, but they also caused students to score higher on other, more cognitively challenging assessments in math and English.

The study pointed out that “Teaching is too complex for any single measure of performance to capture it accurately. Identifying great teachers requires multiple measures.” The MET project identified three composites that reliably predicted student academic growth. They were 1) state assessments, 2) classroom observations, and 3) student surveys.

The project went on to say that significantly placing a lot of weight on one measure such as state assessments will lead to unintended consequences. In the case of weighting state assessments as more than 50% weight, results in teachers teaching to the assessment and leaving out of the curriculum of other important education goals such as higher order thinking and creativity. The study concluded “Heavily weighting a single measure may incentivize teachers to focus too narrowly on a single aspect of effective teaching and neglect its other important aspects….If the goal is for students to meet a broader set of learning objectives than are measured by a state’s tests, then too-heavily weighting that test could make it harder to identify teachers who are producing other valued outcomes.”

The project studied four ways to weight the measures. They are the following:

  1. Model 1 places 81% of the weight on the achievement gains on state tests; 17% on student surveys and 2% on observations.
  2. Model 2 places 50% of the weights on achievement gains on state tests; 25% on student surveys and 25% on observations.
  3. Model 3 places 33% of the weights on achievement gains on state tests; 33% on student surveys and 33% on observations.
  4. Model 4 places 25% of the weights on achievement gains on state tests; 25% on student surveys and 50% on observations.

Model 1 with the highest percentage (81%) on achievement gains on state tests resulted in the highest correlation with state tests gains. However, it also had the lowest correlation with higher-order tests. Model 3 with an equal 33% weight on each measure produced the highest correlation, .76, to reliability reflecting the year-to-year stability of teacher’s results. Model 4 had a .75 correlation to year-to-year stability of teacher’s results based on 50% on observations, 25% on achievement tests and 25% on student surveys.

Other important findings from this project are the following: 1) the student data that is being used needs to be accurate; 2) classroom observers not only need training on the instrument they need continual scoring practice with master observers; 3) observations should be done by more than one observer; and 4) multiple years of student achievement gains, observations, and student surveys should be used if they are available.

The following table illustrates various methods for districts to consider when observing teachers. The MET project states “Adding lessons and observers increases the reliability of classroom observations. In our estimates, if a teacher’s results are based on two lessons, having the second lesson scored by a second observer can boost reliability significantly.”

Another significant finding was “Our analysis from Hillsborough County showed observations based on the first 15 minutes of lessons were about 60 percent as reliable as full lesson observations, while requiring one-third as much observer time.”

Reliability measures for observations are listed below from highest to lowest:

  1. .72 reliability- Two lessons observed by own administrator; one lesson observed by peer observer; three 15-minute lessons observed by three different observers
  2. .69 reliability- Two lessons observed by own administrator; one lesson observed by one peer observer; one lesson observed by different peer observer
  3. .67 - One lesson observed by own administrator; three 15-minute lessons observed by three different observers
  4. .67 – One lesson observed by own administrator; one lesson observed by peer observer
  5. .66 – Two lessons observed by own administrator; two lessons observed by peer observer
  6. .58 – Two lessons observed by own administrator
  7. .51 – One lesson observed by own administrator

 

 

 

 

 

Very Important Teacher Dismissal Case for Illinois SB7:

A Peoria County judge threw out a lawsuit Friday filed by two former Peoria School District teachers who claimed they were terminated improperly after receiving poor evaluations. This case contested the new Illinois law that places teachers in one of four groups based on their teacher performance evaluation ratings. Teachers rated as Unsatisfactory are placed in Group 1, teachers rated Needs Improvement are placed in Group 2, teachers rated Proficient in Group 3 and teachers rated Excellent in Group 4. Teachers in Groups 1 and 2 are not considered for reemployment if the district has a Reduction in Force. Teachers are laid off first from Group 1 then progressing to Group 4.

The Peoria teachers sued because they said the district did not in fact Reduce in Force any teachers even though several teachers were given RIF notices in the spring. The judge ruled “Chief Judge Michael Brandt ruled in a two-page decision the district acted properly when it pink-slipped Eymarde Lawler and Michelle Frakes at the end of the school year.” Brandt held that “the uncertainty of enrollment and state funding were valid reasons for a reduction-in-force at the end of a school year.”

 

The newspaper article in the Peoria Journal Star stated “Gil Feldman of Chicago, the teachers’ attorney, claimed at a hearing earlier this month a ruling in favor of the district would render seniority rights meaningless.”

How will Illinois handle student growth?

The Performance Evaluation Advisory Council (PEAC), of which I represent IASA, is presently discussing student growth and how student growth will be used in the default model for performance teacher evaluation. School districts and its collective bargaining association will form a Joint Committee equally represented by teachers and administrators to determine how the local district will handle student growth. If the Joint Committee cannot come to agreement within 180 days then the State of Illinois student growth default model is imposed on any section not agreed to.

PEAC has been discussing this issue for several months and is using information from the Bill and Melinda Gates Foundation Measures of Effective Teaching project (MET) to learn more about the research that has been done on using student growth for teacher evaluation. You are encouraged to read this report before entering discussions in your own district related to using student growth for teacher evaluation. You may access this report at http://www.metproject.org/reports.php

The MET project mainly focused on urban schools. According to the report, the consequences of having a highly effective teacher vs. a minimum effective teacher are great. However, 85% of teachers were rated average in this study. Relatively few teachers are having outstanding results with students. Most school districts are rewarding teachers for experience and education that does not reflect performance.

The main finding from the MET project should be very alarming to school administrators and teachers. It is the following:

“Teachers previously identified as more effective caused students to learn more. Groups of teachers who had been identified as less effective caused students to learn less.”

The MET projects concluded that three variables should be used when measuring effective teaching. They are observation from properly trained observers, student achievement gains and student perception surveys.

Key findings included the following:

  • Effective teaching can be measured.
  • Balanced weights indicate multiple aspects of effective teaching (33% to 50% weight should be assigned to student achievement gains). Multiple measures also produce more consistent ratings than student achievement measures alone.
  • Adding a second observer increases reliability significantly more than having the same observer score an additional lesson.
    • Additional shorter observations can increase reliability.
    • Although school administrators rate their own teachers somewhat higher than do outside observers, how they rank their teachers’ practice is very similar and teachers’ own administrators actually discern bigger differences in teaching practice, which increases reliability.
    • Adding observations by observers from outside a teacher’s school to those carried out by a teacher’s own administrator can provide an ongoing check against in-school bias. This could be done for a sample of teachers rather than all.

New Illinois School Report Card

I have served on the state committee that has recommended changes to the new Illinois State School Report Card as mandated by the reform laws (Report Card Public Act 097-0671 Legislative Requirements). The following is a list of the required items for the new report card: (the items that are (***) are indicators for future release)

  1. Student Demographics
    1. Average class size
    2. Ethnic breakdown
    3. % Low income students
    4. % Limited English Proficiency students
    5. % students with IEP or 504 plans that provide Special Education services
    6. % students who annually transferred in or out of the school from another district
    7. Average teaching experience
    8. Per Pupil Operating Expenditure
    9. Per Pupil Operating Expenditure by District Type
  2. Student Outcomes
    1. % students meeting or exceeding state standards on assessments Allow comparisons for five years for different student sub-groups
    2. % of students in 8th grade passing Algebra I
    3. *** % of students enrolled in post secondary institutions (including colleges, universities, community colleges, trade/vocational schools and training programs leading to career certification within two semesters of high school graduation)
    4. % of high school students who are college ready
    5. % of high school graduates who are career ready
    6. *** % of graduates who are enrolled in community colleges or universities who are in one or more remedial courses
  3. Student Progress
    1. % of students in the 9th grade who have earned 5 credits or more without failing one or more core classes
    2. *** % of students who have attended Pre-School (measure of students entering kindergarten ready to learn)
    3. % of students who enter high school on track for college or career readiness
  4. School Environment
    1. Number of different principals at the school in last six years
    2. Two or more indicators from any school climate survey administered pursuant to section 2-3.153
    3. *** % of teachers with less than 10 absences
    4. 3 year average of teachers returning to school
    5. *** % of students with less than 10 absences
    6. *** % of teachers rated as proficient or excellent in their most recent evaluation
  5. Curriculum Information
    1. Advance placement courses
    2. International and Baccalaureate or equivalent courses
    3. Foreign language classes offered
    4. Before and after school programs
    5. Extracurricular activities
    6. Special programs such as programs for gifted and talented
    7. Approved programs of study
    8. Dual enrollment courses offered
    9. Subjects in which elective classes are offered
    10. *** Health and wellness initiatives including per pupil average # of days of PE per week
    11. Awards received by the school
    12. Community partnerships initiatives/programs
    13. Work study programs
    14. School personnel resources
    15. E-Report Card Form 86-43 (Student attendance data; Chronic students; Parental contact; Student mobility/turnover; Days school was in session; Elementary class size data and teacher quality data; High school class size teacher quality (HQ) data; Minutes per week for math, science, English, and social studies for grades 3, 6, and 8)
  1. Accountability Measures
    1. Graduation Rate (four-year cohort and five-year cohort)
    2. *** Index scores to classify high schools into one to five star levels
    3. *** Title I school classification
  1. New Elements
    1. # of homeless students in the district

Several of these indicators are new and will require school administrators to determine a communications plan to your public. For example, the indicator “% of students who are college ready” is currently defined to be students with an ACT Composite score of 20 or better. However, when I think about this metric I would not commit $10,000 to $20,000 plus per year to send my own child to college with an ACT Composite score of 20. Personally, I would use at a minimum the ACT recommended college readiness scores.

According to ACT “The College Readiness Standards are also linked to college instruction. More than 40 years of research has shown that performance on the ACT is directly related to first-year college grade point average.” The ACT Report goes on to state “The Benchmarks are scores on the ACT subject-area tests that represent the level of achievement required for students to have a 50% chance of obtaining a B or higher or about a 75% chance of obtaining a C or higher in corresponding credit-bearing, first-year college courses. These college courses include English composition, college algebra, introductory social studies and humanities courses, and biology. Based on a nationally representative sample of 98 institutions and more than 90,000 students, the Benchmarks are median course placement values for these institutions and as such represent a typical set of expectations.

ACT uses the following scores as benchmarks for college grades:

English 20; Mathematics 22; Reading 21; Science 24 and Composite 21.

As school district leaders it is recommended that you sit down with your district’s administrators and teacher leaders and discuss the various report card indicators. The new Illinois Report Card is due out next fall.

Will using student growth as a measure to evaluate teachers survive?

I have contemplated writing this article about the use of student growth metrics as a measure to evaluate teachers for several months. I am still in the process of forming my final decision but I do know that my own interpretation of using student growth has drastically changed in the two plus years I have been a member of the Performance Evaluation Advisory Council (PEAC).

At the least I would suggest that there are major flaws in using student growth scores for teacher evaluation. However, I do believe we need to look at and consider student growth for both teacher and principal evaluation purposes. If I had the power to make a state level recommendation (which I obviously do not) I would recommend that Illinois use student growth scores in the area of math and reading only and slowly incorporate the score into the teacher evaluation summative rating. An example of this would be the following, use valid and reliable (Type I Assessments) for all teachers who teach reading and math and only count that score for 10% of the teacher summative rating the first two years, 20% the next two years and 30% the next two years and after six years form another committee to determine if the scores are accurately predicting teacher performance. If the scores were determined to not be accurate they would no longer be used for teacher evaluation purposes.

I tweeted a reference to a very good article on the use of student growth for teacher evaluation titled “Who will be held responsible when state officials are factually wrong? On statistics & Teacher Evaluation.” You can read this article here and draw your own conclusion.