| CHAPTER 18 Toward Instructional Process Measurability: An Interbehavioral Field Systems Perspective |
Andrew Hawkins
Tom Sharpe
Roger Ray
We are seeing a change
in focus from the study of one organism over time to the study of the social
interaction between organisms.
Dillon, Madden, and Kumar (1983, p. 564)
It has been an ongoing challenge for educational researchers to systematically
capture the many observable variables relevant to instruction as they occur
in concert in natural settings. At a conceptual level, many similarities exist
between an interbehavioral field systems perspective and the nature of the instructional
milieu. The purpose of this chapter is to take advantage of these similarities
by describing the value of the field systems orientation as it relates to the
measurement of instructional settings, and by detailing the feasibility of a
measurement technology based on an interbehavioral framework. Specifically,
we address this purpose by (1) conceptualizing a field systems perspective as
it relates to the assessment of instructional processes, (2) describing an interbehavioral
measurement system that uses current computer-based technology, (3) presenting
an example of instructional assessment using a field systems perspective in
which longitudinal data point to the efficacy of such an approach in the education
and assessment of teachers, and (4) providing future
potential applications of this approach specifically to teacher education and
generally to behavior analysis. We hope this type of information will provide
behavior analysts with the ability and incentive to focus on the more complex
and observable process interactions that are naturally exhibited between teachers
and students in context. Our goal is to contribute to the further development
of data-driven, behavior-analytic assessment of instructional settings.
Conceptual Foundations
It is easy to agree
with Jackson's (1968) perspective that in the context of typical classroom settings,
events come and go with astonishing rapidity. Individual teachers engage in
hundreds of behavioral interactions within each class period. Accurate observation
and analysis is difficult at best and is at least partly responsible for a "science
versus art" debate with regard to effective instruction. Many educators contend
that a true science
of teaching is beyond our grasp, for it "implies that good teaching will someday
be attainable by closely following rigorous laws that yield high predictability
and control" (Gage, 1978, p. 17). Some argue that effective teaching is spontaneous
and intuitive. Clinical assessments within a class setting regarding how multiple
variables affect the solution to the myriad of daily instructional problems
are said to rely primarily on feeling and artistry (cf., Dawe, 1984; Eisner,
1983; Gage, 1984; Rubin, 1985). In essence, many mainstream contemporary educational
researchers agree that scientific attempts to analyze teaching as a preplanned
or predictive process are much too difficult and time-consuming. Further, they
may even distract from a study of important esthetic, intuitive, and political
factors.
Interbehavioral Field Systems
Behavior-analytic research in natural classroom settings has largely stemmed from a linear approach to experimentation and evaluation (Witt, 1990). Although productive, a linear perspective allows primarily for possible solutions to an immediate concern by examining a limited number of variables. Results are temporally and contextually constricted, often overlooking multiple concurrent and sequential interdependencies. Basic laboratory science functions largely according to linear chains in which functional relationships are established between certain variables. Single factors are usually identified as independent variables, which exert functional control over single dependent variables. However, the subject's behavior in a natural setting is guided by patterns of multiple concurrent and sequential stimuli. Drawing from Gottman and Roy's (1990) sequential analysis illustrations, a comparison of conditional and unconditional probabilities of multiple behavioral events as they flow through time may be a valuable analytic tactic. Knowledge of the temporal characteristics of prior events as they have naturally occurred may assist in the kind of prediction desired in science. It seems, therefore, that the nature of classroom instruction may be more fruitfully assessed as an interbehavioral field comprised of distinguishable but inseparable historical, setting, contextual, and behavioral elements that act to simultaneously and conjointly define the whole.
Before proceeding, we want to familiarize you with fundamental field systems terminology. The language used in this chapter is drawn largely from Roger Ray's methodological work (e.g., Ray, 1992; Ray & Delprato, 1989) and from interpretations of J. R. Kantor's many conceptual contributions (1941, 1946, 1953, 1959, 1969; Kantor & Smith, 1975). First, the term interbehavior is used to emphasize not only an isolated response to stimulus function, but the multiplistic connections among organisms, organismic action, setting factors, and other relevant enabling and impeding conditions in a particular setting. Clearly, context is of primary importance within an interbehavioral analysis. For example, low student achievement may be interpreted and reacted to differently across accelerated and special education settings by both the teacher and student peers.The term field stems from Kantor's (1959) holistic approach to behavioral psychology and Morris and Midgley's (1990) assertion that the behavioral sciences must become increasingly ecological and contextualistic. A field involves a thorough account of observable events that conceivably operate within a setting. Given the static connotation of the term field, the additional term system is added to emphasize the dynamic reciprocal interaction of the many elements contained within a field. A systemic dimension accentuates the presence of at least two or more elements that share varying degrees of enabling or constraining influence on one another, and consequently force the field into a constant state of flux. An interbehavioral field system, therefore, may be defined as a unique, constantly changing, spatial and temporal confluence of multiple behavioral and setting elements.
Redefining the Instructional Setting
The application of linear assumptions to the study of whole classrooms has been conceptually unsatisfactory and epistemologically forced. Multiplistic occurrences of a host of interrelated events are characteristic of instructional settings (Scarr, 1995). Altman (1988) and Altman and Rogoff (1987) argued that instructional elements mutually define one another. They understood the instructional whole as a host of inseparable aspects that serve to mutually explain one another in context. These aspects include the historical characteristics of teachers and students, the setting, the lesson content and activities, and the students' and teachers' current behavior. All of these contribute to the "meaning and nature of ... the [instructional] event' '(Altman& Rogoff, 1987,p. 24). It may readily be discerned, therefore, that an interbehavioral field systems perspective and the emergent view of instruction in a natural context are conceptually compatible.
The question, therefore, becomes "How, then, can such a rich setting be studied, and what questions ought its study be able to answer?" (Salomon, 1991, p. 14). Attempts to redefine a compatible research and evaluation process according to a field systems approach are scarce. Ray and Delprato (1989) offer one of the few generic behavioral systems methodologies available in the literature. A few recently available tactical guides are also more relevant to instructional settings (Frick, 1990; Greenwood et al., 1991; Greenwood, Cm% & Atwater, 199 1; Sharpe & Hawkins, 1992a, 1992b). We now turn our attention to a methodological process that draws from these seminal efforts.
An Assessment
Protocol
A technology that enables concurrent evaluation of people, settings, and activities within particular instructional episodes shows great promise for educational improvement at a local level. Toward this end, we now describe one instructional measurement system (Sharpe, Bahls, & Wood, 199 1) that has been used with undergraduate teacher education students who teach in physical education and special education movement skill settings. [Currently, this evaluation instrument is used by the West Virginia University and University of Nebraska-Lincoln undergraduate teacher certification programs in physical education, and it is in the first cycle of implementation and possible adoption throughout the Lincoln Public School System. Experienced practitioners, university faculty, and select graduate students are working cooperatively on evaluation system training, preservice, teacher instruction, and on-site assessment (cf., Sharpe et al., 199 1). ]
Technological Foundations
The past 20 years may be characterized as a period of rapid technological evolution with readily apparent effects in scientific, technical, and industrial fields. Technology's effect on education in general and teacher education in particular, however, is less discernible (Hawkins & Wiegand, 1987). It is possible that the greatest potential for the application of technology to education may lie in the area of instructional measurement. The assessment of teacher and student process variables as they occur within the context of particular learning environments may be amenable to fresh technological approaches. The recurrent challenge is to find instruments that enable behavioral, ecological, and historical evaluation of particular classroom settings. We believe that this challenge, if met, may lead to direct improvement of educational programs at their most basic level--moment-to-moment teaching/learning activities.
Technological enhancement may make significant contributions to at least two areas in instructional measurement: more exhaustive recording and measurement capabilities in live instructional settings, and instructional evaluation based upon more sensitive data collection. Greater understanding of the variables extant in instructional settings, when coupled with an analysis of the functional relationships among those variables, may enable more effective pedagogical assessment.
Category Systems
When category systems are
used to describe and analyze instruction, it must be remembered that such categories
are somewhat arbitrary delineations of certain stimulus and response classes.
In other words, the sum
total of behavioral and contextual elements is subdivided into functionally
defined groups that are thought to have a relationship to teaching effectiveness
according to the informed opinion and literature of teacher educators. It still
remains to be seen whether the categories in our example, or in any category
system for that matter, are profitable in understanding the relationships in
a learning environment. Categories often evolve into multiple subcategories,
while others are synthesized to better describe and analyze the relationships
in instructional settings. The number of categories that constitute a thorough
description of an instructional system must also be balanced against a logistical
requirement. In essence, the number of categories must be carefully weighed
against the information gained and the ability of observers to collect it. System
parsimony enhances the accuracy and reliability of the data collected.
Nevertheless, category systems provide vehicles for a more systematic and objective perspective on our phenomenon of interest: effective instruction. Therefore, our purpose has been to discover technologies that facilitate the use of category systems in instructional assessment.
The Measurement System
The Behavioral Evaluation Strategy and Taxonomy (BEST) and its companion, the Temporal Analysis System (TAS), were developed in concert by the faculty at the Teachers College at the University of Nebraska-Lincoln and the West Virginia University Sport Pedagogy faculty (Sharpe, Bahls, & Wood, 1991; Sharpe, Hawkins, & Wood, in press). Its purpose is to provide a rich empirical source of information from which instructional assessment may be made. BEST was designed to be used by teacher educators in evaluating preservice and inservice teachers in different subject matters. The system incorporates an extensive category system, including teacher and student behaviors, ecological elements, and historical factors. In addition, it can incorporate new or unique situational elements as the user observes them.
Instructional stimuli, student responses, setting events, and historical context categories were derived from a content analysis of qualitative field notes generated in observations of a variety of subject matter activities by preservice and inservice teachers. Terminology has been taken, when appropriate, from a synthesis of many of the category systems in the teaching evaluation literature, such as the observation tools contained in Darst, Zakrajsek, and Mancini's (1989) compilation and in the work of the Juniper Gardens Children's Project (Dickie, 1989).The system is designed for use with an electronic recording regimen (e.g., S&K Computer Products Event Recorder, Ltd.). Though traditional paper-and-pencil recording techniques could be used, the user would be constrained in the level of complexity that is amenable to observation. Therefore, we will only explain the electronic strategy in detail.
We prefer using fairly sophisticated technology for data collection because we want to provide immediate, real-time information for each behavioral and contextual category (e.g., frequency, average duration, and percentage of total recording time) and a detailed analysis of patterns in time (e.g., the frequency, relative probability, and parametric significance with which defined categories tend to follow one another in time). Only an electronic recording regimen is able to provide a thorough description of complex instructional settings.Six to eight hours of instruction and practice on the system generally produces an acceptable level (r > .85) of interrater reliability (Kazdin, 1982). System accuracy is assessed through observation comparisons with several short segments of a criterion tape. System reliability is assessed by comparing one accurate observation to another observation conducted on the same criterion tape sample one month later. Interobserver agreement is assessed by comparing two simultaneous independent observations of the same criterion tape sample.
Since the data collected must be viewed in the context of certain ecological factors such as unit planning, placement in the unit, class size, activity, and so on, the teacher educator must have a degree of learned interpretation. This is particularly salient with regard to student categories, which are only definable in terms of a particular lesson's goals. Therefore, an ideal generic data profile that may be transferred across lesson contexts does not exist. Rather, a systematic technology is presented in light of its ability to collect a substantial amount of descriptive data to be used to (1) systematically describe an instructional setting, (2) provide context-specific feedback, and (3) provide a longitudinal account of progress through a series of observations and analyses.Category system descriptions. The BEST system consists of a 4-category, 14-subcategory, and 107-event classroom observation system (Table 18. 1). The system enables the recording of simultaneously occurring teacher and student behaviors. Although the category system appears large, only 20 to 30 of the possible 107 events are typically observed within a particular instructional context. For example, only one event within each of the six ecological subcategories will be operative, and the historical subcategories are recorded before the observation begins. The system is organized around four major categories: ecological, teacher behavior, student behavior, and historical. The focus of analysis is on multiple dimensions of discrete events within each of the three major nonhistorical categories and on the proximal relationships in time, or sequential patterns, among these events (Sharpe, 1990; Sharpe & Hawkins, 1992c).
Table 18.1 BEST categories,
descriptions, codes, and examples
|
Categories
|
Number of Codes
|
Description
|
Examples of Codes
|
| Ecological | |||
|
8
|
Service delivery setting | Regular class, resource room, partitioned gymnasium |
|
10
|
Subject matter content | Science, math, English, physical education |
Content stage |
3
|
Temporal status of lesson | Introduction, lesson body, review |
Materials |
6
|
Physical resources | Task cards, pupil folders, workbooks |
Pupil grouping |
3
|
Physical arrangements | Large group, small group, individual |
Method of instruction |
6
|
Stimuli method to occasion responding | Command style, task teaching, questioning, peer teaching, self-instruction, cooperative |
|
Teacher |
|||
Behavior |
22
|
Teacher's behavior relative to student | Observation, instruction, interpersonal, managerial |
Focus |
3
|
How behavior is directed | Individual, general class, nonstudent |
Position |
5
|
Relative proximity to target student | Proximate, distant, central, peripheral, out of room |
|
Student |
|||
Academic |
5
|
Active response | Task appropriate, task engaged, motor, cognitive, verbal |
On task |
5
|
Organizational responses | Transition, absorption, waiting, support, peer instruction |
Off task |
3
|
Academically competing responses | Active disruption, self-stimulation, passive |
|
Historical |
|||
Teacher definition |
11
|
Organismic history of setting impact | Educational certification, years and type of experience |
Student definition |
17
|
Organismic history of setting impact | Age, cultural background, SES achievement and disciplinary history |
For illustrative purposes, we've chosen one category system used in physical education and special education movement skill settings. For the purpose of multiple evaluations across a similar practicum, a more simplistic code drawn from the BEST instrument was used. Table 18.2 displays this category system, which was implemented across several teacher certification practica and inservice settings. Comparison of Tables 18.1 and 18.2 should illustrate how instructional measurement tools for particular contexts may evolve from a more global category system.
Table
18.2 Physical education/Movement skill assessment code
|
Teacher
Behavior
|
Student
Behavior
|
Context
|
| General observation | Motor appropriate | Lesson preview |
| Specific observation | Motor inappropriate | Lesson body |
| Encouragement | Supportive | Lesson review |
| Positive feedback | Instruction of peers | |
| Negative feedback | Cognitive | Teacher locus: Proximate/Distant |
| Management | Self-management | |
| Verbal instruction | Waiting | Teacher locus: Individual/Large group |
| Modeling | Off task | |
| Physical guidance | ||
| Interpersonal | ||
| Off task |
Recording procedures. A data entry program integral to a NEC-PC 8300 laptop computer (S&K Computer Products, Ltd.) is used to record classroom events and prompt historical notations. Historical notations and ecological subcategories are entered at the beginning of an observation evaluation and any time thereafter if they change. Real- time recording procedures are used to record teacher and student behavioral events and those ecological characteristics that change during an observation. Real-time recording involves ascribing a subcategory to an alphanumeric key on the NEC-PC. Keys are either pressed to signal the onset and termination of a particular subcategory or pressed and held for the duration of a particular category. A notation program also allows for the recording of atypical characteristics of particular events. Following collection, data are downloaded into a personal computer for analysis, using software developed by S & K Computer Products and the authors. [A complete data collection and analysis protocol and accompanying category system definitions are available from Tom Sharpe at the University of Nebraska.
Instructional Measurement Procedures
Using BEST as an instructional measurement instrument in our teacher education programs includes three steps: (1) data collection and summation of discrete behavioral and contextual elements (i.e., those elements that change in the course of an observation); (2) analysis of the data set for patterns in time; and (3) data-based goal setting for future instructional opportunities. Selected examples from one instructional episode, shown in Table 18.3, illustrate the type of information gleaned.
Table 18.3 Three-step instructional assessment examples
| Discrete Element Data Summation |
|
Frequency
|
Mean Duration (sec)
|
Percentage Duration
|
Number Per Minute
|
|
| Teacher Behavior | ||||
|
19
|
35
|
24.8
|
0.59
|
|
5
|
11
|
2.1
|
0.11
|
|
15
|
101
|
16.1
|
0.33
|
|
59
|
8
|
48.2
|
1.31
|
| Teacher Context | ||||
|
5
|
28
|
4.8
|
0.11
|
|
57
|
15
|
32.1
|
1.28
|
|
|
||||
|
7
|
19
|
4.1
|
0.17
|
|
11
|
189
|
68.2
|
0.23
|
| Student Behavior | ||||
|
5
|
185
|
33.1
|
0.11
|
|
8
|
135
|
35.2
|
0.20
|
|
|
||||
|
4
|
121
|
16.0
|
0.08
|
|
15
|
35
|
16.1
|
0.33
|
|
Analysis of Patterns in Time
|
| Teacher Observation |
--------->
|
Management
|
@11 (.58)*
|
|
Verbal Instruction
|
4 (.21)
|
||
|
Interpersonal
|
1 (.05)
|
||
| 4(0.80)** Teacher Proximate |
<---------
|
Motor appropriate
|
|
| 4(.080)** Student Support | |||
| 5(1.00)** Individual Focus | |||
| 6(0.75)** Teacher Distant |
<---------
|
Motor inappropriate
|
|
| 8(1.00)** General Focus |
Notes:
@ 11 = number of times the pattern occurred; (.58) = conditional probability of management following teacher observation
* = significant at p = .05
** = significant at p = .01
|
Examples of goal Setting
|
| Provide secondary managerial systems to allow for a primary focus on instruction. |
|
Use observation to focus on instructional concerns. |
| Provide an active supervision pattern to ensure proximity and individual focus when conveying instruction. |
| Use peers as secondary instructional support sources. |
Discrete
element analysis.
Our first step
is to examine the summative characteristics of each subcategory used in the
observation (refer to Table 18.2). Several measurement dimensions (e.g., frequency,
duration, and the temporal percentage of the total observation) are necessary
for each subcategory to understand the dynamics of a complex instructional
setting more thoroughly. For example, frequency may be more important in assessing
high-velocity, low-duration behaviors such as teacher "interpersonal interaction."
On the other hand, duration may be a more indicative dimension of some low-frequency
behaviors such as teacher "nontask interpersonal," or low-frequency contextual
events such as "lesson preview." The number per minute and percentage duration
dimensions indicate how an event functioned in relation to the overall class
time. An insightful inquiry based on a field systems perspective requires
a multidimensional examination.
Analysis of patterns in time. The analysis of patterns in time describes some of the relationships among contextual and behavioral events as they occur temporally. As Table 18.3 displays, frequency, conditional probability (the probability with which a particular subcategory either precedes or succeeds another), and statistical significance (based on conditional and unconditional probabilities, Gottman & Roy, 1990) are represented for preceding and succeeding events around a central subcategory of interest. Pattern-in-time searches may be programmed using preceding events, succeeding events, origination events, and length-of-chain parameters. Then, they may be searched individually or as a logically specified group. For example, all event chains of three that begin with a particular teacher behavior subcategory may be examined (e.g., all behavior chains that begin with verbal instruction and include other key instructional behaviors, such as verbal instruction --> specific observation --> positive feedback). All conceivable chains of two may be explored and represented by a matrix. All subcategories that either precede or succeed a particular subcategory may also be displayed (e.g., all behaviors that precede and succeed teacher modeling). Lagtime specifications may be specified in a pattern-in-time search (a time parameter between the onset of a central event and the onset of others); this is particularly useful if multiple ongoing events obscure a temporal chain of interest.
The major constraint of previous behavior-analytic efforts is that particular variables of interest have been viewed in a linear fashion, removing them from their context and from their temporal association with one another. Other events have been deemed extraneous to such an evaluation. An interbehavioral analysis stands in contrast to this kind of assessment, and the analysis of patterns in time may be the crux of our approach. In complex settings such as exist in classroom instruction, it is quite clear that many events have the propensity to affect many others in a multidirectional fashion. Therefore, evaluation that primarily focuses on which teacher, student, and contextual events tend to be present, absent, or surround others temporally should lend greater insight into how to enhance instructional settings and processes. Goal setting.
An integral
component in our use of the BEST system is data-based goal setting, Approaches
based on the contiguous temporal relationships among behavioral and contextual
elements, and that make a direct connection with student responses, hold much
promise
for educational evaluation. An interbehavioral evaluation strategy seeks to
capture such teacher <--> student relations within the larger umbrellas
of contextual events and historical antecedents. Therefore, following assessment,
goals are provided that relate directly to the discrete element summation and
to the analysis of patterns in time. Of course, future observations may assess
the degree to which goals are attained.
As Table 18.3 illustrates, high-frequency and
high-percentage teacher managerial behavior (f = 59, 48.2%) and a low-percentage
duration of verbal instruction (16. 1 %) indicate a need to reorient toward
a primarily instructional role, This need is again evident in the analysis of
patterns in time. There is a temporal connection between teacher observation
and the ensuing management emphasis. Furthermore, the importance of teacher
proximity and peer support are also seen in this kind of analysis. There are
significant temporal relationships among (1) motor-appropriate student behavior
and teacher proximity, (2) motor-appropriate behavior and student support, and
conversely, (3) motor inappropriate behavior and teacher distance. The need
for more individual focus is also evident, both from the discrete element summation
(individual equals low frequency and percentage, and general equals high frequency
and percentage) and from the analysis of patterns in time. The strong temporal
relationship between individual focus and motor appropriate on the one hand,
and general focus and motor inappropriate on the other underscores the need
for more individual focus.
A caution. A major criticism of a behavior-analytic approach to assessment lies in its perception as a largely technocratic undertaking, driven more by advanced technologies employed by unreflective technicians than by thoughtful professionals (Schempp, 1987). However, it should not be assumed that particular quantitative levels or specific temporal relationships may be ascribed and extended to effective instruction. Rather, a technical tool merely enables a more representative and thorough picture of complex instructional episodes from a dynamic field systems perspective. A field systems procedure, and others that are akin, still requires the interpretive expertise of an experienced, trained educational evaluator. An interbehavioral system simply provides descriptive and analytic techniques for extracting behavioral and contextual information from instructional settings. The purpose is always to understand more fully the dynamics of complex educational environments.
To illustrate the value of a field systems approach to instructional measurement, we present two analyses of instructional performance change for selected events within Table 18.2.
Study I
Study I was a longitudinal analysis of different dimensions of selected teacher and student behaviors of physical education and special education majors in the University of Nebraska-Lincoln (UN-L) teacher certification program (N = 15). Aggregate data were used for each graph and instructional context and subject matter activities were matched cross- sectionally and longitudinally.
Preservice teachers in the undergraduate certification program were familiarized with the conceptual nature of an interbehavioral field systems approach to instructional measurement in the didactic portions of their introductory and intermediate methods classes. They were then repeatedly assessed according to the Table 18. 2 protocol throughout each practicum, student teaching, and in their initial inservice experiences. One aggregate early, mid, and end-of-experience data point for each practicum (i.e., methods practica, student teaching, and inservice) is represented for illustrative purposes in Figures 18.1, 18.2, and 18.3.

Figure
18.1. Percentage of total class time for selected teacher and student behaviors
from the Table 18.2 assessment taxonomy.
Teacher and student behaviors and analytic units were chosen based on initial instruction in the didactic portions of the coursework. For example, before each practicum, the following concepts were repeatedly addressed: (1) increasing class time devoted to instruction and student motor-appropriate time (Figure 18.1); (2) decreasing time devoted to teacher and student management and student waiting (Figure 19.1); (3) increasing the frequency of specific observation, encouragement, feedback, instruction, and interpersonal interaction (Figure 18.2); and (4) emitting particular instructional behaviors in sequence (Figure 18.3).

Figure l8.2.
Average emission number per minute for selected teacher behaviors from the
Table 18.2 assessment taxonomy.
As is readily discerned from Figures 18.1 through 18.3, aggregate trends were demonstrated across all variables in the recommended directions. Though only three aggregate points are represented within each practicum, each graph indicates the efficacy of the assessment protocol with regard to improvement both within and across phases for each variable targeted for change. Of additional interest is the field system orientation's ability to train, alter, and maintain more complex analytic units (e.g., sequences of teacher behaviors).
Figure l8.3. Conditional probability for trained teacher behavior analytic units from the Table 18.2 assessment taxonomy.
Study II
Study II involved an aggregate comparison of employed teachers who had been repeatedly exposed to the evaluation process in their undergraduate program (as in Study I, N = 15) and a similar inservice teacher 3 group that had not been similarly exposed (N = 15). [ Data collection and analysis was made possible by the Oliver E. Bird Fellowship for the UN-L Training and Generalization Project: A Behavioral Evaluation Strategy and Taxonomy for InService Teacher Evaluation, and UN-L Research Council Grant LWT/l 0-215-91601 Real- Time Data Analysis Software Applications.] Over the course of one academic year, a series of ten profiles using the BEST instrument were undertaken at regular intervals for each subject in representing the overall performance for the two groups.

Figure
l8.4. Comparison of percentage of total class time devoted to selected teacher
behaviors for the evaluation aggregate.
Figures 18.4 and 18.5 represent the amount of class time devoted to the teacher behaviors of verbal instruction, management, and interpersonal interactions. A comparison of experimental (Figure 18.4) and control (Figure 18.5) graphs demonstrates that the evaluation process seemed to (1) increase class time devoted to instruction, (2) decrease time devoted to managerial concerns, and (3) prompt greater periods of class time devoted to interpersonal interactions over the course of the academic year.

Figure l8.5. Comparison of percentage of total class time devoted to selected teacher behaviors for the control aggregate.
These findings demonstrate the usefulness of the BEST instrument as a measurement system for teacher education. More important, however, is the use of field systems analyses to understand and develop the temporal relationships among more complex clusters of teacher and student behaviors. Figure 18.6 shows percentages of class time devoted to Student success (i.e., accomplishment of a stated subject matter-related task--motor-appropriate behavior), whereas Figure 18.7 portrays the percentage of student engaged behavior (i.e., attempting a subject matter-related task, but doing so unsuccessfully--motor-inappropriate behavior) for the exposed group across sessions. The findings also represent the probabilities with which selected teacher behaviors evidence themselves in a manner temporally contiguous with the student variables.

Figure l8.6. Conditional probability of select teacher behaviors occurring in temporally contiguity as a function of the percentage of classtime in which motor-appropriate behavior is evidenced.

Figure l8.7. Conditional probability of select teacher behaviors occurring in temporally contiguity as a function of the percentage of classtime in which motor-inappropriate behavior is evidenced.
The evaluation process fostered interactions that enabled larger percentages
of student success with diminished portions of student engagement over the course
of the year. Further, teacher proximate (i.e., the teacher in close proximity
to a particular student) and individual teacher focus (i.e., the teacher attends
to a particular student as opposed to student groups) appear to function as
temporal facilitators of student success, as indicated by the heightened conditional
probability levels that correlate positively over time with levels of student
success. As similar pattern was also seen between teacher distant and general
focus probabilities and student engaged (unsuccessful attempts),
though the levels of student engaged declined across sessions.
When combining student
success with student-engaged class percentages across graphs (i.e., the total
time students spend in subject matter attempts regardless of success), the evaluation
procedure may also be focused on classroom practices, which enable greater portions
of time to be devoted to subject matter practice. This was particularly salient
in light of the diminished portions of total class time devoted to managerial
concerns across the academic year for the experimental aggregate (refer to Figure
18.4).
From these data, it seems that behavior analysis is increasingly capable of examining some of the more complex response classes of human subjects in educational settings using an interbehavioral technology. This has largely been due to a field systems perspective that has emerged contemporaneously with growing computer technologies. The combination has made possible the technological application of a theoretical framework, which has enabled a more complex data collection, analysis, and evaluation process.
Implications and Applications
It is
increasingly clear from this study and the others referenced that technology
is enabling behavior analysts to understand, and even control, more complex
response <--> stimulus configurations. This provides an impetus for evaluating
and altering complex behavioral and contextual interactions in classroom settings.
If this is the case, then two questions arise: Should we do what is technologically
feasible? and Which configurations should be taught now that we know we are
capable? Though it is easy to become mired in the particulars of which educational
ends justify which technological means, the larger question of the general effects
of technology must be considered first.
Therefore, what purpose may we legitimately expect to have served by enhancing our technological understanding of excellence in teaching? Is it to remain merely a tool for assessment, or may we anticipate applying it to the enhancement of other dimensions of the teaching process, including perhaps the teaching of the prospective teacher (i.e., applying technology to teacher education as well as to evaluation)? Several issues seem to be raised by such a vision, not the least of which is the possibility of computer-driven simulations of the process. Perhaps the most realistic form that such simulation might take is that of an interactive video.
Consider, for example, the possibility that evaluation research such as that reported earlier might lead us directly to the ability to discriminate and articulate those teacher activities most influential in guiding desired student learning of a given topic or skill in a given situation. Anticipating that expert teachers are more expert at teaching some things than others, any given simulation effort will surely have its application and validity restricted by the specific expertise and setting that is under study, thus requiring a variety of different simulation efforts.If our interest is in developing such a technology, data must come from a variety of experts doing a variety of things before we attempt to articulate any single approach or strategy appropriate to any given circumstance. In addition, variations between individual students help define what is likely to work and what isn't. Expert teachers not only fail when they try to teach all things, they also fail sometimes in teaching their strongest areas to some select students. But what is the feasibility for bringing simulation technology into such a complex process?
Given the fact that data-based assessment programs such as those reported earlier will likely generate a variety of concrete examples of good and not-so-good teaching over a variety of student learning and setting variations, what if the critical features of these various episodes were reconstructed on videodisc as isolated "scenes" for eventual computer-selected simulation application? Such a corpus of video scenes might well illustrate most, if not all, of the positive and negative event features that new teachers are likely to experience, including the individuals and ecological settings with which they will eventually deal.
Imagine the possibility of playing these various scenes in a wide variety of sequences by making computer-based choices of the types of teacher interactions chosen for illustration. Also imagine that each sequence leads to its own learning outcomes, and that the video database includes illustrative examples of all such outcomes. Such a video simulation system has several possibilities for application and may assume many different structural and functional forms.First, real databases could be used to "seed" the simulation's onset, thereby allowing the simulator to determine sequential transitions from scene to scene. This would be accomplished by following in the same stochastic sequence determined by the type of data being simulated (e.g., poor teaching interaction databases, better teaching databases, and expert teaching databases).
Such a simulator is interactive not with the viewer but with the supplied stochastic database. As such, it mimics typically observed settings, student types, teaching interaction strategies, sequential dependencies among events, durational and other temporal parameters, and so on. It also demonstrates the observed learning outcomes attendant to each type of stochastic database. Good teaching may thus be directly compared to poorer teaching as the user selects the desired playback "script's" level of expertise. And because playback scenes are stochastically driven in sequence, they approximate the actual dynamics and unpredictability of real teaching situations. Yet another
variation on this computer-operated video simulator could allow for the viewer
to interact with the sequence playback by using keyboard inputs.
In this scenario, viewers
could watch poor teaching sequences until they felt they recognized the types
of problems being demonstrated. Then they could intervene to shift the driving
stochastic programs to more-effective strategies by interceding with better-interaction
selections. As a result, they could "teach" the teacher being observed on the
video to use different teaching techniques, thereby learning what really is,
and is not, a "better" technique by observing student performance outcomes reported
via the simulator's play sequence.
The critical factor in all good simulators is whether an expert observer (such as those doing the coding in the evaluation studies) can detect whether a true human expert is in control, or whether a novice or learner is in control. As such, all interactions with the interactive-video simulator should be automatically "coded" by the simulator and used to contrast with the expert teacher's choices of interaction with the training simulator. When they are statistically indistinguishable, the student has personally learned to "simulate" the expert. When such a performance level has been reached, the student should be placed into real-life situations to train generalization.
Because teaching is interactive on several levels, examples of physical interactions and of interactions that are more qualitative and interpretational in character should be included. This level is conveyed by much subtler, but still potentially objective, events that are more difficult to capture in coded descriptions. These events include verbal intonations and inflections, the frequencies of speaking and pausing, choices of words for their connotative (rather than their denotative) meanings, nonverbal communicative gestures and body language, and even social (versus physical) distances that exist between teacher and student. These are the elements that challenge us most in describing how a true expert teacher works. They are the traditional domain of the "artistic" teachers. Nevertheless, art may also be understood and used as a simulation of reality.The challenge lies not only in recognizing the task that lies before us in trying to describe and evaluate good teaching, but also in recognizing the tools that are already available for accomplishing that task. Scientific description is of little use if we cannot use it to reconstruct the original sequences and dynamics
of events being described. Computer and interactive video simulation is, after all, merely a reconstructive process that makes the original events appear to be real-time depictions to those observing and describing the simulation. Eventually, simulation becomes our own quality check on the reality of scientific progress: if we understand and describe the attendant variables well enough to make our simulations appear real, we probably have the technology necessary to use it to train new observers and participants in the process.
In line with
Greenwood, Delquadri, Stanley, Terry, and Hall
(1985), we recommend
four areas for application of these emergent assessment strategies: (1) the
development of a comparative database derived from different instructional settings
as temporal conglomerates of contextual and behavioral variables; (2) the implementation
of subsequent causal analyses of relationships that frequently appear in such
databases; (3) the monitoring of the fidelity of interventions in specific contexts
based on causal conclusions; and (4) the assessment of long-term changes in
contextual and behavioral
dependencies that have resulted from these interventions.
Greenwood, Carta, Arreaga-Mayer, and Rager (1991)
summarized the implications
of instructional evaluation efforts comparable to the one described:
The utility of a search and validate approach to the evaluation of effective instructional practices has only just begun to be undertaken. Because of its ... analysis of classroom behavior, and temporally related situational features of classroom instruction, it is an approach consistent with current school improvement goals. It is also an approach that focuses on the contributions classroom teachers can bring to the development of effective instructional technology. Clearly, demonstrations of the approach are warranted. [pp. 188-1891
As the technological revolution approaches educational
evaluation, careful consideration of the empirical and ethical limits of technology
must be undertaken through the scholarly dissemination of its functional possibilities.
The end of more thorough and accurate educational evaluation strategies will
necessarily combine calculated experimentation with emergent technology.
References
ALTMAN, 1.
(1988). Process, transactional, contextual, and outcome research: An alternative
to the traditional distinction between basic and applied research. Social
Behavior, 3, 259280.
ALTMAN, I.,
& Rogoff, B. (1987). World views in psychology: Trait, interactional, organismic,
and transactional perspectives. In D. Stokolis & 1. Altman (Eds.), Handbook
of environmental psychology (pp. 1-40). New York: Wiley.
BRONFENBRENNER,
U. (1979). Contexts of child rearing: Problems and prospects. American Psychologist,
34, 84-89.
DARST, P.
W., Zakrajsek, D. B., & Mancini, V. H. (1989). Analyzing physical education
and sport instruction. 2nd ed. Champaign, IL: Human Kinetics.
DAWE,H.A.
(1984). Teaching: Social science or performing art? Harvard Educational Review,
54,111-114.
DICKIE, R.
F. (Ed.). (1989). The Juniper Gardens Children's Project. Education and Treatment
of Children [Special monograph issue], 12(4).
DILLON, W. R., Madden, T. J., & Kumar, A. (1983). Analyzing sequential categorical data on dyadic interaction: A latent structure approach. Psychological Bulletin, 94, 564-583.
EISNER, E.
W. (1983). The art and craft of teaching. Educational Leadership, 40,
4-13.
ELLUL, J.
[19541 (1964). The technological society. Trans. J. Wilkinson. New York:
Knopf.
FINN, C. E.
(1988). What ails education research. Educational Researcher, 17, 5-8.
FIRESTONE,
W. A. (1997). Meaning in method: The rhetoric of quantitative and qualitative
research. Educational Researcher, 16,16-21.
FRICK, T.
W. (1990). Analysis of patterns in time: A method of recording and quantifying
temporal relations in education. American Educational Research Journal, 27(l),180-204.
GAGE, N. L.
(1978). The scientific basis of the art of teaching. New York: Teachers
College Press.
GAGE, N. L.
(1984). What do we know about teaching effectiveness? Phi Delta Kappan, 66,
87-90.
GARRISON,
J. W. (1986). Some principles of postpositivistic philosophy of science. Educational
Researcher, 15, 12-18.
GOTTMAN, J.
M., & Roy, A. K. (1990). Sequential analysis: A guide for behavioral
researchers. New York: Cambridge University Press.
GREENWOOD,
C. R., Carta, J. J., & Atwater, J. (1991). Ecobehavioral analysis in the
classroom: Review and implications. Journal of Behavioral Education, 1, 59-77.
GREENWOOD,
C. R., Carta, J. J., Arreaga-Mayer, C., & Rager, A. (199 1). The behavior
analyst consulting model: Identifying and validating naturally effective instructional
methods. Journal of Behavioral Education, 1, 165-191.
GREENWOOD,
C. R., Delquadri, J. C., Stanley, S. 0., Terry, B., & Hall, R. V. (1985).
Assessment of eco-behavioral interaction in school settings. Behavioral Assessment,
7, 331-347.
HAWKINS, A.
H., & Wiegand, R. L. (1987). Where technology and accountability converge:
The confessions of an educational technologist. In G. T. Barrette, R. S. Feingold,
C. R. Rees, & M. Pieron (Eds.), Myths, models, and methods in sport pedagogy
(pp. 67-75). Champaign, IL: Human Kinetics.
HOWE, K. R.
(1988). Against the quantitative-qualitative incompatibility thesis or dogmas
die hard. Educational Researcher, 17,10-16.
HUBERMAN,
M. (1987). How well does educational research really travel? Educational
Researcher, 16, 5-13.
JACKSON, P.
W. (1968). Life in classrooms. New York: Holt, Rinehart & Winston.
KANTOR, J.
R. (1941). Current trends in psychological theory. Psychological Bulletin,
38, 29-65.
KANTOR, J.
R. (1946). The aim and progress of psychology. American Scientist, 34,
251-263.
KANTOR, J.
R. (1953). The logic of modern science. Chicago: Principia Press.
KANTOR, J.
R. (1959). Interbehavioral psychology. Granville, OH: Principia Press.
KANTOR, J.
R. (1969). The scientific evolution of psychology. Vol. 2. Chicago: Principia
Press.
KANTOR,J.R.,&
Smith,N.W. (1975). The science of psychology: An interbehavioral survey.
Chicago: Principia Press.
KAZDIN, A.
E. (1982). Single case research designs. New York: Oxford University
Press.
MORRIS, E.
K., & Midgley, B. D. (1990). Some historical and conceptual foundations
of ecobehavioral analysis. In S. R. Schroeder (Ed.), Ecobehavioral analysis
and developmental disabilities: The twenty-first century (pp. 1-32). New
York: Springer-Verlag.
RAY, R. D.
(1992). Interbehavioral methodology: Lessons from simulation. Journal of
Teaching in Physical Education [special monograph issue].
RAY, R. D.,
& Delprato, D. J. (1989). Behavioral systems analysis: Methodological strategies
and tactics. Behavioral Science, 34, 81-127.
RUBIN, L.
J. (1985). Artistry in teaching. New York: Random House.
SALOMON, G.
(1991). Transcending the qualitative-quantitative debate: The analytic and systemic
approaches to educational research. Educational Researcher, 20, 10-18.
SCARR, S.
(1985). Constructing psychology: Making facts and fables for our times. American
Psychologist, 40,499-512.
SCHEMPP, P.
K. (1987). Research on teaching in physical education: Beyond the limits of
natural science. Journal of Teaching in Physical Education, 6(2), 111-121.
SHARPE, T.
L. (1990). Field systems data: An exploration of alternative visual representations.
Interbehaviorist, 18(2), 4-8.
SHARPE, T.
L., & Hawkins, A. (1992a). Field systems analysis: A tactical guide for
exploring temporal relationships in classroom settings. Teaching and Teacher
Education: An International Journal of Research and Studies, 8, 171-186.
SHARPE, T.
L., & Hawkins, A. (Eds.) (1992b). Field systems analysis: An alternative
strategy for the study of teaching expertise. Journal of Teaching in Physical
Education, 12, 1.
SHARPE, T.
L., & Hawkins, A. (1992c). Field systems analysis: Prioritizing patterns
in time and context among observable variables. Quest, 44, 15-34.
SHARPE, T.
L., Bahls, V., & Wood, D. (1991). University of Nebraska teacher and student
behavioral evaluation strategy and taxonomy (BEST). 3rd ed. Lincoln: University
of Nebraska-Lincoln.
SHARPE, T.
L., Hawkins, A., & Wood, D. (in press). Temporal analysis system software
guide and users' manual. Ontario, Canada: S&K Computer Products, Ltd.
SHARPE, T.
L., Bahls, V., Seagren, S. & Wolfe, P. (1991) A collaborative approach to
practica based teacher preparation: The Nebraska - Lincoln Model. In G. M. Graham
& M. A. Jones (Eds.), Collaboration between researchers and practitioners
in physical education: An international dialogue (pp. 99-96). Champaign,
IL: Human Kinetics.
SHAVELSON,
R. J., & Berliner, D. C. (1988). Erosion of the education research infrastructure:
A reply to Finn. Educational Researcher, 17, 9-14.
WITT, J. C. (1990). Complaining, preCopernican thought and the univariate linear mind: Questions for school-based behavioral consultation research. School Psychology Review, 19, 367377.