People are generally good at anticipating simple motion (McLeod & Dienes, 1996). This ability is critical for effective tracking, navigation, and other forms of interaction with a dynamic world (Freyd & Miller, 1992; Hubbard, 2005). It is not surprising, then, that many studies have found anticipation to be essentially built into observation (Freyd & Finke, 1984). Such an ability is especially critical in threatening situations, when one must, for example, evade a predator or prepare for the actions of a violent rival. Under such circumstances, survival will likely depend not only on one’s ability to react, but also to proactively evade, preemptively strike, or rapidly overcome opponents.

Consistent with this reasoning, people show processing advantages for threatening over nonthreatening stimuli (Anderson, Siegel, Bliss-Moreau, & Barrett, 2011; Devue, Belopolsky, & Theeuwes, 2011; Ford et al., 2010; Olds, Lanska, & Westerman, 2014; West, Anderson, & Pratt, 2009). People are faster, for example, to identify a snake in an array of mushrooms than vice versa (Öhman, Flykt, & Esteves, 2001). Memory for the location of a stationary threat (Terburg, Aarts, & van Honk, 2012) is better than memory for nonthreatening counterparts, and this is consistent with more general findings of improved spatial memory for emotional over neutral stimuli (Mather & Nesmith, 2008).

Similar processing advantages occur for both evolutionarily significant and modern threatening objects (Fox & Damjanovic, 2006; Fox, Griggs, & Mouchlianitis, 2007; Loftus, Loftus, & Messo, 1987; Witt & Brockmole, 2012). Processing advantages begin preattentively, with threatening objects persisting longer in iconic memory, which allows for greater investment in early perceptual processing (Feldmann-Wustefeld, Schmidt-Daffy, & Schubo, 2011; Kuhbandner, Spitzer, & Pekrun, 2011; Phelps, Ling, & Carrasco, 2006). This heightened prioritization characterizes subsequent search behavior, with threats producing longer spontaneous fixation times than neutral objects (Ford et al., 2010; Loftus et al., 1987).

Prioritized processing of threatening information should enhance memory for a stationary threat’s last observed status. Because the threat’s condition is unchanging, accurate memory would optimize future interaction with it. Not only is this the case with the physically stationary threats described above, for which memory of the last observed status of stationary threats is generally accurate, but also in abstract, nonspatial conditions (Lench & Levine, 2005; MacLeod & Mathews, 2004; Mathews & Klug, 1993; Wessel & Merckelbach, 1997; Zadra & Clore, 2011).

Faithful memory representation may not always be optimal, however, even for highly survival-relevant situations (see also Klein, Robertson, & Delton, 2010). Consider immediate memory for dynamic aspects of an unfolding event, which may serve a different function from the sorts of memory discussed above. Whereas memory for past events can facilitate forming new associations, drawing inferences, conveying details to others, and generalizing to similar novel events (e.g., Brewer & Treyens, 1981; Roediger & McDermott, 1995), immediate memory during the course of a dynamic event serves primarily to guide one’s ongoing actions. Because of this, immediate memory may not show the same sorts of threat-related increases in accuracy as do the other functions listed above. Instead, we predict that other processing biases would emerge that would reflect adaptive anticipation of the threat’s next actions. To examine this question, we employed a representational momentum paradigm, which measures distortion toward the future. We describe this phenomenon next.

When viewing simple dynamic scenes, people tend to remember an object’s final position as being displaced forward along its path of motion (Freyd & Finke, 1984; Hubbard & Ruppel, 1999; Munger, Solberg, & Horrocks, 1999; Ruppel, Fleming, & Hubbard, 2009). This representational momentum (RM) effect suggests that memory incorporates the object’s anticipated movement. In the classic RM paradigm (e.g., Freyd & Finke, 1984), static views of (nonthreatening) object configurations are presented in relatively rapid succession to create the impression that one or more objects are moving. At the offset of the final view, participants report the precise final position of the target, typically remembering it as being displaced farther along its path of motion.

Such error has been demonstrated for many different types of objects and movement patterns (e.g., Kerzel, Jordan, & Musseler 2001; Munger & Minchew, 2002; Yoshikawa & Sato, 2008), with the degree of distortion following a number of simple perceptual and conceptual principles (Freyd & Finke, 1985; Hubbard, 2005, 2014; Reed & Vinson, 1996; Ruppel et al., 2009; Vinson & Reed, 2002). For example, the magnitude of forward displacement is a function of both the implied velocity and acceleration (Hubbard, 1995, 2005; Hubbard & Bharucha, 1988) and can be impacted by semantic information, such as by implied weight (Hubbard, 1997), likely direction of motion (Reed & Vinson, 1996), context (Hubbard, 2014), causality (Hubbard & Ruppel, 2002), and cues suggesting agentive or nonagentive motion (Doerrfeld & Shiffrar, 2011; see also Wilson, Lancaster, & Emmorey, 2010).

In the present experiments, we used an RM paradigm to investigate immediate memory for dynamic scenes involving threat. Participants saw simple, abstract visual displays, each consisting of a ring (which corresponded to a moving person with a described intention) and a stationary dot (toward which the ring moved at a constant velocity). The ring’s intention (and, thus, threat status) was manipulated in a vignette read by the participants just before they saw the RM display. This allowed the participants to interpret the motion they observed as reflecting specific goal-directed actions (e.g., Heider & Simmel, 1944), while avoiding many of the complex factors associated with the visual processing of animate stimuli (Freyd & Miller, 1992). The participants’ task was to specify the ring’s final position as accurately as possible.

Consistent with the classic RM literature, we predicted forward displacement in memory for both threateningFootnote 1 and nonthreatening conditions. Additionally, if threatening situations prompt observers to anticipate further into the future than they otherwise would, threat should exaggerate the RM effect and lead to even greater forward displacement. That is, immediate memory for the ongoing movement of threatening objects may be optimized not for accuracy, but for evasive responding.

Experiment 1: One moving object (threatening vs. nonthreatening)

The participants in Experiment 1 were told that each trial would depict schematized video surveillance of a real-world situation involving two people. Each scenario consisted of a stationary dot and a ring that appeared to move along a straight path toward the stationary dot. The moving ring’s speed, direction, and ending distance from the dot varied systematically across trials and were independent of each other. Thus, participants could not anticipate an object’s ending position simply on the basis of its starting location. Velocity varied across trials, but within a trial, velocity remained constant.

Before each trial, the participant read a brief vignette that provided a conceptual framework (threatening or nonthreatening) within which to interpret the ring’s motion. (See Appendix A for examples of each type.) The vignettes described each entity’s role in the scenario (e.g., robber [ring] and store clerk [dot], or customer [ring] and store clerk [dot]) and the specific intention of the person who was approaching the other (e.g., to attack the store clerk or to hand him an item to be purchased). All vignettes ended with a sentence of a standard form: [ring] is approaching [dot] in order to [action]. This sentence always described an action in progress and never described a final position or the completion of a goal.

Again, we predicted that people’s memory for the ring’s final position would be displaced forward for both threatening and nonthreatening conditions, demonstrating the RM effect. If processing of a threatening scenario prompts observers to further exaggerate the target’s forward motion, we should see increased RM for the rings described as having threatening intentions.

Norming

Participants

We recruited 34 participants using Amazon Mechanical Turk to participate in this norming task. Each participant was paid $0.50, and none participated in any of the other studies reported here.

Materials and procedure

Each participant in the norming study read the text for a subset of the vignettes used in Experiment 1. No participant read both a threatening vignette and a nonthreatening vignette that involved the same object (e.g., axe used in a threatening manner and axe used in a nonthreatening manner). The norming participants performed four tasks for each vignette. First, they estimated how close, in feet and inches, the moving character in the described scenario intended to ultimately be from the stationary character. They made their distance judgment using a slider bar on a scale with the end labels 0 feet and 20 feet. After making a distance estimate, they rated the intensity of the scene, the level of threat in the scene, and the level of happiness in the scene. They made these responses using 7-point Likert scales that ranged from none at all to extreme.

Results

There were no differences in participants’ estimates of the intended final distance of moving characters from their destination between the threatening (M = 3.26 feet, SD = 2.46) and nonthreatening (M = 3.56 feet, SD = 1.77) vignettes, t(33) = 0.91, p = .42, d = 0.14. People found the threatening vignettes (M = 1.3, SD = 0.4) to be less happy than the nonthreatening ones (M = 2.95, SD = 0.89), t(33) = 14.31, p < .001, d = 2.39, as expected. They rated the threatening vignettes as more intense than the nonthreatening ones (threatening: M = 6.06, SD = 0.93; nonthreatening: M = 2.93, SD = 0.89), t(33) = 12.05, p < .001, d = 3.44. Moreover, they rated threatening vignettes as more threatening than nonthreatening ones (threatening: M = 6.43, SD = 0.50; nonthreatening: M = 1.6, SD = 0.72), t(33) = 35.83, p < .001, d = 7.79, again as predicted. Thus, although the threatening vignettes were perceived as more threatening, more intense, and less positive than the nonthreatening vignettes, they did not differ in the ultimate ending distances from the destinations that people associated with these scenarios.

Method

Participants

Sixty-eight Stony Brook University undergraduates (38 female, 30 male) participated for partial course credit. The participants were assigned randomly to the threatening or the nonthreatening condition.Footnote 2 Half of the participants in each of these conditions always saw a blue ring, and half always saw a green ring.Footnote 3 All other factors varied within participant.

Materials and procedure

The experiment was conducted on a Dell desktop computer using Superlab presentation software. Screen resolution was set to 800 pixels × 600 pixels, and participants sat at a comfortable distance for viewing and responding with no positioning restrictions (approximately 65 cm from the screen). Participants were instructed to suppose that they were employed as an analyst working for the FBI and that they were taking a test to advance to a more elite unit. Each trial represented a schematized security camera video, and the participants’ job was to report the final presented position of the moving person. The people in the security videos were depicted as a colored ring and a black dot. The colored ring always appeared to move toward the black dot, which remained stationary. Both the ring and the dot had a diameter of 30 pixels, and the ring had a 10-pixel hollow center. By using rings and dots as surrogates for people, we maintained consistent perceptual conditions between the threatening and nonthreatening conditions. Thus, factors such as size (Hubbard, 1997) and shape (Kelly & Freyd, 1987) were visually identical for all participants. Additionally, the participants were informed that because they were watching only part of the event, the rings might not actually reach the black dots (in fact, they never did). Participants were instructed to pay careful attention to the rings’ final locations, since their responses would be scored for accuracy.

Before each trial, participants read one of 31 three-sentence vignettes about the people depicted in the scene (Appendix A). Each vignette described the person represented by the colored ring as holding a particular object (e.g., a pan) that under most circumstances would not be interpreted as a weapon but that could potentially be used as one. (For a complete list, see Appendix B.) For half of the participants, the person holding this potential weaponFootnote 4 was described as approaching the person depicted as a black dot, with the intention of harming him or her (e.g., to attack a former friend with a frying pan). The other half involved a harmless intended interaction (e.g., to give a friend a frying pan). By using a verbal description of the scenario, we were able to avoid factors associated with object complexity (Quinlan, 2013) and manner in which the object was held (Pickel, 1999). Additionally, we attempted to match, as best as possible, the average implied interpersonal distance of the intended threatening and nonthreatening actions within the referent scenarios.

After reading the vignette, participants used the mouse to click the center of the screen. When they did this, the cursor disappeared and the trial began. Each trial consisted of a series of four white displays, followed by a colored visual mask and then a white response screen. Over the course of the trial, a series of screens depicted a ring undergoing implied motion toward a stationary black dot. The initial screen showed the black dot and a colored ring, for 500 ms, allowing participants to view the entire image and note the intended destination of the ring before it started to move. This was followed by a 250-ms interstimulus interval (ISI), during which participants viewed a white screen containing only the black destination dot. The movement then began with a second presentation of the initial configuration, now for 250 ms, followed by a 250-ms ISI, then the second stimulus screen for 250 ms (Fig. 1). A 250-ms visual mask followed the fourth stimulus screen. The mask was the same color as the ring, and it covered the entire screen except for the stationary black dot. Thus, the mask removed all evidence of the ring’s final position. Finally, the response screen was presented, consisting of the black dot against a white background. A cursor (presented in the form of a 10-pixel cross) appeared at the center of the screen, and participants used the mouse to place the cursor where the center of the moving ring had last been seen and then clicked the mouse to record their response. For 75% of trials, the program then advanced to the next vignette and trial. For the other 25%, an open-ended question asked either what the ring in the immediately preceding trial had been described as carrying or what it had intended to do. These catch trials were randomly interspersed throughout the experiment to make sure that the participant was attending to the information in the vignettes.

Fig. 1
figure 1

Each trial began with a description of the characters and goals. After a keypress, the initial configuration was displayed for 500 ms, followed by a 250-ms ISI during which only the stationary black dot was depicted. Starting with a redisplay of the initial configuration, each stimulus screen appeared for 250 ms, with a 250-ms ISI. A 250-ms mask screen followed the fourth stimulus screen, and then the response screen appeared and remained until the participant responded

The ring appeared at a unique location on the screen at the beginning of each trial. Across the four stimulus screens, the ring was implied to move in a straight line toward the stationary dot. During the trial, the ring showed implied movement of 210, 270, or 330Footnote 5 pixels (or 70, 90, or 110 pixels per stimulus screen) for the “slow,” “intermediate,” and “fast” trials, respectively. Across the 129 trials (five practice, 124 experimental), the ring was equally likely to move at each of the three speeds (due to a programming error, however, the experimental trials consisted of 42 slow, 41 intermediate, and 41 fast trials).

The ending distance of the colored ring from the stationary dot varied across trials between 99 and 226 pixels, and all 43 of these ending distances (including for practice trials) were distributed equally across the three implied speeds. Finally, the movement was depicted equally often as upward, downward, leftward, and rightward.

Results and discussion

We removed seven participants’ data due to failure to follow instructions, and one participant’s because of an error rate above 50% in the open-ended catch trials. This left 60 participants’ data for analysis. Because the destination dot stayed on the screen during the probe task, participants who had attended to the trial would have been able to infer the ring’s axis of movement. Therefore, any trial in which a participant indicated a location that deviated by more than 100 pixels orthogonally from the ring’s movement axis was removed from the analysis. Furthermore, trials in which a participant took longer than three SDs from their mean response time were also removed from the analysis. Together, these criteria resulted in the removal of less than 2% of the data.

We measured the displacement of each participant’s response from the ring’s actual final position. Displacements were defined only with respect to the axis of movement and reflected either positive or negative displacement of the cursor’s placement relative to the actual location of the ring’s center. Positive values refer to displacement farther along the ring’s path of motion, and negative values refer to displacement behind the ring’s actual ending location. All data are reported in terms of pixels of displacement from the ring’s center.

To test for a classic RM effect, we conducted a single-sample t test using only data from the nonthreatening condition (comparing their displacements to the null-hypothesis prediction of zero pixels of displacement). Participants in the nonthreatening condition displaced the ring forward an average of 22.87 pixels (SD = 17.97) along its path of motion, t(29) = 6.97, p < .001, d = 2.58, remembering it as having traveled farther than it actually had, and thus producing a classic RM effect.

We next examined the extent to which this forward displacement was sensitive to the factors of interest, with the primary goal of investigating the role of threat. Secondarily, and consistent with classic RM, we should find greater forward displacement at greater implied speeds, possibly moderated by boundary effects (Hubbard, 1994; Huttenlocher, Hedges, & Duncan 1991) as a function of the ring’s final distance from its destination. To do this, we consolidated the large number of ending distances into near and far categories, demarcated at the median ending distance of 163.5 pixels. We conducted a 2 (Scene Type: threatening vs. nonthreatening) × 3 (Ring’s Implied Speed: slow, intermediate, fast) × 2 (Ending Distance: near vs. far) × 4 (Direction: up, down, left, or right) × 2 (Ring Color: blue vs. green) analysis of variance (ANOVA). Color and direction were not expected to produce a main effect or to interact with any other factor, nor did they (ps > .1). Thus, we collapsed across both factors for all further analyses.

Impact of threat on RM

As predicted, threat increased the magnitude of RM, F(1, 58) = 4.24, p < .05, η p 2 = .068. Importantly, this suggests that people anticipate the motion of threatening objects (M = 34.6 pixels, SD = 25.5) to a greater extent than the motion of nonthreatening objects (M = 23.0 pixels, SD = 18.3), and that this anticipated motion is incorporated into immediate memory for the object’s final location.

The magnitude of RM effects was greater when the ring’s final position had been far from its destination than when it had been near, F(1, 58) = 99.59, p < .001, η p 2 = .632. This main effect is not surprising, and any of a number of possible reasons might underlie it: greater ending distances from the destination allowing for greater RM, the repulsive influence of landmarks on very near objects (Sheth & Shimojo, 2001; Werner & Diedrichsen, 2002), and/or the regression of responses toward intermediate distances. In any case, and most importantly, this main effect was independent of and cannot explain the main effect of threat on RM magnitude.

A main effect of implied speed emerged, F(2, 116) = 10.43, p < .001, η p 2 = .152, with greater forward displacement occurring at the slowest speed (M = 29.73 pixels, SD = 22.18) than at intermediate (M = 28.99 pixels, SD = 23.07) or fast (M = 27.66 pixels, SD = 23.66) speeds, F(1, 58) = 20.28, p < .001, η p 2 = .259. We have no explanation for this puzzling pattern, which is inconsistent with typical findings in the RM literature (e.g., Freyd & Finke, 1985). However, an interaction between speed and scene type helps clarify this anomalous finding, F(2, 116) = 7.84, p < .001, η p 2 = .12. Namely, the greater displacement at slower speeds occurred only for the neutral scenes (slow: M = 25.24, SD = 18.62; intermediate: M = 23.32, SD = 24.74; fast: M = 20.45, SD = 17.66), F(2, 58) = 13.12, p < .001, η p 2 = .31. In contrast, the threatening scenes showed no effect of speed, F(2, 58) = 0.16, p = .85 (Fig. 2).

Fig. 2
figure 2

Relationship between threat and implied speed (slow, intermediate, or fast) in memory for the target’s final position (Exp. 1). (Error bars refer to standard error of the mean.)

The results of Experiment 1 show that interpreting a scene as threatening leads to increased forward displacement in memory for the threatening object. The most adaptive, and therefore most likely, response to threatening objects may be evasion. Wherever the object is heading, the observer might best head in the opposite direction. This is the case even if the depicted threat is to a third party, since a dangerous agent’s intended target can change abruptly, and an observer can quickly become a victim. The exaggerated forward displacement errors seen in these results are consistent with such a bias and provide greater opportunity to evade the threat. This is in contrast to neutral objects, which are associated with many likely types of anticipated action, including passive observation of, navigation around, or interception of the object, and all of which are optimized by faithful calculation of the object’s next position.

Experiment 2: Multiple agents

The results of Experiment 1 suggest that people anticipate the motion of threatening objects further into the future than they do for nonthreatening ones. However, because the participants in Experiment 1 were presented with scenes that involved a single threatening or nonthreatening object, an equally plausible alternative remains: The heightened tendency to anticipate may apply to all moving objects in the threatening situation, as if a mental simulation of the entire event were advanced uniformly. In Experiment 2, we examined whether this is the case, or whether the effect of threat is exclusive to the threatening object itself. To test this, the participants in Experiment 2 viewed several scenes, each showing two moving objects. Participants viewing threatening scenes observed one threatening and one nonthreatening object within each scene. Participants viewing nonthreatening scenes observed two nonthreatening objects within each scene.

If the increased RM for threatening scenes in Experiment 1 was specific to the threatening objects themselves, then Experiment 2 should show different degrees of displacement for the threatening and nonthreatening objects within a single scene. This would suggest that threatening objects have a special status and that their motion is uniquely exaggerated, perhaps to enable proactive development of an action plan to evade. Indeed, avoidance-related emotions such as fear are known to produce narrowed, local processing at the expense of more global processing (e.g., Basso, Schefft, Ris, & Dember, 1996; Derryberry & Reed, 1998; Gasper & Clore, 2002). One advantage of restricted exaggeration of this sort may be more efficient processing of the threatening object, for which selective attention and action would be adaptive (Friedman & Förster, 2010; Grafton, Watkins, & MacLeod, 2012; Harmon-Jones, Price, & Gable, 2012; Kuhbandner et al., 2011; Morelli & Burton, 2009).

Alternatively, it is possible that threat would lead to increased extrapolation for the entire scene. The stimuli, after all, depict motion within a single, integrated situation. It is plausible that in extrapolating any element of this physical system, observers would concurrently extrapolate other elements as well. Beyond this, the vignette describing goal-directed behaviors of the agents moving within this scene would encourage the interpretation of this as a single, integrated situation (e.g., Radvansky, 2005). Thus, RM magnitude may be enhanced for all moving objects within a threatening trial. Such a finding would be consistent with the attentional-broadening and global-processing effects reported by some researchers (e.g., Brunyé, Mahoney, Augustyn, & Taylor, 2009; Skosnik, Chatterton, Swisher, & Park, 2000). Finally, exaggerated RM for nonthreatening objects in a threatening situation would have the advantage of predicting potential obstructions to one’s evasive or preemptive actions.

Method

Participants

A group of 91 Stony Brook University undergraduates (48 female, 43 male) participated in this experiment for partial course credit. As in Experiment 1, scene type (threatening vs. nonthreatening) was varied between participants, and all other factors were varied within participant.

Materials and procedure

The procedure of Experiment 2 was similar to Experiment 1, with the following exceptions. All trials involved two rings, which were depicted as moving simultaneously, each toward its own stationary destination dot. Participants saw a static depiction of the objects’ initial positions and initiated the trial with a keypress. At the end of the trial, the visual mask matched the color of one of the two rings, indicating the ring for which participants were to report the final position. The participants in Experiment 2 completed 20 practice trials (because of the increased difficulty of the task), and 84 experimental trials (3 Speeds × 4 Directions × 7 Repetitions). In addition, 12 four-alternative forced choice (4AFC) catch trials immediately followed each of the 12 randomly selected vignettes.

Because of the presence of two moving objects, we made a number of changes to the stimuli. We used two different ring colors (teal and yellow) and two different destination dots (blue and black). The teal ring always moved toward the blue dot, and the yellow ring always moved toward the black dot. Although the rings had the same diameter as those in Experiment 1, they had a larger (20-pixel) hollow center. The rings never crossed each other’s paths, but they otherwise moved independently of each other with respect to distance (and thus speed) and direction. The two rings moved in the same direction, in opposite directions, and in orthogonal directions (e.g., left and up) equal numbers of times across trials. To ensure that the rings never crossed paths, the distances of implied motion were reduced to 90, 180, and 270 pixels.Footnote 6

Because there were two moving rings, each vignette in Experiment 2 described two people. As in Experiment 1, one ring was described as holding a potential weapon (Appendix B), and this ring was described as intending to use the potential weapon in either a threatening manner (e.g., attacking someone with a broken bottle) or a nonthreatening manner (e.g., heading toward someone with a broken bottle to be discarded in the trash). The second ring was always described as holding an inherently innocuous object (e.g., a sandwich). This ring performed the same action regardless of condition (e.g., asking for assistance with a purchase). Each specific combination of potential weapon and inherently innocuous object occurred equally often across participants.

Results and discussion

The exclusion criteria for Experiments 1 and 2 were the same, except for the acceptable error rate on the catch trials. This was adjusted, since Experiment 2’s catch trials were 4AFC rather than open-ended questions, and they occurred immediately after presentation of the story rather than after a trial. Thus, we set a 33% error rate on catch trials as our criterion for removing a participant’s data from the analysis, and we excluded 13 participants using this criterion. We also excluded the data from 12 participants who failed to follow the instructions for reporting ring position. Fewer than 2% of the individual trial data were removed from the remaining 66 participants.Footnote 7

As in Experiment 1, we tested for a traditional RM effect by examining memory for the probed ring’s final position within the nonthreatening scenes. We report all data in pixels of displacement. Generally, memory for the final position of the probed ring was displaced forward along its path of motion for both potential weapons (M = 37.19 pixels, SD = 16.13), t(34) = 13.45, p < .001, d = 4.61, and inherently innocuous objects (M = 38.54 pixels, SD = 17.09), t(34) = 13.15, p < .001, d = 4.51. Therefore, the rings showed RM regardless of the type of object they were described as holding.

To test whether the enhanced RM observed in Experiment 1 applies to all aspects of a threatening scene or applies specifically to the threatening object, we conducted a 2 (Scene Type: threatening [which contained one threatening ring and one nonthreatening ring] vs. nonthreatening [which contained two nonthreatening rings]) × 2 (Status of Probed Ring: potential weapon vs. inherently innocuous object) × 3 (Speed: slow vs. intermediate vs. fast) × 4 (Direction: up vs. down vs. left vs. right) × 2 (Color: teal vs. yellow) mixed ANOVA on memory for the probed ring’s final position. As in Experiment 1, neither ring color nor direction produced a main effect or an interaction with any other factor (ps > .1), so we collapsed across these factors for all further analyses.

If exaggerated RM occurs only for the threatening object itself, we should see an interaction between the scene’s threat status and the individual object’s threat status. We found no main effect of scene threat status on the degree of forward displacement, F(1, 64) = 0.002, n.s. That is, exaggerated RM due to threat did not hold at the level of scene, suggesting that people did not represent threatening situations as if they had uniformly advanced further ahead in time. In addition, no main effect emerged to suggest RM differences between the potential weapons as a class and the inherently innocuous objects, F(1, 64) = 0.15, n.s.

We did, however, find the predicted interaction between the threat status of a scene and the probed object’s threat status, F(1, 64) = 5.25, p < .05, η p 2 = .076 (Fig. 3). We take the data from nonthreatening scenes (see Fig. 3, leftmost two bars) to represent baseline displacement magnitudes, uninfluenced by the additional factor of emotional content. We had expected that described objects in the nonthreatening scenes would produce roughly equivalent levels of RM. However, participants may have taken into account the greater physical effort required to carry a baseball bat, tire iron, or golf club, as compared with a sandwich, bottle of juice, or magazine. This may underlie the differences in baseline RM magnitudes. Regardless, given these baseline displacement levels, the reversed pattern for these same objects in the threatening condition (Fig. 3, rightmost two bars) becomes even more remarkable. Thus, threat apparently increases RM specifically for threatening objects, rather than for all objects within a threatening scene. This suggests that in threatening situations, processing may be prioritized to evade only the object that is threatening.

Fig. 3
figure 3

Magnitude of forward displacement in Experiment 2, as a function of scene threat status and the type of object probed. (Error bars refer to standard error of the mean.)

One might wonder whether the increased RM for threatening objects arose because participants attended more closely to the threatening ring than to the nonthreatening one (Hayes & Freyd, 2002). Since the rings moved independently and never crossed paths, it would have been challenging to simultaneously track both moving rings closely (e.g., Allen & Gabbert, 2013). Given that inherently threatening objects (e.g., Loftus et al., 1987; Öhman et al., 2001) and implied threats (e.g., Cutler, Penrod, & Martens, 1987) disproportionately draw attention, it is possible that visually neutral stimuli (rings) that are interpreted as threatening would do so as well. However, dividing attention between competing objects increases the magnitude of RM (Hayes & Freyd, 2002), so we would expect that preferential tracking for the threatening ring would have led to better rather than worse location memory, unless the anticipation effects we predicted were also in place. We also would have expected more random responding for the trials that probed inherently innocuous items than for threatening rings, resulting in more eliminated data. Neither of these patterns emerged. On the contrary, participants were no more likely to make major errors in either condition, t(64) = 0.37, n.s., and innocuous objects in threatening scenes systematically produced classic RM. Therefore, the participants must have kept track of the movements of both objects.

In addition, the classic RM effect of speed emerged, with greater forward displacement for the two faster speeds (intermediate: M = 39.43 pixels, SD = 16.79; fast: M = 40.20 pixels, SD = 17.53) than for the slow speed (M = 34.17 pixels, SD = 14.06), F(2, 128) = 33.07, p < .001, η p 2 = .341. Speed did not interact with any other factors (all ps > .05).

Experiment 3: Arousing positive (happy) scenes

Although Experiments 1 and 2 demonstrated increased anticipated motion for threatening relative to nonthreatening stimuli, it is possible that this effect was not due to threat-specific processing. Threats share many of the same properties as other emotional stimuli (Arnell, Killman, & Fijavz, 2007; Bar-Haim, Lamy, Pergamin, Bakermans-Kranenburg, & van IJzendoorn, 2007; Calvo & Lang, 2004; Forgas, 2008; Lerner & Keltner, 2001), including emotional and motivational properties (e.g., Gasper, 2004; Harmon-Jones et al., 2012). It is possible, then, that any motivated, emotional situation would lead to exaggerated RM, which would suggest that the exaggerated effect is not unique to threat.

If exaggerated anticipation is beneficial across a range of emotionally significant events, then emotions like joyful excitement, for example, might lead to patterns of RM similar to those we have demonstrated for threat. This prediction is consistent with Woody and Teachman (2000), who found that perceived proximity varied with arousal, and with Mather and Sutherland (2009), who found arousal to be a better predictor than valence for location memory binding.

Several findings, in fact, point to the possibility that exaggerated forward distortion should be especially likely for happy scenarios. First, happiness is considered to be an approach-related emotion (e.g., Gable & Harmon-Jones, 2011), biasing cognition toward appetitive stimuli. Consistent with this, appealing locations are perceived as being closer than unappealing ones (e.g., Alter & Balcetis, 2011).

Second, reality monitoring tends to be poorer for positive than for negative events (e.g., Kensinger, O’Brien, Swanberg, Garoff-Eaton, & Schacter, 2007; Storbeck & Clore, 2005). If this reduces people’s ability to reject their own immediate memory biases, they may have poorer access to accurate memory for the target’s final position. Thus, RM effects should increase.

Third, readers’ degree of personal engagement with narratives impacts the nature and use of situation models, as well as impacting other downstream cognitive processes (e.g., by means of participatory responses; Albritton & Gerrig, 1991). If happy scenarios produce higher rates of participatory responses and approach orientation than neutral scenarios, then characters who express happy excitement (e.g., because they have found a lost family heirloom) should produce greater RM magnitude than neutral characters (e.g., who were simply lending their daughter an old necklace).

The findings of Experiments 1 and 2 may, on the other hand, highlight the way in which threat differs from other emotions. In the presence of threat, as we have discussed, evasion of the threat may be an observer’s best future action. In such a case, anticipation farther into the future may trump a narrower window of anticipation. It is valuable not just to know where the threat will be in the next moment, but also to know its likely position after that. This emphasis on future behavior may not be as relevant for other emotions. Since people already anticipate the actions of moving objects (Freyd & Finke, 1985), this may be sufficient for the form of precise, coordinated movement motivated by emotions like happiness. According to such reasoning, the happy scenarios of Experiment 3 might not produce exaggerated RM effects. In this experiment, we used a simple two-object design similar to that of Experiment 1 in order to provide the most straightforward opportunity to test these competing predictions.

Norming

Participants

We recruited 70 participants using Amazon Mechanical Turk to participate in this norming task. Each participant was paid $0.50.

Materials and procedure

The same procedure was used to norm the materials for Experiment 3 as we had used to norm the materials for Experiment 1, except that participants were presented with the vignettes used in Experiment 3.

Results

All analyses were conducted using repeated measures t tests. There were no differences between the happy (M = 1.25, SD = 0.63) and neutral (M = 1.20, SD = 0.51) vignettes with regard to perceived threat, t(69) = 1.08, p = .29, d = 0.09. As predicted, people perceived the happy vignettes to be more intense than the neutral vignettes (happy: M = 6.08, SD = 1.00; neutral: M = 3.07, SD = 0.88), t(69) = 23.27, p < .001, d = 3.22, as well as more happy (happy: M = 6.31, SD = 0.69; neutral: M = 3.47, SD = 1.01), t(69) = 20.32, p < .001, d = 3.28. Participants rated the ultimate ending distance from the destination as being closer for the happy scenarios (M = 1.91 feet, SD = 1.74) than for the neutral ones (M = 2.59 feet, SD = 1.79), t(69) = 4.38, p < .001, d = 0.39.

Method

Participants

A group of 136 Stony Brook University undergraduates (85 female, 51 male) participated in this experiment for partial course credit. As in the previous experiments, emotionality was varied between participants, and all other factors were varied within participant.

Materials and procedure

The orienting task, objects, and scenes used in Experiment 3 were the same as those in Experiment 1, with a few differences. The vignettes described either neutral or highly positive emotional scenes (Appendix A). The single moving ring was described as holding an object (e.g., a pair of concert tickets) to show or give the black dot, and the vignette ended with the ring being described as approaching the black dot. As with Experiments 1 and 2, the objects depicted in the neutral and happy vignettes were the same (Appendix B). The vignettes in Experiment 3 were six sentences long. This increased length was necessary to establish the emotional contexts of the scenes.

To increase the salience of the semantic content of the vignettes for participants, we increased the number of catch trials to 31 free-response questions. Participants completed four practice trials (one moving in each direction) and 72 experimental trials (2 Speeds [210 and 330 pixels] × 4 Directions of Movement, with nine observations in each of these cells). We used 36 different ending distances in this experiment.

Results and discussion

Using the same exclusion criteria as for Experiment 1, we removed 20 participants’ data for failure to follow the instructions. Less than 1% of the data were removed for the remaining 116 participants (58 in each condition). None of the participants’ data were removed due to low catch trial accuracy.

As in Experiments 1 and 2, a single-sample t test showed that participants tended to remember objects as being farther along their path of motion in the neutral condition (M = 21.19 pixels, SD = 21.67), t(57) = 7.47, p < .001, d = 1.98, again demonstrating a classic RM effect.

A 2 (Scene Type: happy vs. neutral) × 2 (Speed: fast vs. slow) × 2 (Ending Distance: near vs. far) ANOVA was conducted to examine whether people anticipated motion to a greater extent for happy than for neutral scenes. Participants did not extend the ring’s motion for happy scenarios (M = 21.92 pixels, SD = 17.12) to a greater extent than for their neutral counterparts (M = 21.12 pixels, SD = 21.67), F(1, 114) = 0.41, n.s. (Fig. 4). That is, we saw no evidence that people extend the motion of objects in happy scenes further into the future in the same manner as they do for threatening scenes. This occurred in spite of the relatively greater statistical power in this study than in Experiments 1 and 2, due to both the larger number of participants and the reduced design complexity. The results suggest that the exaggerated forward displacement found in Experiments 1 and 2 for threatening moving objects may be attributable specifically to the processing biases associated with threat, rather than to processing biases that cut across a broader range of emotionally arousing situations. Despite the approach motivation typically associated with happiness, the anticipated actions associated with happiness still favor relatively more accurate immediate memory for positions within a dynamic scene. That is, in a happy situation, observers are more likely to want to interact with things than to evade them, and interaction is best served by accurately predicting where objects will be next. Consistent with Experiment 1, rings whose motion ended farther from the destination dot produced greater forward displacement (M = 28.28 pixels, SD = 25.86) than did rings whose motion ended closer (M = 14.92 pixels, SD = 13.94), F(1, 114) = 121.47, p < .001, η p 2 = .516. In contrast to typical findings in the literature, and consistent with Experiment 1, slow rings (M = 22.58 pixels, SD = 19.94) produced greater forward displacement than did fast rings (M = 20.52 pixels, SD = 19.06), F(1, 114) = 41.88, p < .001, η 2 = .269.

Fig. 4
figure 4

Representational momentum for happy and neutral scenes in Experiment 3. (Error bars refer to standard error of the mean.)

Before considering a null hypothesis, it is important to determine whether the experiment has the requisite power to detect an effect. Because we used the same stimuli as in Experiment 1, for the power analysis we used the observed effect size from Experiment 1 (η p 2 = .068), which amounts to a medium effect (Cohen, 1992). We determined that we would need to run 108 participants to obtain a power of .8. Thus, the 116 participants whose data were included should have been enough to detect an effect, had one existed. Furthermore, because we were able to detect effects in the other analyses (i.e., speed and ending distance), we believe we achieved sufficient power to find an effect of happiness on RM if one existed. To test this further, we examined the posterior probability of the prediction under investigation (i.e., that happiness would exacerbate RM magnitude) by using the JASP statistical software (Love et al., 2015) to compute an inverse Bayes factor (BF10). This is represented as an odds ratio in which a value of 1 indicates equal likelihood of the null (H0) and alternative (H1) hypotheses. Values greater than 1 indicate more evidence for H1, and values less than 1 indicate more evidence for H0 (Jarosz & Wiley, 2014). As predicted, this analysis favored the absence of an effect (BF10 = 0.622), with H0 being 1.6 times more likely to hold than H1.

The purpose of Experiment 3 was to examine the possibility that the observed effect of threat could simply be attributed to emotional arousal or to the depiction of motivated, directed action. Despite reflecting approach-oriented, intense, and goal-directed action, the happy scenarios failed to increase forward displacement beyond baseline levels from neutral scenarios. Thus, the systematic RM exaggeration observed in Experiments 1 and 2 may be unique to situations (like threat) in which anticipating further into the future would provide the most effective guidance for behavior.

General discussion

Over the course of three experiments, participants viewed abstracted displays of moving objects and then indicated a target object’s final position. Across all experiments and conditions, participants displaced the moving rings forward in memory, consistent with classic RM effects (Freyd & Finke, 1985; Hubbard & Ruppel, 1999; Ruppel et al., 2009). In Experiments 1 and 2, this forward displacement was greater when observers considered the probed ring to represent a threatening agent than when they did not. Importantly, this effect occurred only for the threatening objects within threatening scenes, rather than for all objects within the threatening scene. That is, people projected the actions of threatening objects further ahead in time than the actions of nonthreatening objects within the same event, which would allow for more efficient calculation of the motion path of greatest significance.

In contrast, arousing happy vignettes (Exp. 3) did not systematically exaggerate forward displacement in memory for the ring’s final position. Instead, the magnitude of RM was comparable to that produced by neutral vignettes. This is consistent with memory biases that would be expected to best serve the demands of happy situations, which are more likely to involve coordination with a target than evasion.

The present work suggests that the relationship between the emotional significance of an event and the optimal memory representation for that event may be complex. Scenarios involving threat are more salient than those involving mundane activity, yet immediate memory accuracy for position was impaired rather than enhanced in the present study. We argue that, whereas the optimal representation of a static threat may be one that accurately reflects its actual location (Mather & Nesmith, 2008; Mather & Sutherland, 2011), the optimal representation of a dynamic threat may be one that involves effective anticipation of its motion. This broad principle regarding optimized processing has parallels in other literatures. For example, findings concerning flashbulb memory (e.g., Talarico & Rubin, 2003) and the impact of stress on memory for detail (e.g., Deffenbacher, Bornstein, Penrod, & McGorty, 2004) similarly demonstrate adaptive reprioritization of processing toward survival relevance.

Our findings are consistent with the principles of the common coding of events (e.g., Hommel, Müsseler, Ashersleben, & Prinz, 2001; Prinz, 1984), embodied cognition (Winkielman, Niedenthal, & Oberman, 2008), and neointernalization theories of RM, whereby forward displacement reflects higher-order processes that are sensitive to anticipated motor activity (Hubbard, 2010). Evasion is an optimal response to threat, and evasion benefits less from precise knowledge of where the threat will be in the next moment than from the ability to predict and direct one’s actions away from the threat’s likely extended path. Indeed, there is substantial evidence that the planning of an action, including actions associated with threat evasion, shares neural resources with perception (Hommel et al., 2001). Given this close connection between observational and behavioral systems, it is not surprising that observing a threat’s motion would facilitate action planning in the way our data suggest it might. In particular, this integrated common coding system would allow computation of the sensory consequences of various possible responses and selection of one that would maximize the distance between oneself and the threat in the future. Because selecting the direction of flight from a threat involves a substantial initial commitment, the cost of an incorrect action decision could be high.

There are a number of reasons why the results reported in the present study may actually underestimate the magnitude of the effect. First, the dots used in these experiments served as boundaries that participants knew (through the instructions) that the rings never reached. If this is the case, trials in which the ring ended close to the dot may have led to repulsion effects, eliminating or mitigating RM (Huttenlocher, Hedges, & Duncan 1991). Second, our participants witnessed abstracted depictions of hypothetical threats against fictitious others while adopting the perspective of a technician. This would be expected to reduce the experience of threat, as compared to a situation in which the threat was aimed more directly at them. Therefore, we would expect that the bias observed in Experiments 1 and 2 might become more pronounced if a first-person experience of threat were involved. In particular, people should be biased to anticipate the motion of an approaching threat and misremember it as having come closer to themselves than it actually had. Indeed, related effects have been reported for spider stimuli (Cole, Balcetis, & Dunning, 2013; Harber, Yeung, & Iacovelli, 2011; Witt & Sugovic, 2013). Third, our displays of rings and dots on otherwise blank backgrounds were schematized, simple, and predictable in their content, and our predisplay descriptions conveyed threat only conceptually (Freyd & Finke, 1985; Hubbard, 1995, 2005; Laney, Campbell, Heuer, & Reisberg, 2004; Laney, Heuer, & Reisberg, 2003; Vinson & Reed, 2002). Thus, our paradigm, which provided for optimal experimental control and comparability between conditions, had rather low ecological validity with respect to threatening situations. We would expect that under circumstances that more accurately depicted realistic threats with realistic stimuli, the effects reported here might increase. Fourth, our results collapsed across many repetitive trials of similar content, and participants may have habituated to some extent to the interpretation of threat over the course of the session. Nevertheless, RM increased as a function of threat in spite of these design decisions.

Finally, it may be of value to consider how individual differences may lead to deviations from the general pattern of responses observed here. Some populations, for example, may be especially disposed to the effects reported here. People with phobias and anxiety disorders show a greater predisposition to expect or detect threats (see also Riskind, Moore, & Bowlby, 1995). Similarly, people with posttraumatic stress disorder may be even more likely than nondisordered participants to perceive threat, attend to it, and predict its course.

Considering some alternative explanations

The threat effects found in these studies could conceivably reflect any of a number of different underlying biases, each with its own set of implications for cognition. We will describe three here and explain why we believe they cannot account for our findings.

Bottom-up mechanisms

A compelling literature has demonstrated privileged processing for evolutionarily relevant threat stimuli, such as spiders (Öhman et al., 2001), angry faces and gaits (Chouchourelou, Matsuka, Harber, & Shiffrar, 2006; Frischen, Eastwood, & Smilek, 2008), and more modern threats such as guns (Loftus et al., 1987). In contrast, the effects shown in the present studies depend on semantic interpretation of a visually neutral situation. We used neither biological motion (Johansson, 1973; Pratt, Radulescu, Guo, & Abrams, 2010) nor static visual depictions of the threatening objects referred to by the vignettes, so the exaggerated RM we have found can be attributed to neither lower-level perceptual processing nor preattentive processing (see also Kuhbandner et al., 2011; Veuilleumier, 2005).

Inferred ending position

One might argue that the final remembered position could have been influenced by an a priori expectation of a closer ending distance for threatening than for nonthreatening agents. Note, however, that our norming study showed no evidence for systematic differences in these a priori expectations. Note also that our norming study did show that people anticipated that happy actions would end at a shorter distance than their comparison neutral actions. However, in spite of this, we did not find exaggerated RM in the happy scenarios. Thus, expectations about default positions within the referent scenario cannot account for our findings.

Increased estimation of duration

People tend to overestimate the durations of both positive (Frederickx et al. 2013) and negative (Loftus, Schooler, Boone, & Kline 1987b) high-arousal events, sometimes by a factor of 2 or 3 (Buckhout, Figueroa, & Hoff, 1975). If two objects move at the same perceived speed, but one does so (seemingly) for longer, then people would experience it as having moved farther. This leads to an intriguing possible explanation for our results, but it cannot account either for the selective exaggeration of RM for the threatening objects within scenes that also contain nonthreatening objects, or for the lack of RM differences in Experiment 3. Presumably, if the perceived duration of the display’s movement had driven our findings, then RM should have increased for all objects within both the negative and positive scenes.

Follow-up studies: Thinking further ahead

The present findings raise several interesting questions that could help us understand how behavioral biases might affect memory biases for dynamic situations like these.

Persistence of the effect

Our investigation was limited to immediate memory following the offset of motion, and RM is known to peak at a few hundred milliseconds following stimulus offset (Finke & Freyd, 1985; Freyd & Johnson, 1987; Hubbard, 2005). We chose to study immediate memory because of its likely role in guiding immediate action. It is not clear from our findings, however, whether the observed distortions persist for longer than the second or two it took for the participants to respond. Given that location memory is better for emotional than for neutral objects in long-term memory (e.g., Mather & Nesmith, 2008), it may be that the distortions we see here are self-correcting at intermediate and long delays. Indeed, the effects we have observed here carry value only for a brief moment, after which a readjustment of memory would be best. On the other hand, when errors do exist in localization, they appear to be more persistent for emotional than for neutral stimuli (Novak & Mather, 2009). Similar persistence for the bias identified in the present study may have important implications, for example, for the accuracy of eyewitness reports following a threatening event (Christianson, Loftus, Hoffman, & Loftus, 1991).

Confronted threat

There are times when confronting the threat is preferred to fleeing. For example, a parent whose child is being menaced may be inclined to challenge the threat rather than flee from it. In this circumstance, the parent might be better served by classic RM, which would better support accurate prediction of the location in the next unit of time, to guide intervention, blocking, or counterattack. Thus, priming a reader to experience this sort of engagement with the threat might lead to a better understanding of the situations that lead to enhanced RM and the functions it best serves.

Postviolence scenario

The increased magnitude of RM observed in Experiments 1 and 2 of this article was argued to support immediate and urgent action. The present paradigm can be applied to threatening scenarios in which the attacker has completed the threatened action and is now approaching the exit. Under such circumstances, we would not predict RM exaggeration comparable to that in the present findings.

Static threats in dynamic situations

Our experiments modeled situations that we believed most accurately reflected threatening events as they are experienced in the world. However, a large number of threats are actually static, lying in wait for an unknowing victim to encounter them. The lion lying in wait for a victim to pass by, a sharp and unseen cliff, or a buried landmine all represent situations in which a threatening stimulus is not moving, but a moving person may nevertheless encounter it. Given the biases we argue for in this article, we would not predict enhanced RM for the actions of the threatened, moving individual under such circumstances.

Conceptual “movement”

If people anticipate the actions of threatening agents further ahead in time, are they more likely to misremember a threatening person’s action as having progressed closer to completion, as compared to the same action by a nonthreatening person (Strickland & Keil, 2011)? To the extent that an increased propensity to predict the threat’s future activities can inform other types of survival-relevant avoidance, we might see increased false memory for completed actions (the assassin succeeding in acquiring the necessary passport) that had actually only been portrayed as goals.

Conclusion

In the classic RM paradigm, observers are generally found to displace the position of a moving object forward in immediate memory. This systematic distortion in classic RM paradigms may afford optimal action planning, whereby behavior can be closely coordinated with the moving object’s anticipated next position. In the present experiments, memory for final position was distorted even farther forward for moving threats. This exaggerated magnitude of the classic RM effect may be uniquely beneficial for threatening situations, wherein an adaptive response may be evasion rather than coordinated action. Thus, anticipation of the threat’s action further into the future, although it leads to a larger memory error, may nevertheless serve to optimize behavior.