Mode (user interface)

In user interface design, a mode is a distinct setting within a computer program or any physical machine interface, in which the same user input will produce perceived results different from those that it would in other settings. Modal interface components include the Caps lock and Insert keys on the standard computer keyboard, both of which typically put the user's typing into a different mode after being pressed, then return it to the regular mode after being re-pressed.

An interface that uses no modes is known as a modeless interface.[1] Modeless interfaces avoid mode errors, in which the user performs an action appropriate to one mode while in another mode, by making it impossible for the user to commit them.[2]

Definition

[edit]

In his book The Humane Interface, Jef Raskin defines modality as follows:

"An human-machine interface is modal with respect to a given gesture when (1) the current state of the interface is not the user's locus of attention and (2) the interface will execute one among several different responses to the gesture, depending on the system's current state." (Page 42).

In Raskin's sense and according to his definition, an interface is not modal as long as the user is fully aware of its current state. Raskin refers to this as "locus of attention" (from the Latin word locus meaning "place" or "location"). Typically a user is aware of a system state if the state change was purposefully initiated by the user, or if the system gives some strong signals to notify the user of the state change in the place where interaction occurs. If the user's locus of attention changes to a different area, the state of the interface may then represent a mode since the user is no longer aware of it.

Larry Tesler defined modes as "a state of the user interface that lasts for a period of time, is not associated with any particular object, and has no role other than to place an interpretation on operator input."[3]

Examples

[edit]
[edit]

Several examples of software have been described as modal or using interface modes:

  • Text editors – typically are in insert mode by default but can be toggled in and out of overtype mode by pressing the Insert key.
  • Bravo (editor) – the first WYSIWYG modal editor made for Xerox Alto computers at Xerox PARC by Butler Lampson and Charles Simonyi
  • vi – has one mode for inserting text, and a separate mode for entering commands. There is also an "ex" mode for issuing more complex commands (e.g. search and replace). Under normal circumstances, the editor automatically returns to the previous mode after a command has been issued; however, it is possible to permanently move into this mode using Shift-Q.
  • Emacs – has the concept of "prefix keys", which trigger a modal state by pressing the control key plus a letter key. Emacs then waits for additional keypresses that complete a keybinding. This differs from vi in that the mode always ends as soon as the command is called (when the sequence of key presses that activates it is completed). Emacs also has many "major and minor" modes that change the available commands, and may be automatically invoked based on file type to more easily edit files of that type. Emacs modes are not restricted to editing text files; modes exist for file browsing, web browsing, IRC and email and their interaction patterns are equivalent to application software within the Emacs environment. Modes are written in Emacs Lisp, and all modes may not be included with all versions.
  • Cisco IOS – certain commands are executed in a "command mode".
  • Tools chosen from a palette in photo-editing and drawing applications are examples of a modal interface. Some advanced image editors have a feature where the same tools can be accessed nonmodally by a keypress, and remain active as long as the key is held down. Releasing the key returns the interface to the modal tool activated by the palette.
  • Video games can use game modes as a mechanic to enhance gameplay.
  • Modal windows block all workflow in the top-level program until the modal window is closed.[4]

Modeless

[edit]

Larry Tesler at PARC devised insights for a modeless word processor from the feedback gathered from a user test with newly-hired Sylvia Adams, where she was asked to ad lib some gestures to correct proofreading marks on the digital text.[5] This test convinced Tesler's manager Bill English of the problems with their previous modal interface.

Mode errors

[edit]

Modes are often frowned upon in interface design because they are likely to produce mode errors when the user forgets what state the interface is in, performs an action that is appropriate to a different mode, and gets an unexpected and undesired response.[6][7] A mode error can be quite startling and disorienting as the user copes with the sudden violation of their user expectations.

Problems occur if a change in the system state happens unnoticed (initiated by the system, or by another person, such as the user who was previously using the machine), or if after some time the user forgets about the state change. Another typical problem is a sudden change of state that interrupts a user's activity, such as focus stealing. In such a situation it can easily happen that the user does some operations with the old state in mind, while the brain has not yet fully processed the signals indicating the state change.

Examples of mode errors

[edit]
  • The most common source of mode errors may be the Caps Lock key. Other common modes available in PC keyboards are the other lock keys, Num lock and Scroll lock, and often the Insert key. Dead keys for diacritics also create a short-term mode, at least if they do not provide visual feedback that the next typed character will be modified. While the Lock keys on PC keyboards are designed with the intention that they be used as modal keys, the IBM PC hardware design does not require these nor any other specific keys to be modal but allows software to treat any key as modal. (The PC BIOS normally implements Caps Lock, Num Lock, and Scroll Lock states, so the modality of these keys may appear intrinsic, but it is neither technically nor practically necessary to use the BIOS for keyboard I/O, and in fact most modern operating systems do not use BIOS keyboard I/O.)
  • PC users whose language is not based on the Latin alphabet commonly have to interact using two different keyboard layouts: a local one and QWERTY. This gives rise to mode errors linked to the current keyboard layout: quite often, the synchronization of "current layout" mode between the human and the interface is lost, and text is typed in a layout which is not the intended one, producing meaningless text and confusion. Keyboard keys in user interface elements like "(y/n)" can have opposite effect if a program is translated.
  • A frequent example is the sudden appearance of a modal error dialog in an application while the user is typing, which is a form of focus stealing; the user expects the typed text to be introduced into a text field, but the unexpected dialog may discard all the input, or may interpret some keystrokes (like "Y" for "yes" and "N" for "no") in a way that the user did not intend, often triggering a destructive action that cannot be reverted. Programmers can mitigate this by implementing a short delay between the modal dialog displaying and it beginning to accept keyboard input.
  • The Unix text editor vi can be notoriously difficult for beginners precisely because it uses modes, and because earlier versions configured mode indication to be turned off by default.
  • In many computer video games, the keyboard is used both for controlling the game and typing messages. Users may forget they are in "typing mode" as they attempt to react to something sudden in the game and find the controls unresponsive (and instead their text bar full of the command keys pressed).

In transportation accidents

[edit]
  • Mode confusion was part of the events that led to the loss of Air France Flight 447 in 2009, and the loss of life of 228 people. The pilots reacted to a loss of altitude by pulling on the stick, which would have been an appropriate reaction with the autopilot fully enabled, which would then have put the aircraft in a climbing configuration. However, the airplane's systems had entered a mode of lesser automation ("direct law" in Airbus terms) due to a blocked airspeed sensor, allowing the pilots to put the plane in a nose-high stall configuration, from which they did not recover.[8]
  • According to the NTSB, one of the factors contributing to the 2013 Asiana Airlines Flight 214 crash was "the complexities of the autothrottle and autopilot flight director systems … which increased the likelihood of mode error".[9][10]
  • On January 17, 2015, the offshore supply vessel "Red7 Alliance" collided with a lock gate of the Kiel Canal in Germany, damaging it severely. An investigation concluded that the levers controlling the ship's Azimuth thrusters were not used in a way appropriate to the mode they were set to, resulting in the ship accelerating instead of coming to a stop in the lock.[11]
  • On August 21, 2017, the US Navy destroyer USS John S. McCain collided with a commercial tanker in the Strait of Malacca, resulting in the loss of life of ten crew members. An investigation conducted by the US military concluded that immediately prior to the collision, helm and propulsion controls had been redistributed between bridge stations, and the bridge crew was not fully aware of that redistribution.[12]
  • On April 10, 2018, the 5000 ton supply vessel VOS Stone unberthed from a wind platform under construction in the Baltic Sea. The vessel's master decided to put the steering in an alternative mode to perform a test of the system. Insufficient communication with the officer of the watch led to a temporary loss of control, collision with the platform, injury to three crew members, and significant damage.[13]
  • On April 19, 2020, an F-35A fighter jet was destroyed in a landing mishap at Eglin Air Force Base. Investigations concluded that the aircraft was misconfigured with the wrong mode of autothrottle, resulting in the aircraft becoming uncontrollable upon touchdown.[14][15]

Assessment

[edit]

Modes are intended to grab the user's full attention and to cause them to acknowledge the content present in them, in particular when critical confirmation from the user is required.[16] This latter use is criticised as ineffective for its intended use (protection against errors in destructive actions) due to habituation. Actually making the action reversible (providing an "undo" option) is recommended instead.[17] Though modes can be successful in particular usages to restrict dangerous or undesired operations, especially when the mode is actively maintained by a user as a quasimode.

Modes are sometimes used to represent information pertinent to the task that do not fit well into the main visual flow.[16] Modes can also work as well-understood conventions, such as painting tools.[7]

Modal proponents[who?] may argue that many common activities are modal and users adapt to them. An example of modal interaction is that of driving motor vehicles. A driver may be surprised when pressing the acceleration pedal does not accelerate the vehicle in the forward direction, most likely because the vehicle has been placed in an operating mode like park, neutral, or reverse. Modal interfaces require training and experience to avoid mode errors like these.

Interface expert Jef Raskin came out strongly against modes, writing, "Modes are a significant source of errors, confusion, unnecessary restrictions, and complexity in interfaces." Later he notes, " 'It is no accident that swearing is denoted by #&%!#$&,' writes my colleague, Dr. James Winter; it is 'what a typewriter used to do when you typed numbers when the Caps Lock was engaged'." Raskin dedicated his book The Humane Interface to describe the principles of a modeless interface for computers. Those principles were implemented in the Canon Cat and Archy systems.

Some interface designers have recently taken steps to make modal windows more obvious and user friendly by darkening the background behind the window or allowing any mouse click outside of the modal window to force the window to close – a design called a Lightbox[18] – thus alleviating the risk of modal errors. Jakob Nielsen states as an advantage of modal dialogs that it improves user awareness. "When something does need fixing, it's better to make sure that the user knows about it." For this goal, the Lightbox design provides strong visual contrast of the dialog over the rest of the visuals. However, while such a method may reduce the risk of inadvertent wrong interactions, it does not solve the problem that the modal window blocks use of the application's normal features and so prevents the user from taking any action to fix the difficulty, or even from scrolling the screen to bring into view information which they need to correctly choose from the options the modal window presents, and it does nothing to alleviate the user's frustration at having blundered into a dead end from which they cannot escape without some more or less destructive consequence.

Larry Tesler, of Xerox PARC and Apple Computer, disliked modes sufficiently to get a personalized license plate for his car that read: "NO MODES". He used this plate on various cars from the early 1980s until his death in 2020. Along with others, he also used the phrase "Don't Mode Me In" for years as a rallying cry to eliminate or reduce modes.[19][20]

Bruce Wyman, the designer of a multi-touch table for a Denver Art Museum art exhibition[21] argues that interfaces for several simultaneous users must be modeless, in order to avoid bringing any single user into focus.[22]

Design recommendations

[edit]

Avoid when possible

[edit]
Small signs make explicit the mappings from signal to roads.

Alternatives to modes such as the undo command and the recycle bin are recommended when possible.[23] HCI researcher Donald Norman argues that the best way to avoid mode errors, in addition to clear indications of state, is helping the users to construct an accurate mental model of the system which will allow them to predict the mode accurately.[24]

This is demonstrated, for example, by some stop signs at road intersections. A driver may be conditioned by a four-way stop sign near his or her home to assume that similar intersections will also be four way stops. If it happens to be only two way, the driver could proceed through if he or she sees no other cars. Especially if there is an obstructed view, a car could come though and hit the first car broadside. An improved design alleviates the problem by including a small diagram showing which of the directions have a stop sign and which do not, thus improving the situational awareness of drivers.

Proper placement

[edit]

Modal controls are best placed where the focus is in the task flow.[23] For example, a modal window can be placed next to the graphical control element that triggers its activation. Modal controls can be disruptive, so efforts should be made to reduce their capacity to block user work. After completing the task for which the mode was activated, or after a cancel action such as the Escape key, returning to the previous state when a mode is dismissed will reduce the negative impact.

Quasimodes

[edit]

In the book The Humane Interface, Jef Raskin championed what he termed quasimodes, which are modes that are kept in place only through some constant action on the part of the user; such modes are also called spring-loaded modes.[25] The term quasimode is a composite of the Latin prefix quasi- (which means almost, to some degree) and the English word "mode".

Modifier keys on the keyboard, such as the Shift key, the Alt key and the Control key, are all examples of a quasimodal interface.

The application enters into that mode as long as the user is performing a conscious action, like pressing a key and keeping it pressed while invoking a command. If the sustaining action is stopped without executing a command, the application returns to a neutral status.

The purported benefit of this technique is that the user does not have to remember the current state of the application when invoking a command: the same action will always produce the same perceived result.[26] An interface that uses quasimodes only and has no full modes is still modeless according to Raskin's definition.

The StickyKeys feature turns a quasimode into a mode by serializing keystrokes of modifier keys with normal keys, so that they do not have to be pressed simultaneously. In this case the increased possibility of a mode error is largely compensated for by the improved accessibility for users with physical disabilities.

See also

[edit]

Notes

[edit]
  1. ^ Usability Glossary: modeless Archived 2007-10-22 at the Wayback Machine
  2. ^ Usability Glossary: mode error
  3. ^ Tesler, Larry (2012-07-01). "A personal history of modeless text editing and cut/copy-paste". Interactions. 19 (4): 70–75. doi:10.1145/2212877.2212896. S2CID 21399421. (pdf)
  4. ^ "How to Use Modality in Dialogs". Oracle Corporation.
  5. ^ "Of Modes and Men". IEEE Spectrum: Technology, Engineering, and Science News. August 2005. Retrieved 2020-02-21.
  6. ^ Glossary: mode error
  7. ^ a b Usability Glossary: modal
  8. ^ BEA final report on the loss of Air France 447
  9. ^ National Transportation Safety Board
  10. ^ Poor UI design can kill
  11. ^ M/V Red7 Alliance investigation report (in German)
  12. ^ "USS McCain collision ultimately caused by UI confusion". 2017.
  13. ^ Investigation Report 118/18, Federal Bureau of Maritime Casualty Investigation (Germany), April 10, 2019
  14. ^ US Air Force accident report
  15. ^ ]F-35A Crash at Eglin AFB, C.W. Lemoine, Youtube
  16. ^ a b "Modal Panel - Context". Infragistics.com. Archived from the original on 2013-05-06.
  17. ^ Aza Raskin, A List Apart: Never Use a Warning When you Mean Undo
  18. ^ Jakob Nielsen, Alertbox. "10 Best Application UIs".
  19. ^ Origins of the Apple Human Interface by Larry Tesler, Chris Espinosa
  20. ^ Origins of the Apple Human Interface - full transcript
  21. ^ Technology for Experience's Sake: Guest Post by Bruce Wyman
  22. ^ Bruce Wyman's post at the ixda.org mailing list
  23. ^ a b "Modal Panel - Implementation". Infragistics.com]. Archived from the original on 2013-05-06.
  24. ^ Norman, Donald A. (1983). "Design rules based on analyses of human error". Communications of the ACM. 26 (4): 254–258. doi:10.1145/2163.358092. S2CID 47103252.
  25. ^ Usability Glossary: spring-loaded mode
  26. ^ Spring-Loaded Modes, Jakob Nielsen.

References

[edit]
[edit]