Introduction to Augmented Reality (AR)
There is a primary disconnect between the massive digital data at our disposal and the physical world in which we apply it. We, humans, live in a three-dimensional world. But the so-called rich data that aids us in making prudent decisions and actions remain confined to two-dimensional views. Such a scenario creates a chasm between the real and digital worlds and restricts our ability to take advantage of the stream of information and insights produced by billions of smart, connected devices and products worldwide. Augmented Reality (AR) is a technology with capabilities to superimposes or overlay digital data, visualizations, and images on the physical world
To understand AR, we need to understand Virtual reality(VR). Thanks to superior computing power today, computer graphics have progressed to a point where the computer-generated images are often indistinguishable from the real world. Such developments around graphical capabilities paved the way for Virtual Reality (VR). VR is a simulation of a realistic experience using computer-generated scenarios using graphics. VR simulates images to perfection; however, these computer-generated images remain detached from our physical surroundings. The limitation comes from the interests we have in our day to day life, that is not directed toward some virtual world, but rather toward the real world that we are part of or surrounding us.
Augmented reality (AR) creates direct, automatic, and actionable links between the physical world and digital world. It provides a simple and an immediate user interface to a digitally enhanced physical world. This immense potential of augmented reality is a paradigm-shift in user interface design and human-computer interaction (HCI). AR takes us beyond mobile computing. It bridges the gap between virtual world and the real world, both spatially and cognitively. With AR, the digital information appears to become part of the real world, at least in the user’s perception. “Seamlessly achieving this connection is the grand goal.”
Misconceptions about what AR is: AR should not be confused with computer graphics. It is true that often people associate the visual combination of virtual and real elements with the special effects in movies. For example, films like Star Wars, Avatar, Jurassic Park or the Guardians of the galaxy have extensive visual contents. It is also true that that computer graphics techniques used in movies may apply to AR as well, but if you look carefully, such films lack one crucial characteristic of AR – “interactivity.”
So, what is an AR application? AR applications have three fundamental characters
1. Combines Real and Virtual
2. Interactive in real time
3. Registered in 3D
The above definition does not require a specific output device, such as a head-mounted display (HMD), nor does it limit AR to just visual media. Audio, haptics, and even olfactory (sense of smell) or gustatory (sense of taste) are included in its scope, even though these may be difficult to conceptualize and realize. The above also implies that a user of an AR display can exercise some sort-of interactive viewpoint control. That is the computer-generated augmentations in the display will remain registered to the referenced objects in the environment.
When you design a complete AR Systems, your architecture should primarily focus on four components.
1. Tracking component - Determines user position or location in the real world
2. Registration component
3. Visualization component
4. Spatial model - This is your data repository that captures and store information about both the worlds. Hence it is divided into a Real-world model and a Virtual world model. The real model captures reference points in the real world for the tracking component. The virtual model consists of contents for the augmentation. Both parts of the spatial registered in the same coordinate system.
How does it work?
AR operates using a feedback loop between human user and the computer system. A human user observes the AR display and controls the viewpoint. The system tracks the user’s viewpoint, registers the pose in the real world with the virtual content, and presents situated visualizations.
Some AR Concepts
Display: AR is synonymous with the visual overlay of information onto a person’s perception of the physical world. But, the human experience of the physical world is multimodal. Hence, AR displays will support multi-modal or multiple augmentation modalities. That is, apart from traditional displays that combine virtual and real stimuli, AR displays have other sensory modalities. Some AR endeavors focus on specific individual nonvisual modalities. e.g., Audio. In the real world, interactions with physical objects typically occur through touch. You can find sensory controllers on a touch-sensitive electronic keyboard. The sound velocity is controlled based on the pressure you apply on the musical keyboard, devices like Roli-Seaboard has taken this to a new level that modulates sounds based on finger pressure and glides. However, when it comes AR, providing a realistic sensory impression of touch for various real worldly things is still a complex problem to solve. There are on-going efforts to create advanced AR displays also support Haptic, Tactile, Olfactory and Gustatory capabilities.
Registration, Calibration, and Tracking: These are the three prominent terms in AR context. Any AR system must address these three significant, overlapping concepts.
What is Registration? Registration is the process of aligning spatial properties. Every Object “registered in AR” are aligned with each other in some coordinate system. Accurate registration of virtual information with objects in the physical scene in a user’s perception is a typical goal of AR systems.
What is Tracking? Tracking is used to denote the dynamic sensing and measuring of AR systems. Dynamic determination of spatial properties at runtime is called tracking. In the AR context, tracking is measuring an entity’s position and orientation continuously. The entities here could be user’s head, head, eye, or hands. To display virtual objects registered to real objects in three-dimensional space, we must know at least the relative pose - that is, the position and orientation of the AR display relative to the real-world objects.
What is Calibration? Calibration is the process of comparing measurements made with two different devices, a reference device, and the calibrated device. This is done to determine device parameters to deliver measurements on the known scale.
Please note, Tracking is a continues process while calibration is usually carried out only at discrete times. Depending on the measurement system, calibration may be done just once in a device’s lifetime
Visual Coherence is a process of combing and blending virtual objects into the real-world environment. The purpose of Visual coherence is to get to a point where they are indistinguishable from the real world. It might not apply to all types of AR, but visual coherence is critical for areas such as entertainment, education, and commerce.
Conclusion
It is true that few isolated AR applications have been around for many years. But only in recent years technologies to unleash AR’s potential is available. I am sure many are familiar with AR entertainment applications, such as the game Pokémon Go or Google Translate. [if you have not used Google translate app, please do it now]. Given the advancements in technology and engineering today, using AR we can transform volumes of data and analytics into images or animations that can be overlaid on the real world and make people interact with it seamlessly. AR enables a new information-delivery paradigm, which will have a profound impact on how data is structured, managed, and delivered.
With the advances in AI and robotics are getting better by the day, I believe conjoining the machine capabilities with distinctive human strengths and values will lead to far grander productivity, and value creation than either could create alone. A powerful interface is a must to realize AR's potential, an interface that connects the digital and physical world seamlessly for humans to interact using their Vision, Haptic, Tactile, Olfactory or Gustatory senses. Augmented Reality (AR) will be the innovation that would expedite and revolutionize this possibility.
Part 2 – AR Strategy – Soon...
Agree