Chapter 4. Future Actions

Win Shih

Search is moving from a place of answers to a state of action.

—Microsoft Voice Report1

The AI-driven voice technology is described as a major disruptive technology trend.2 In this chapter, we first review the future trends of voice technology and conversation AI. We then offer guidance and suggestions for preparing such changes at the organizational and leadership level.

Looking Forward

After a decade of development, conversational AI and voice technology have progressed beyond the early infancy stage and moved into a phase of mass adoption. According to Gartner’s Hype Cycle, which charts the maturity, adoption, and business applications and values of emerging technologies and innovations, voice assistant technology has now surpassed the initial proof-of-concept and marketing hype stages and entered into a zone that demands further realization of its value and fulfillment of expectations. Gartner has also upgraded voice assistant technology’s maturity level from emerging to adolescent and predicted that it will reach the mainstream adoption stage in a two-to-five-year time scale.3 In another analysis, both Microsoft and Voicebot.ai reported that voice assistant technology and conversational AI have crossed the chasm of the early adopters phase and entered the early majority of users stage in E. M. Rogers’s Diffusion of Innovation theory.4 We anticipate that AI-powered voice technology will gain more traction and move closer to a central and powerful position in our evolving digital transformation. Meanwhile, end users will accelerate the adjustment to a lifestyle—as well as the formation of an even-closer personal partnership—with voice assistants. It is imperative for organizations to shift their worldview of working and serving their customers in this transformation, identifying a new service model centered on these new technologies.

Platform Shift

Our relationship with computers and technology has been evolving over time. Based on the way we interact with technology, Kinsella identified three technology platform and user interface shifts since the advent of the World Wide Web more than twenty-five years ago (see table 4.1).5 Websites and hyperlinks are the first generation of technology platforms and user interfaces that we use to interact with digital content and services. The introduction of mobile devices and apps in the late 2000s expanded the apparatus for us to engage with the digital world. Conversational AI and voice technology elevate us to a new experiential level of human-computer interaction. Although we are still in the midst of voice-based platforms, Deloitte Consulting has already predicted that intelligent interfaces will be the next breakthrough of human experience platforms. Integrating a class of AI-powered solutions, such as affective computing, computer vision, sentiment analysis, and voice stress analysis, the intelligent interface interacts with humans through gestures, gazes, head movements, and voices; detects their physical states, emotional conditions, and moods through sensors; and responds to their needs in the appropriate context.6 Deloitte expects that we will see more progress in and growing application of affective computing in the next eighteen to twenty-four months.

Throughout the process, each platform shift introduces advanced technologies that enhance the old methods, create innovative ways of conducting business, and generate values and efficiencies. Throughout the transition, it is common to see established companies dislodged by start-ups, traditional business models displaced by new paradigms, and consumer behavior transformed. The platform shift from web to mobile apps created new entrants and a new segment of business. Lyft and Uber (ride hailing), Facebook and Instagram (social networking), and Yelp (local business reviews) are prominent examples.

At this early stage of the voice platform shift, the business model is still hazy and the technology standards have yet to be settled. For major tech companies that dominate the web and mobile app platforms, the stakes are high and the competition is fierce. Top players have invested billions of dollars in voice assistants and smart speakers to ensure they continue to dominate their existing market position and maintain competitive advantages with the new voice technology. Amazon and Google are reported to sell their devices below the cost of producing them in order to gain market share.7 Amazon reported having a team of 10,000 employees working on Alexa alone.8 At Amazon’s job site, there are currently over 2,700 Alexa-related open positions, ranging from software development (1,304 vacant positions) to marketing and PR (77 positions).9 In 2017, Microsoft’s AI division had 8,000 employees before its reorganization.10 Google draws employees from related departments to support Google Assistant instead of concentrating positions in one department or product team.11

Which Voice Technology?

In the current conversational AI landscape, voice assistant applications are not compatible. You cannot summon Alexa on a Google Home device or Google Assistant on an Amazon device. With limited resources, organizations can afford to commit to only one proprietary technology. Selecting the appropriate voice platform that fits the organization’s unique business situation and customer needs is a strategic decision. Although all the core players in voice assistant technology overlap in providing basic functionalities, each individual option has its own strengths and specializations. For example, Amazon dominates in the areas of online shopping and control of smart home devices while Google has a stronghold in the content and search arena. To differentiate, Microsoft devises a two-dimensional matrix that maps four key players based on two criteria:

  1. Ability to access and control IoT and home management devices (vertical axis)
  2. Ability of fulfilling purchasing request (horizontal axis)12

The four quadrants generated within this two-dimensional grid represent key mastery of voice assistant performance. These four areas are

  1. Knowledge: the ability to answer questions
  2. Utility: the ability to access and control IoT devices
  3. Commerce: the ability to make purchases with voice commands
  4. Productivity: the ability to integrate into work solutions

Figure 4.1 provides a visualization of how the four voice assistants fit across this functional spectrum based on these two factors. Amazon Alexa is strong in both the utility and commerce masteries. Google Assistant and Microsoft Cortana are positioned close to the center of the grid, an indication that they are both doing equally well in these four areas. Apple’s Siri plays well in both knowledge and productivity masteries.

According to Microsoft, an ideal voice assistant should perform equally well in all four areas and land itself close to the center of this diagram.13 This framework provides a tool for us to see the relative positions of key market players within the four functional areas. The visual snapshot further allows us to assess how these competing technology providers are executing their stated visions and how well their product performs against the two key factors.

Evolving Voice Assistants

As AI-powered voice technologies advance and use cases expand, we are likely to see the following developments in the next few years:

Preparing for the Change

As organizations enter the voice computing era, forward-thinking leaders and decision makers should grow the capacity of organizations and their employees with the following preparatory actions:

Meanwhile, organizations may want to form a working group or interest group with members from relevant functional units to assess the potential adoption of voice technology. Possible assignments and activities of the group may include the following:

Needs assessment and SWOT analysis are two useful exercises for organizations to investigate the appropriateness of adopting voice technology before committing resources. Both strategies also offer an excellent opportunity for an organization to learn about the technology itself as well as user needs.

Needs Assessment

Libraries interested in exploring voice technology and its possible applications should conduct a needs assessment exercise to gauge the interest and desire of their key stakeholder groups. Needs assessment is a systematic process that relies upon data collection and collaboration to identify the gaps between the current (what is) and the desired state (what should be).21 Through interviews, focus group studies, surveys, and observations, libraries collect data from the user community on a specific issue or area of need. By analyzing such data, libraries can make informed decisions, prioritize needs, and take appropriate action. Needs assessment can help organizations to identify and solve existing problems, determine future opportunities and needs, improve performance and services, develop strategic goals and priorities, and align resources with strategy.22 In practice, needs assessment has been used to improve library services to patrons, to assess library training needs, and to reevaluate library space usage.23 Table 4.2 provides an example of a needs assessment data collection plan on implementing Alexa at a large research university.

SWOT Analysis

Another way to assess timing of implementing a new technology before pouring resources into it is to conduct a strengths, weaknesses, opportunities, and threats (SWOT) analysis. When considering a new technology, service offering, or strategic direction, libraries can employ SWOT analysis and conduct an environmental scan to facilitate the decision-making process. SWOT analysis maps an organization’s internal strengths and weaknesses, as well as the external environment’s opportunities and threats related to the organization’s initiative. Through SWOT analysis, organizations can identify their favorable and unfavorable internal factors (strengths and weaknesses) and external factors (opportunities and threats) that might affect the success and performance of the initiative. The findings allow organizations to strategically chart their direction more effectively.24 Armed with a better understanding, organizations can leverage their strengths to realize new opportunities while avoiding or minimizing any potential negative impact and remediating or overcoming potential threats.

SWOT analysis has been widely employed by libraries to assess programs and new initiatives, including adoption of social media to promote library services, library instruction, and holograms in cultural institutions.25 Table 4.3 presents a sample SWOT analysis on implementing Alexa at a large academic library.

Data Governance

Because voice technology can amass enormous volumes of personal data, it creates a higher potential and risk for fraud, identity theft, and hacking. Organizations should pay particular attention to fortifying the security measures of their IT infrastructure while ensuring a thorough and updated data governance policy and data management practices are in place. To earn trust from patrons, leaders and decision makers need to do the following:

Conclusion

Applications of AI-fueled innovations are proliferating in all business sectors, including education. Forward-thinking and tech-savvy information professionals should seek opportunities to explore AI and voice technologies and identify possible and promising voice applications to enhance services, augment productivity, and innovate operations and services. Organizations also need to develop policies, practices, security measures, data governance models, and data risk management programs to mitigate privacy, security, and ethics concerns.

Our relationship with computers and technology will continue to evolve in the near future. Voice assistants are becoming more adept and attuned to our needs and are more intertwined with our personal lives. They will know a whole lot more about us and anticipate our needs in the years to come, increasingly assuming the intermediary role for quick and one-shot answers and even making choices for us in anticipation of our preferences. To remain a source of authenticated and valuable information, libraries, information organizations, and information professionals should rethink their role and relationship with patrons in the voice-based information landscape.

Notes

  1. Christi Olson and Kelli Kemery, Voice Report: From Answers to Action: Customer Adoption of Voice Technology and Digital Assistants (Redmond, WA: Microsoft, 2019), 28, https://about.ads.microsoft.com/en-us/insights/2019-voice-report.
  2. Rob Prevett, “18 Disruptive Technology Trends for 2018,” Disruption Hub, January 11, 2018, https://disruptionhub.com/2018-disruptive-trends; Gil Press, “5 Top Technologies for Digital Disruption,” Forbes, April 27, 2017, https://www.forbes.com/sites/gilpress/2017/04/27/5-top-technologies-for-digital-disruption/#2e1b4da64898.
  3. Van Baker, “Hype Cycle for Artificial Intelligence,” Gartner, July 25, 2019, https://www.gartner.com/document/3953603?ref=solrAll&refval=238935546 (requires subscription).
  4. Olson and Kemery, Voice Report; “The State of Voice Assistants as a Marketing Channel,” Voicebot.ai, https://voicebot.ai/the-state-of-voice-assistants-as-a-marketing-channel-report.
  5. Bret Kinsella, “Why Tech Giants Are So Desperate to Provide Your Voice Assistant,” Harvard Business Review, May 7, 2019, https://hbr.org/2019/05/why-tech-giants-are-so-desperate-to-provide-your-voice-assistant.
  6. “Tech Trends 2020,” Deloitte, last modified January 15, 2020, https://www2.deloitte.com/us/en/insights/focus/tech-trends.html.
  7. Kinsella, “Why Tech Giants Are So Desperate to Provide Your Voice Assistant.”
  8. Bruce Brown, “Amazon Confirms It Has 10,000 Employees Working on Amazon Alexa,” Digital Trends, January 23, 2019, https://www.digitaltrends.com/home/10000-amazon-employees-work-on-alexa.
  9. Numbers shown are based on figures pulled on February 28, 2020. Amazon, Amazon Jobs, https://www.amazon.jobs/en-gb/search?base_query=Amazon+Alexa+&loc_query=.
  10. Todd Bishop, “One Year Later, Microsoft AI and Research Grows to 8k People in Massive Bet on Artificial Intelligence,” GeekWire, September 22, 2017, https://www.geekwire.com/2017/one-year-later-microsoft-ai-research-grows-8k-people-massive-bet-artificial-intelligence.
  11. Bret Kinsella, “Amazon Alexa Hiring Exceeds All Open Positions at Google. Does it Matter?” Voicebot.ai, March 14, 2018, https://voicebot.ai/2018/03/14/amazon-alexa-hiring-exceeds-open-positions-google-matter/.
  12. Olson and Kemery, Voice Report, 28.
  13. Olson and Kemery, Voice Report, 28.
  14. Heather Kelly and Jay Greene, “Amazon Event Introduces New Line of Wearables That Will Push Alexa into Every Corner of Your Life,” Washington Post, September 25, 2019, https://www.washingtonpost.com/technology/2019/09/25/amazon-is-announcing-new-products-keep-pushing-alexa-into-every-corner-your-life/.
  15. Olson and Kemery, Voice Report, 4.
  16. Bret Kinsella, “Amazon, Baidu, Cerence, Microsoft, Tencent, and 30 Other Companies Launch Voice Interoperability Initiative,” Voicebot.ai, September 24, 2019, https://voicebot.ai/2019/09/24/amazon-baidu-cerence-microsoft-tencent-and-30-other-companies-launch-voice-interoperability-initiative/.
  17. Brian Burke, David Cearley, Tuong Nguyen, and Marty Resnick, “Top 10 Strategic Technology Trends for 2019: Immersive Experience,” Gartner, last modified March 13, 2019, https://www.gartner.com/document/3904416 (requires subscription).
  18. Burke et al., “Top 10 Strategic Technology Trends.”
  19. “35 Key Voice Search Statistics You Can No Longer Ignore,” 99firms Blog, April 16, 2019, https://99firms.com/blog/voice-search-statistics/#gref.
  20. Olson and Kemery, Voice Report, 3.
  21. Valeisha M. Ellis, “Needs Assessment,” in The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation, ed. Bruce B. Frey (Thousand Oaks, CA: SAGE, 2018) 1138–39, https://doi.org/10.4135/9781506326139.n467.
  22. Catherine M. Sleezer, Darlene F. Russ-Eft, and Kavita Gupta, A Practical Guide to Needs Assessment, 3rd ed. (San Francisco: Wiley, 2014), 24–25.
  23. Helen Yueping He, Madeline Gerbig, and Sabrina Kirby, “Needs Assessment for Improving Library Support for Dentistry Researchers.” Journal of the Medical Library Association 107, no. 3 (2019): 352–63, https://doi.org/10.5195/jmla.2019.556; Jasim Mohammed Saleh and Norsida Binti Man, “Training Requirements of Agricultural Extension Officers Using Borich Needs Assessment Model,” Journal of Agricultural and Food Information 18, no. 2 (2017): 110–22, https://doi.org/10.1080/10496505.2017.1281748; Lesley S. J. Farmer, “Library Space: Its Role in Research,” Reference Librarian 57, no. 2 (2016): 87–99, https://doi.org/10.1080/02763877.2016.1120620.
  24. Marilyn M. Helms, “SWOT Analysis Framework,” in Encyclopedia of Management Theory, ed. H. Kessler, (Thousand Oaks, CA: SAGE, 2013), 813–15, https://doi.org/10.4135/9781452276090.n276.
  25. Joe Fernandez, “A SWOT Analysis for Social Media in Libraries,” Online 33, no. 5 (September–October 2009): 35–37, Proquest; Esther L. Gil, “Maximizing and Assessing a One-Shot Information Literacy Session: A Case Study,” Journal of Business and Finance Librarianship 22, no. 2 (2017): 97–110, https://doi.org/10.1080/08963568.2017.1285748; Magdalena Wójcik, “Holograms in Libraries: The Potential for Education, Promotion and Services,” Library Hi Tech 36, no. 1 (2018): 18–28, https://doi.org/10.1108/LHT-11-2016-0142.
  26. Allan Cook, Jonathan Berman, Jiten Dajee, and Rob Eggebrecht, “Intelligent Interfaces: Reimagining the Way Humans, Machines, and Data Interact,” in Tech Trends 2019: Beyond the Digital Frontier (Deloitte Insights, 2019), 71–87, https://www2.deloitte.com/content/dam/insights/us/articles/Tech-Trends-2019/DI_TechTrends2019.pdf.
  27. Catherine Bannister and Deborah Golden, “Ethical Technology and Trust,” in Tech Trends 2020 (Deloitte, 2020), https://www2.deloitte.com/us/en/insights/focus/tech-trends.html?id=us:2em:3na:4di6563:5awa:6di:012320&ctr=cta&sfid=003a000001zD2ewAAC.
Functional spectrum (Source: Christi Olson and Kelli Kemery, Voice Report: From Answers to Actions: Customer Adoption of Voice Technology and Digital Assistants

Figure 4.1

Functional spectrum (Source: Christi Olson and Kelli Kemery, Voice Report: From Answers to Actions: Customer Adoption of Voice Technology and Digital Assistants [Redmond, WA: Microsoft, 2019), 28, https://about.ads.microsoft.com/en-us/insights/2019-voice-report.)

Table 4.1. Platform and UI shifts

Platform & UI

Time

Technologies

Devices

Open Standards

Key Market Players

Web & links

Mid-1990s

Internet, web browser

PC

HTML, HTTP

AOL, Microsoft, Netscape, Yahoo

Mobile apps & touch/swipe/pinch

2008

Wi-Fi, iOS/Android, cloud computing

Mobile devices

Wi-Fi, Android

Facebook, Instagram, LINE, Twitter, WhatsApp, YouTube, Uber, Lyft, Yelp

Voice assistants & conversational UI

2011

AI, natural language processing

Mobile devices, smart speakers

Standardizing natural language process and AI technologies

Amazon, Apple, Google, Microsoft

Intelligent interfaces/affective computing

Near future

Auditory analytics, augmented/virtual reality, cloud and edge computing, computer vision, Internet of Things, 5G network

Wearable devices, gesture control devices, smart headsets, AR goggles, sensors

N/A

Up for grabs

Table 4.2. Needs assessment plan for implementing Amazon Alexa at a large academic library

Stakeholders (who will use this service)

What will be learned

How to collect data

Info to be gathered

Undergraduate students

Assess undergraduate students’ interests in using Alexa to learn about library collections and services.

Focus group

Do they have an Alexa device? Have they used an Alexa device? What do they use Alexa for? What information might they want from a library or university Alexa application?

Graduate students

Assess graduate students’ interests in using Alexa to learn about library collections and services.

Focus group

Do they have an Alexa device? Have they used an Alexa device? What do they use Alexa for? What information might they want from a library or university Alexa application?

Faculty and staff

Gauge the interest of faculty and staff in using Alexa to learn about library collections and services.

Surveys

Do they have an Alexa device? Have they used an Alexa device? What do they use Alexa for? What information might they want from a library or university Alexa application?

Alumni and visitors

Survey alumni and visitors’ interests in using Alexa to learn about library collections and services.

In-person interview

Do they have an Alexa device? Have they used an Alexa device? Will they use Alexa to look up library or university information?

Table 4.3. SWOT analysis of implementing Alexa at a large academic library

Internal to Library

Strengths

Weaknesses

S1: Dedicated employees; willing to learn new technology; innovative and creative

S2: Alignment with library’s strategic goals

S3: Adequate IT infrastructure

S4: Competent IT professionals

S5: Administrative support

W1: Resources/sustainability—funding, staffing

W2: Competing priorities

W3: Data policy—lack of campus-wide data policies

W4: Learning curve

W5: Concerns/resistance from some employees

External to Library

Opportunities

Threats

O1: Strategic/symbolic value—forward-thinking and innovative, meeting organization’s mission, strategic goals

O2: Alternative channel for accessing resources

O3: Accessibility—hands-free, adaptive technology, potential for challenged patrons

O4: Productivity—automate tasks, cover FAQs

O5: Collaboration—with other campus units, external partners (vendors, other libraries)

T1: Privacy concern—privacy and surveillance issues and other nefarious practices with voice technology

T2: Security concern—hacking, liability, data breaches

T3: Proprietary technology—no open standards yet

T4: Sustainability—competition from Apple, Google, Microsoft

T5: Questionable content, algorithmic bias, lack of transparency