Taking 3D interaction further, Softkinetic has developed middle-ware that uses a 3D camera to support full-body gesture interaction with games and other applications. No controllers or devices are needed!
The following video is narrated in Portuguese, I, think, but you can understand the content in any language. I you love the Wii, you'll probably like this!
Here is a video that demonstrate how Softkinetic and Silverfit paired together to develop rehabilitative games for the elderly and others:
The following table is from the Silverfit website:
Game
Movement trained
Puzzle
While sitting down, bend whole body left and right, and stand up. Cognitive/visual component.
Mole
Balance exercise by stepping with one leg while standing.
Catching grapes
Walking movement left and right.
Walking
Walking in place, while avoiding obstacles and thresholds. Activity of Daily Life (ADL) component.
Arm exercise
Arm stretching and reaching in all directions with one or both arms. ADL component.
Picking flowers
Walking backwards, forwards and sideways. Optionally, bending down.
Memory
Arm stretching left, right, forwards and upwards. Cognitive component.
The Interactive Displays Conference, sponsored by Intertech Pira, will highlight an interesting mix of existing and emerging interactive display technologies and applications. The conference will be held at the Hilton in San Jose, California, from Tuesday, April 21st through Thursday, April 23rd.
The pre-conference seminar will feature Sakuya Morimoto, of CANESTA, who will present his company's innovative single-chip 3D image sensor technology that supports gesture interaction. Keynote speakers will beJeff Han, of Perceptive Pixel, and Steven Bathiche, of MicrosoftUS.
Some Highlights:
Pre-conference Seminar: Gesture Navigation in the World of Digital Contents, Enabled by a Single-Chip 3D Image SensorPresenter: Sakuya Morimoto, Senior Director, Business Development in Asia, CANESTA, Japan Related: Hitachi at CES 2009: Use of Canesta's 3D sensor to control television and home systems using hand gestures.
"With the wave of a hand, with the shake of a hand, you can control volume, you can actually change the channels, watch your favorite program...the most exciting thing, I think, is that you can actually control your temperature and the lighting in the room, the environmental lighting. So..it is very unique technology that is out there.."
Another demonstration of Hitachi's gesture interaction using the Canesta's 3=D Depth camera:
How does Canesta's Electronic Perception Technology Work? "Canesta’s electronic perception technology forms 3-D, real time moving images in a single chip through patented methods which use light photons to “range” the image, similar to radar. The silicon sensor chip develops 3-D depth maps at a rate in excess of 30 frames per second, and then performs additional processing on these depth maps to resolve the images into application specific information that can easily be processed by embedded processor(s) in the end-use device or machine. Since Canesta’s software starts with a three-dimensional view of the world, provided immediately by the hardware, it has a substantial advantage over classical image processing software that struggles to construct three-dimensional representations using complex mathematics, and images from multiple cameras or points of view. This dramatic reduction in complexity makes it possible to embed the processing software directly into the chips themselves so they may be used in the most cost-conscious applications."
I will highlight some of the featured presentations in future blog posts:
Steven Bathiche, Director of Research, Applied Sciences Group, Entertainment and Devices Division MICROSOFT, US Guillaume Largillier, Chief Strategy Officer and Co-Founder, STANTUM, France Jeff Han, PERCEPTIVE PIXEL, US Mark Fihn, Publisher, VERITAS ET VISUS, US Derek Mitchell, Conference Producer, INTERTECHPIRA, US Vinita Jakhanwal, Principal Analyst, Small/Medium Displays, ISUPPLI CORPORATION, US Joseph Carsanaro, President and CEO F-ORIGIN, US Tommi Ilmonen, CEO MULTITOUCH OY, Finland Stephen Sedaker, Director of Component Sales WACOM TECHNOLOGY CORPORATION, US Brad Gleeson, Managing Director, Business Development TARGETPATH GLOBAL LLC., US Henry Kaufman, President and Founder, TACTABLE, US Christophe Ramstein, Chief Technology Officer, IMMERSION CORPORATION, US Mary Lou Jepsen, CEO, PIXEL QI, US John Newton, Chief Technology Officer, NEXTWINDOW, New Zealand Herve Martin, CEO, SENSITIVE OBJECT, France Scott Hagermoser, Gaming Business Unit Manager 3M TOUCH SYSTEMS, US Bob Cooney, Vice President, Business Development, ECAST, US Brent Bushnell, Chief Technology Officer UWINK, US Stephan Durach, Head, Technology Office, BMW GROUP, US Jeff Doerr, Senior Manager, Business Development Self Service Solutions Group, FLEXTRONICS, US Andy Wilson, Senior Researcher, Adaptive Systems and Interaction Group, MICROSOFT, US Mats W. Johansson, Chief Executive Officer, EON REALITY, US Lenny Engelhardt, Vice President for Business Development, N-TRIG, Israel Dr Paul Diefenbach, Director, RePlay Lab, DREXEL UNIVERSITY, US Andrew Hsu, Technical Marketing and Strategic Partnerships Manager, SYNAPTICS, US Dean LaCoe, Business Development Manager, GESTURETEK, Canada Keith Pradhan, Global Director of Product Management, TYCO ELECTRONICS, ELO TOUCHSYSTEMS, US Jerry Bertrand, Managing Member/Acting CEO, MICROSCENT, LLC, US Frederic Kaplan, CEO and Co-Founder, OZWE, Switzerland
Ubiq'window, by LM3LABS, is a gesture-based system that is used for interactive show windows, interactive in-store marketing, museum installations, and more.
The slides provide details of the Ubiq'window's system specifications, including a gesture recognition set. The slides also higlight "Airstrike", a system that allows for free-air, touchless interaction.
If you are a new visitor to this blog and interested in interactive multimedia, you'll need to know more about the NUI-Group. Natural User Interface or ~ NUI Group is an interactive media group researching and creating open source machine sensing techniques to benefit artistic and educational applications. (For related information, please read my recent post, Usability, Accessibility, and User Experience in a Win7 Environment.)
Seth Sandler, of the NUI-Group, sent out a great email with links and resources for people who are interested in multi-touch/gesture interaction, hardware, and/or software development. The list of NUI-Group members who have completed projects is listed below, with links to project websites as well as related threads on the NUI-Group forum.
(The information can be found on the NUI-Group Wiki, which boasts a nice icon based front-page)
Thanks, Seth, for organizing this wealth of information!
I received the following videos and links from Anthony Uhrick, who happens to be at WinHEC this week and was at PDC 2008 last week. Touch screen, multi-touch, and gesture technology is starting to catch on. (Anthony is the VP of sales for NextWindow, the company that created the touch screen for the HP TouchSmart and other multi-touch enabled displays.)
Below is a video clip of a multi-touch photo presentation system running Windows 7: Gesture + Touch - has gesture and physics engines.
Apparently the application can run on Vista, Win 7, and Win 7 Touch.
Here is an HP TouchSmart PC, running a Touch Map application on Windows 7:
The following clip is of a newscaster using a multi-touch transparent screen.The display is from U-Touch Ltd. a partner of NextWindow.In my opinion, the application enhances the viewers understanding of the various news topics, and is visually appealing as well.
The graphics engine used in this application was developed by Vizrt, the same folks who were behind CNN's video hologram. Here are a few pictures from the Vizrt website:
If you follow this blog because you are interested in emerging multimedia technologies such as multi-touch and gesture-based displays and tables, you probably know that there is a huge void in terms of content -rich applications for these systems.
Most of the demos show how you can zoom, rotate, and resize photographs, sort through your "stuff", or bat things around the surface as a game. There is so much more power behind surface technology that needs to be realized!
Here are some of my reflections...
As I write this post, leaders of the financial industry, large corporations, and governments are in Davos, Switzerland at the annual meeting of the World Economic Forum. It is interesting to note that all of these bright men and women are struggling to grasp the enormity of the world's financial crisis and come up with strategies that hopefully will work.
The graphic below depicts how much has changed in the world economy between the 2008 annual meeting of the World Economic forum and the present. It lacks the "wow" factor that one would expect for an application running on an interactive display. With some tweaking, it could be transformed into an application that supports two people interacting with the data at the same time.
"On this chart each block represents a year and each column represents a range of return on the S&P index. Over on the right side are those lucky years where the index has soared upward from 50-60%. In the middle are the more typical years, where the market has risen less than 10%.That little box on the far left? Yeah, that's this year..And hey, how many of you knew the S&P had been around since 1825?." - Devilstower of the Daily KOS
I've been thinking about interactive information visualization and how it can support our understanding of the current economic crisis a bit lately, inspired by what I learned in Dr. Robert Kosara's InfoViz class I took last year. In a recent post on the Eager Eyes blog, Dr. Kosara floats the idea of the establishment of a "National Data Agency".
"What we need is a National Data Agency (NDA). This agency would be tasked with collecting data that all other agencies collect and produce, and making it available in a central place and in electronic, machine-readable form. There could and should be a reasonable data presentation on its website, perhaps even a National Data Dashboard (showing data of interest like debt, spending, jobless rate, etc.). But the bulk of data analysis would be left to third parties: analysts, journalists, citizens (and also aliens like me). Easily available data would make for more insightful reporting, more informed decisions, and endless business opportunities."-Robert Kosara
This makes sense.
There simply is too much data to absorb, explore, analyze, understand, and act upon. It is difficult to know if you have all of the data that you need, because some of it is difficult to access. It doesn't matter if you are a banker, a stock broker, a CEO, a CFO, a government leader, an economist, a shareholder, or a student. The current state of world economic affairs is the strongest evidence that our methods simply aren't working.
The work of Hans Gosling provides a good example of how information visualization can help increase our understanding of large quantities of data over time. Hans Gosling is a Swedish professor of development and one of the founders of Gapminder. ("Unveiling the beauty of statistics for a fact-based world view".)
The following video is Rosling's latest presentation, focused on debunking the myths regarding population growth:
"Gapminder is a non-profit venture promoting sustainable global development and achievement of the United Nations Millennium Development Goals by increased use and understanding of statistics and other information about social, economic and environmental development at local, national and global levels. We are a modern “museum” that helps making the world understandable, using Internet."
The visual representation of economic data, if done well, packs a powerful punch. To me, images form a kernel in my memory related to the messages conveyed, and when recalled, also bring up a range of related conceptual details. It is sort of like what happens when I hear the first few notes of a tune from the past.
This doesn't seem to be the case for me when thinking about related text, or even thinking about "boring" charts and graphs.
The world needs effective and efficient data and information analysis and interactive visualization tools in order to solve problems that are on such a colossal scale.
The use of collaborative gesture and multi-touch display systems for data and information visualization is something that I believe will support better methods of decision-making in a variety of fields.Now is the time for the interactive information visualization community and related disciplines such as interactive multimedia and HCI to assist in this effort.
Here are some thoughts:
Those who are coding gesture-based or multi-touch programs need to understand what sort of content people will explore, and make sure that applications provide flexibility in use.
Human-computer interaction specialists will need to continue the study a range of interfaces and interactions in order to determine what supports human cognition of larger amounts of data and information.
Creators of interactive multimedia content, web developers, and others will need to re-examine their work and think about ways their content can support new ways of thinking and problem-solving within the context of "surface" computing.
Computer Supported Cooperative Work researchers will need to figure out what needs to be in place so that information can be effectively shared and analyzed between pairs or teams of people, and how this information can best be communicated to others within a business, agency, or organization, as well as the public.
One of the challenges facing this effort is that few people have an in-depth understanding of what it will take to make it happen. We will need to take an inter-disciplinary effort requiring a much higher level of communication and collaboration between people not accustomed to working within this context.
We will also need to take a "big picture" approach.
Because of the world's economic crisis, I think that interactive information/data visualization applications should target the needs of people who are working to understand the crisis and who have the power to do something constructive about it. This can not happen if they rely on the models and data analysis techniques of our recent past.
At the same time, these tools should be available to the rest of us, via the Internet, so that we may do our part to move us forward.
Back Story:
I started keeping up with the current economic on a more serious level in October. I was becoming numb from information overload. My knowledge about the economic and financial fields was lacking, so I decided to create a blog that I entitled "Economic Sounds and Sights"as my personal on-line repository of searchable content.
The blog has lots of pictures, info-graphics, embedded video clips, and links to a wide range of web-based resources. In my quest for information, I came across interesting quotes, jokes about economists, and tales of greed and scandals. I even found one blogger who has responded to each unfolding event of our economic crisis by re-writing lyrics to popular tunes.
For an example of one of my posts, read "Celestial Economic Sphere, Data Viz for the Finance Biz..." It is my hope that the content I've collected and shared on the blog will become part of an interactive information visualization/timeline designed to support two or more people on a large display or table.
11/4/09: Update: The economic crisis got a bit complicated, so I stopped posting. The blog still remains on-line. Interactive Infoviz for the Health Care Biz will be the topic of an upcoming post.
Via Stephen Few: Example of Horizon Graphs, developed by Panopticon.(Year's worth of prices of 50 stocks in 2005 and comparisons between them, click to enlarge)
Mark Lombardi Take the time to listen to NPR's Lynn Neary's interview with Robert Hobbs, curator of the an exhibit of the late Lombardi's "conspiracy" art/visualizations linking global finance and international terrorism. Lombardi's background as an archivist and reference librarian served him well in his art depicting interesting large-scale networks. Although his art was not interactive, his techniques have inspired the development of computer-based interactive information visualizations.
The examples below are of Lombardi's work connecting the relationships between George W. Bush, Harken Energy, and Jackson Stephens:
George W. Bush, Harken Energy and Jackson Stephens
c. 1979-90, 5th Version 1999 Enlarged Version
Close-upof network detail
(missing) Close up depicting a profit made by Bush, 2 weeks before Saddam Hussein invaded Kuwait via Frances Richard
"...though he possessed the instincts of a private eye and the acumen of a systems-analyst, Lombardi was of course an artist, and from the raw material of wire-service reports and books by political correspondents, he drew not only chronicles of covert, high-stakes trade, but technically pristine and sensually compelling visual forms"-Frances Richard
What is the Post WIMP Explorers Club? I came up with the name of this semi-fictional club as a way to organize my thoughts (and blog posts) regarding the development of a new metaphor for post-WIMP applications and technologies, related specifically to natural user interfaces, natural user interaction design, and off-the-desktop user experience.
Update, morning of 12/31/09: Josh Blake, author of the blog "Deconstructing the NUI", posted Metaphors and OCGM this morning. It fleshes out post-WIMP concepts, addressing metaphors & interfaces. The premise is that NUI metaphors will be less complex than GUI (WIMP) metaphors. My feeling is that on the surface, this will hold true, especially for consumers/users and people creating light-weight applications and software widgets.
Underneath the surface, where designers and developers brains spend more time than users & consumers, things might be more complex. Why? The technology to support the required wizardry is more complex. With convergence, the creation of new technologies, applications, communication systems, and even electronic entertainment, is now dependent upon the work and thinking of people from a wider range of disciplines. Each discipline brings to the table a set of terms rooted in theory, and even research practices.
Update, late afternoon, 12/30/09:
Richard Monson-Haefels response to Ron George's "Part 2". The concept of OCGM might be growing on him now... OCGM: George's Razor : "If Ron George can explain how OCGM encompasses Affordances and Feedback than I'll be convinced that OCGM works for NUI. Otherwise, I think OCGM is a great start that would benefit from an added "A" and "F"." -Richard
OCGM relates to Occam's Razor. It is helpful to read a bit about it if you are are interested in the post WIMP conversations. (The link is to an an article from "How Stuff Works", via Richard Monson-Haefel.)
UPDATE 12/30/09 -- This post is part of a discussion between several different bloggers, and was written before Ron George wrote his latest post, Welcome to the OCGM Generation! Part 2, which I recommend that you read now, or within the same time frame, as this post. Since I'm not ready to write "Part 2" of this post, I tweaked what I had and added some links to a handful of my previous posts that touch on this and related topics. The links can be found at the bottom of this post.
START HERE FOR THE "ORIGINAL" POST FROM 12/29 & 12/20/09:
Background About a year ago I responded to a conversation between Johnathan Brill, Josh Blake, and Richard Monson-Haefel discussing "post-WIMP" conceptualization regarding natural user interfaces and interaction, otherwise known as NUI. The focus of the discussion was on Johnathan's post, "New Multi-touch Interface Conventions". At the time, we were reading Dan Saffer's book, Designing Gestural Interfaces, and contemplating new ways that technology can support human interaction and activities in a more natural, enjoyable, and intuitive manner.
A few days later, I shared some of the concepts from the discussion on a post on this blog, "Why "new" ways of interaction?". The post includes video of Johnathan Brill discussing PATA, a post-WIMP analogue to assist with multi-touch/gesture based application development, which he describes as follows: Places
"Lighting, focus, and depth, simplified searching and effecting hyperlinked content." Animation "Using animation to subtly demonstrate what applications do and how to use them is a better solution than using icons. Animations makes apps easier to learn." Things "Back in the days of floppy disks, objects helped us organize our content. This limitation was forced by arcane technology, but it did have one huge advantage. We used our spatial memory to help us navigate content. Things will help us organize content and manipulate controllers across a growing variety of devices." Auras "Auras will help us track what we are tracking and when an interaction has been successful." (For reference, I've copied some of my responses to the first discussion, which can be found near the end of this post)
A year later.... What has changed? Everything post-WIMP has been covered like a blanket by the NUI-word. "NUI" now functions as a generic term for anything that is not exactly WIMP. There is a sense of urgency now to figure out how best to conceptualize post-WIMP interfaces and interactions. Newer, affordable technologies enable us to interact with friends and family while we are on-the go. Netbooks, e-Readers, SmartPhones, large touch screen displays, interactive HDTV, and new devices with multi-modal I/O's abound. Our grandparents are on Facebook and twitter from their iPhones. Our world no longer requires us to be slaves to the WIMP mentality.
So what is the problem? The technology has moved along so fast that application designers and developers have not had a chance to catch up. (The iPhone is an exception.) The downturn in the economy has made it difficult for many to take the leap from traditional software or web development and gain new skill sets. On top of it all, most of us over the age of 15 have been brainwashed from years of working within the constraints of WIMP. It doesn't matter if we are users, consumers, students, designers, or developers.
Even the folks least likely to have difficulty expanding into the post-WIMP world have had some difficulty. If you've had training in HCI (Human-Computer Interaction), you were inadvertently brainwashed with the best. The bulk of the theory and research you contemplated was launched at a time when WIMP was king, even as the Web expanded. Many of the of the principles held dear to traditional HCI folks have been shattered, and no-one has come up with a "theory of everything" that will cover all of the human actions and interactions that are supported or guided by new technologies.
The problem, in part, is that letting go of WIMP is hard to do, as illustrated by the following post from the Ars Technica website: Light Touch: A Design Firm Grapples with Microsoft Surface (Matthew Braga, 6/29/09) "Ditching the mouse and keyboard means a whole lot more than just doing without two common peripherals. As those who have worked with Microsoft Surface have found out, you have to jettison decades of GUI baggage and start with a whole new mindset...In actuality, few multi-touch gestures are really anything like what we experience in the physical world. There is no situation in which we pull on the corners of an image to increase its size, or swipe in a direction to reveal more content. So, in the context of real-world interaction, these types of gestures are far from natural...gestures should not only feel natural, but logical; the purpose that gestures like these serve, after all, is to replace GUI elements to the end of making interaction a more organic process." (Be sure to read the comments.) Now that the Surface is taking root in more places, and touch-screen all-in-one PC's and tablets are starting to multiply, more people are giving "NUI" some thought. Ron George, an interaction and product designer with experience working with Microsoft's Surface team has contributed to the post-WIMP discussion and spent some time sharing ideas with Josh Blake, a .NET, SharePoint, and Microsoft Surface Consultant for InfoStrat and author of Deconstructing the NUI blog. The outcome of this discussion was Ron George's December 28th blog post, "OCGM (pronounced Occam['s Razor] is the replacement for WIMP", and Josh Blake's post, "WIMP is to GUI as OCGM (Occam) is to NUI". (Be sure to read the comments for both of these posts!)
OCGM (as conceptualized by Ron George)
Objects "are the core of the experience. They can have a direct correlation with something physical, or they can just be objects in the interface."
Containers "will be the “grouping” of the objects. This can manifest itself in whatever the system sees fit to better organize or instruct the user on interactions. They do not have to be, nor should they be, windows. They can be any sort of method of presentation or relationship gathering as seen fit."
Gestures "I went into detail about the differences in Gestures and Manipulations in a previous post [check it out for a refresher]. Gestures are actions performed by the user that initiate a function after its completion and recognition by the system. This is an indirect action on the system because it needs to be completed before the system will react to it."
Manipulations "are the direct influences on an object or a container by the user. These are immediate and responsive. They are generally intuitive and mimic the physical world in some manner. The results are expected and should be non-destructive. These are easily performed and accidental activations should be expected and frequent."
To illustrate a point regarding the validity of the OCGM analogy proposed by Ron George, Josh Blake shares the following video of a presentation from REMIX 2009, in which August de los Reyes, the Principle Director of User Experience for Surface Computing at Microsoft, briefly discusses the TOCA (Touch, Objects, Containers, and Actions) concept, suggested to replace the WIMP concept:
The video wouldn't embed, so go to the following link:
Predicting the Past: A Vision for Microsoft Surface "Natural User Interface (NUI) is here. New systems of interaction require new approaches to design. Microsoft Surface stands at the forefront of this product space. This presentation looks at one of the richest sources for inventing the future: the past. By analyzing preceding inflection points in user interface, we can derive some patterns that point to the brave NUI world."
The concepts outlined in the presentation are similar to Microsoft's Vision for 2019
Richard Monson-Haefel added his thoughts about the discussion about OCGM in his recent blog post, "What is NUI's WIMP?" Richard disagrees with the OCGM concept, as he feels it doesn't encompass some important interactions, such as speech/direct voice input. He'd probably agree that NUI is NOT WIMP 2.0.
It is important to note that activity theory-based interaction design is viewed as a "post-cognitivistic", and informed by some of what I studied in psychology, education, and social science years ago. Within the field of activity theory are some important differences, which I'll save for a future post.
Below are some concepts taken from the book.I am still mulling them over through the prism of NUI, post-WIMP, PATA, TOCA, OCGM, etc. That's why there will be at "Part II", with specific examples.
"Means and ends, the extent to which the technology facilitates and constrains attaining user's goals and the impact of the technology on provoking or resolving conflicts between different goals
Social and physical aspects of the environment - integration of target technology with requirements, tools, resources, and social rules of the environment Learning, cognition, and articulation, internal vs external components of activity and support of their mutual transformations with target technology
Development -Developmental transformation of the above components as a whole" "Taken together, these sections cover various aspects of the way the target technology supports, or is intended to support, human actions". (page 270)
I especially like the activity checklist included in the appendix of the book, as well as the concept of tool mediation. "The Activity Checklist is intended to be used at early phases of system design or for evaluating existing systems. Accordingly, there are two slightly different versions of the Checklist, the "evaluation version" and the "design version". Both versions are implemented as organized sets of items covering the contextual factors that can potentially influence the use of computer technology in real-life settings. It is assumed that the Checklist can help to identify the most important issues, for instance, potential trouble spots that designers can address". (page 269)
"The Checklist covers a large space. It is intended to be used first by examining the whole space for areas of interest, then focusing on the identified areas of interest in as much depth as possible...there is a heavy emphasis on the principle of tool mediation" (page 270).
Other Thoughts What is missing from this picture is a Universal Design component, something that I think holds up across time and technologies. Following the principles of Universal design doesn't mean dumbing down or relying on simplicity. It is a multi-faceted approach, and relies on conctructing flexibility in use, one of the key concepts of Universal Design. I'd like to see this concept embedded in the post-WIMP conceptualization somehow.
Because of my background in education/psychology/ special education, I try to follow the principles of Universal Design for Learning (UDL) when I work on technology project. I've spent some time thinking about how the principles of UDL could be realized through new interaction/interface systems. Although this approach focuses on the educational technology domain, it is important to consider, given that a good percentage of our population - potential users, clients, consumers - has a temporary or permanent disability of one kind or another.
Components of Universal Design for Learning: Multiple Means of Representation Provide options for perception Provide options for language and symbols Provide options for comprehension Multiple Means of Action and Expression Provide options for physical action Provide options for expressive skills and fluency Provide options for executive functions Multiple Means of Engagement Provide options for recruiting interest Provide options for sustaining effort and persistence Provide options for self-regulation -Adapted from the UDL Guidelines/Educator Checklist, which breaks down the components into more specific details.
Note: The concept of Universal Design for Learning shares historical roots with some of the work behind Activity Theory and Interaction Design. Obviously, there is still much to contemplate regarding OCGM and other permutations of post-WIMP concepts!
Here are my comments to the discussion on Johnathan Brill's blog from January 2009:
Thoughts: I'm seeing a trend to "optimize" WIMP-type applications so they can be manipulated by gesture and touch. People who program kiosks, ATM's and POS touch screens are examples of what I'm talking about. Touch and hold, two-fingered touch, and double-tap are just a slight transformation of the WIMP world-view, and in my opinion, are still WIMP (wimpy!). The mouse interaction "pretenders" are fine for using legacy productivity applications, OK in the short run.
For example, I have an HP TouchSmart, but I don't use the touch screen as often as I'd hoped. Try using using Visual Studio to code something on a touch screen. There is so much more that can be done! I know from the touch-screen prototype/demos I've worked on in various classes that applications that support collaboration and decision-making are important, and not just for work.
What do people DO, really? First of all, we are social beings, most of us. Think of what we share and discuss with others, and think about what sort of interactions on a display might best correspond with this interaction. Here are some of the things I've been DOING recently that involved some sort of technology and communication/collaboration with others:
---Travel planning - I recently went on a cruise and with various family members, selected activities I wanted to do on the ship as well plan my shore excursions (a complicated process)
---Picture sharing- I came back from the cruise with lots of pictures that I uploaded on Flickr. Related to this process: Picture annotating, tagging, choosing/comparing & editing it would be SO cool if I could use two sliders to enhance my pictures just so!
---Talking on the phone and responding to e-mail with friends and family members about the pictures, and what they wanted to see on Flickr- "you know, the ones of the dogs in Jamaica"... of which I had about 68!
---Financial planning with my husband. (I took info-viz last semester, so I know the possibilities are there.)
---Using the touch-screen to check-in at my eye-doctor's office: This was a user-unfriendly experience. Such a nice little screen. I was provided with a WIMPY PowerPoint-like interface which was confusing to use- and time consuming!
---Shopping at the new Super Wal-Mart: I asked the greeter if there was a map, and he said, "Food is over there, and the rest of the stuff is that way." Flat panel displays were all over the store, but of course,they weren't interactive. I had no idea where anything was, and the few items I was looking at had no bar-code tags. There wasn't a clerk in sight. Wal-Mart TV rolled on-and-on via the display above my head. If I could only harness the display and have the talking head answer my questions! I gave up on my shopping trip when I was in the facial lotion/potion section. Too many choices, and too much fine print to read.
---Shopping at the mall with my young-adult daughters... I'll have to hold my thoughts on that one for now!
Some suggestions:
I think the artist/designers, (even dancers,) who are interested in multi-touch and gesture interaction have some interesting things to consider. (I linked to some of my previous posts.)
Again: I am still mulling things over through the prism of NUI, post-WIMP, PATA, TOCA, OCGM, etc. So that is why there will be at "Part II". With specific examples!