Skip to main content
SearchLoginLogin or Signup

TouchGrid – Combining Touch Interaction with Musical Grid Interfaces

Musical grid interfaces allow only limited input actions. This paper explores the possibilities of capacitive touch interaction on a button matrix surface.

Published onApr 29, 2021
TouchGrid – Combining Touch Interaction with Musical Grid Interfaces


Musical grid interfaces such as the monome grid have developed into standard interfaces for musical equipment over the last 15 years. However, the types of possible interactions more or less remained the same, only expanding grid capabilities by external IO elements. Therefore, we propose to transfer capacitive touch technology to grid devices to expand their input capabilities by combining tangible and capacitive-touch based interaction paradigms. This enables to keep the generic nature of grid interfaces which is a key feature for many users. In this paper we present the TouchGrid concept and share our proof-of-concept implementation as well as an expert evaluation regarding the general concept of touch interaction used on grid devices. TouchGrid provides swipe and bezel interaction derived from smart phone interfaces to allow navigation between applications and access to menu systems in a familiar way.

Author Keywords

Grid, Touch Gestures, Prototype

CCS Concepts

•Applied computing → Sound and music computing; Performing arts; •Information systemsMusic retrieval;


Musical grid controllers such as the Tenori-On [1][2] and the monome grid [3] influence the way modern musicians produce and perform music for more than 15 years. While this interface concept started in a rather niche community, it developed almost to an industry standard and can nowadays be found in many common musical interfaces such as the Novation Launch line and the Ableton Push controller and even starts to replace keyboard interfaces in some instances (Polyend Medusa, ASM Hydrasynth).

This is obviously due to its generic nature which allows musicians experienced in programming to define custom workflows and performance interfaces, but also enables manufacturers to provide a haptic control element which is still flexible as a display. This allows for multi-functionality provided by one central interface and updatability on the long run, since the generic hardware can be used with and can adapt to updated software easily.

However, to improve the grid hardware interface to specific software, manufacturers added additional control elements such as menu buttons and encoders surrounding the grid. This compromises its original generic nature which is admired by many users since it allows for a focused interface without unused control elements. As a design implication of developing for grid-only controllers, users have to trade buttons permanently for irregularly used functionalities such as function and navigation buttons.

Since this design situation is comparable to designing user interfaces (UIs) for devices with limited screen real estate, we propose that many of the therefore developed solutions can be adopted to grid interfaces which are able to sense touch interaction additionally to the regular button interaction. In this paper we present a proof-of-concept prototype (TouchGrid) which uses the space in between the grid buttons as capacitive touch enabled areas. Beside the technical proof of concept, we evaluated the conceptual idea with an expert group (n=26) via an online survey. In the following we present implementation details and summarize our survey findings.

Related Work

While many UI explorations have been performed in the context of grid devices, non of them have considered the fusion of touch properties as an overlay of a button matrix. Related work has experimented with improving the feedback provided by the grid itself by adding colored LEDs [4] or shape-changing capabilities [5][6] as well as by adding separated touch screens [7]. In this special case, however, the screen was rather used as an integrated computer than a standalone interface. In, other HCI domains experiments with touch augmented keyboards (button-matrices) have been performed [8].

We consider this relevant for our context, since touch interfaces for musical applications have been postulated already in the early 90s by Shneiderman [9]. Early experiments implemented complex musical control tasks such as controlling vocal synthesis [10], bringing them closer to instruments by expressive control [11] and complex mappings [12][13], or facilitating collective musical interaction via mobile devices [14] using touch interfaces.

Touchscreens, as one of the most popular forms of graphical interfaces, provide strong benefits due to their visual capabilities and flexibility. However, their biggest disadvantage is that they lack kinaesthetic feedback and the “history of instrument playing shows us that only few instruments actually depend on visual feedback” [15] and rather the tactile and haptic experience is the important component when playing instruments [16].

This insight is reflected by the insight that new musical interfaces started to combine touch interfaces with tangible objects. As in the example of the reacTable [17], the advantages of touch screens and physical objects are combined. Screens are well suited to display complex information in an attractive manner, however, objects are advanced regarding physical manipulation [18] and their immediacy. Shortcomings of touch surface interaction in the musical context, such as playing pitch information without relying on visual attention, can be addressed by adding haptic layer/divider on top of the screen [19].

Beside their shortcomings touch screens are still considered for musical applications due to their (1) ubiquitousness [20], (2) their ability to open applications for immediate interaction with more than one touch point at a time [21], and (3) the circumstance that mobile devices with their large batch sizes can offer prize-wise attractive state-of-the-art multi sensory input and machine learning enabled devices [22].


Our TouchGrid concept was developed to address problems resulting from the limited resolution and restricted space grids provide by adding interactions of touch interfaces to them. We developed a comprehensive concept which considers learnings and interactions from touch-screen device research and industry standards and implemented a functional prototype which at the moment includes a subset of the outlined interactions possibilities. This proof-of-concept prototype was used to collect expert feedback regarding the concept via an online survey (n=26).


TouchGrid expands the capabilities of a button grid without compromising on its original generic interface vision. Unlike other grids, TouchGrid does not add additional physical controls, but expands the existing interface structure itself. While grid cells track binary or velocity dependent input, soft touches are currently neglected, although they may convey useful information.

We suspect that if a grid can act as a touch-matrix, many interaction methods can be transferred from touch interfaces, including (1) Time-based Gestures such as bezel Interaction [23], swipe, and interaction with screen corners [24]; (2) Static Gestures such as hand postures and gestures [25][26]; and (3) expanded Input Vocabularies using finger orientation towards the device surface [27]. The grid estate problem can be solved by accessing infrequently used controls via touch gestures.

Hardware Implementation

The first iteration of TouchGrid was designed as a 16*8 grid PCB with 128 touch areas instead of buttons (see Figure 1). Creating the prototype we have learned that push and touch events can be distinguished based on time-based features such as the slope, peak or pitch of the capacitive signals. By using the grid as a low res image sensor, static gestures such as flat hand postures (open or closed 5-hand) [28] and time-based gestures such as swipes or bezel interaction can be detected. Due to the utilized technology, the pure number of capacitive cells decreased the maximum sample-rate so that fast interaction was not possible without a major hardware change, still the concept exemplified that with commercial capacitive technology used more gestural and holistic interactions are conceivable.

Figure 1

Image recognition can be used to differentiate the touch events from static gestures such as hand signs or interaction with active edges.

As the next step in our iterative design process, we experimented with common grid DIY kits to explore ways to extend their input capabilities with touch technology. As the basis for the prototype we used the Adafruit NeoTrellis M4 which consists of a 8*4 LED button matrix which can easily be expanded with further NeoTrellis tiles. Other advantages of this hardware platform are the RGB color feedback, and the accessible I2C port to easily connect other sensors and accessories. We designed a custom PCB containing 12 touch areas between the buttons which slides over the silicone button pads. The touch PCB is controlled with a Teensy LC which senses, processes the touch data, and sends events to the NeoTrellis M4, which is running the main application. Having only 12 in contrast to 128 capacitive touch areas, the sample-rate was no restriction in this small footprint device.

Figure 2

TouchGrid consists of (1) slide-over touch PCB, (2) button pads, (3) NeoTrellis M4, (4) Teensy 3.2, (5) screws, (6) hex spacers, (7) spacers, (8) housing.

TouchGrid Prototype

While the hardware is technically able to track many different touch interactions [29][30], TouchGrid currently implements the following touch interactions, since they are well known from generic smart devices and their functionality is used consistently over different manufacturers and operating systems which ensures a unified understanding:

  1. Drag from off-screen to access context specific elements on demand (see Figure 3) such as menus from the top or specific UI elements from the three other sides.

  2. Horizontal swipe to switch through linearly arranged content such as carried out with pictures and apps on touch-screen devices.

We implemented three sequencer engines that are arranged horizontally. A settings menu can be accessed by dragging from off-screen over the device's top side. The menu offers access to global transport controls (play, pause) and engine specific controls such as setting the corresponding MIDI channel (see Video demonstration).


The implemented touch interactions proved that it is possible to save grid estate that otherwise would have been used for control and navigation functionalities. No buttons had to be used for this purpose und thus the whole 8*4 grid is usable for controlling the main application.

We showed that infrequently used UI elements can be removed from the main page and be accessed only on demand. Further, navigation among apps can be achieved by swipe interaction. However, even more possible use cases for touch interactions on grid devices are conceivable.

Figure 3

TouchGrid addresses the grid estate problem. Touch gestures (e.g. “drag from off-screen”) are used to access infrequently used features just on demand.

Since we designed applications and menus to exist in a spatial context, when switching between applications and opening menus this spatiality was emphasized by animations rendering the transitional UI states (see Figure 4). This design is established in smart device context and has proven to help the users in understanding an applications’ structure without increasing the mental workload [31].

Figure 4

The application is organized in different spatially organized pages which can be accessed via swipe gestures. Transitional animations emphasize the spatial layout and are intended to help the users to build up a better understanding of the overall application structure.

Concept Evaluation

We presented a video of our TouchGrid prototype to 26 participants in the context of an online survey and asked for their opinion on the conceptual idea (see Figure 5). The goal of the study was to evaluate the acceptance of the concept and not to evaluate our technical realization.

Our participants were recruited via the monome forum (, due to its homogeneous user group. The 26 participants (22 male, 3 unspecified, 1 non-binary; 5: 20-29 years, 12: 30-39 years, 9: 40-60+ years) used grid devices between 1 and 13 years. All participants agreed that their data would be used in anonymized form for scientific publications.

The participants confirmed that touch gestures are familiar interaction patterns that can add to the interactions with grids (T2) and stated that they can imagine to use touch on grids to access infrequently used functions (T3) as well as to switch applications (T4).

Participants further added potential use cases of touch interaction such as using it as a freely map-able control (n=4), to navigate and zoom in sequence data (n=3), and for musical gestures (n=2).

When they were asked for reasons why they wouldn't use touch on grid devices. They expressed concerns regarding malfunction and interference with current button interactions (n=5), already having sufficient control provided by buttons only (n=3), and an expected increase in complexity due to a new layer of interaction capabilities (n=3).

Overall participants showed keen interest in the idea, provided further options for implementation and expressed their excitement about related improvements on current issues such as switching applications or accessing menu systems.

Figure 5


As the collected expert feedback indicates, musicians would value new input methods in the context of grid-only devices to enlarge the generic input space and thus, giving them more freedom to design interactions which are probably more fitting to tasks they already perform but have to instead map to binary button presses.

Such a task is for example the navigation in larger data sets. The limited resolution and the overall restricted screen estate [32] of grids often prohibit to display a data set’s entirety at once. Specific real world examples are e.g. Ableton Live’s clip matrix view which, if more than 8 tracks or scenes are used, only can show an excerpt of all clips on a grid controller. While the Push 2 and the Launchpad provide navigation buttons, comparable to the arrow keys on a PC, this way of navigation can feel outdated when compared to the reality-based interaction [33] implemented in literally all smart phones and tablets with touch screens.

However, when interactions are time-critical, swipe gesture navigation can result in cumulative latencies due to the need to perform multiple swipe actions compared to only a single button press in case of a navigation bar like implementation. In this case, requirements regarding the available grid estate and the speed to perform an action must be weighed against each other. Our implementation offers the developer the opportunity to choose the fitting design for the specific application, since the touch-interaction adds a new interaction layer without compromising the original interface.

We would like to emphasize that touch-based interaction is perfectly suitable for the implemented interactions such as navigation among apps. Further, the understanding about touch-based interaction is way more uniform than any grid interaction at this point in time. What the implementation of touch-based interaction in the context of grid devices provides is that musicians can transfer their prior knowledge of familiar interaction principles about other technology carriers and their user experience design to the musical interface context.

This not only allows novices to directly experiment with and explore [34] a former unknown technology and still draw from prior experiences, but also helps designers to inform their application design by accepted standards and best practices. For example, gestures for zooming, rotating, translating and accessing hidden menus are already standardized and there is no need for reinventing these every time for each new application.

While this sounds like it would implicate that grids could easily be replaced by touch screens, the tangible properties of these such as the ability to tactually experience the interface [35] by feeling the surface, the interaction elements [36][37], and the passive haptic feedback when pushing buttons [38], in our opinion justifies their existence. Thus, we do not advocate for making touch screens more tangible [39][40][41], but instead we argue for expanding the capabilities of grids and other tangible devices [42] by learning from touch screen based interaction and combing the best from both worlds.


While many research projects investigated how the interaction with and properties of touch interfaces can become more tangible [43], we were interested if touch interaction can be transferred to grid devices to improve the usability of this tangible interface type.

Our preliminary investigations in this context indicate that transferring touch-based interaction principles to grids the usability issue of the limited grid estate can be solved by keeping the limited resources of pixels for frequent interactions regarding playing and conducting music and moving system operations to outsourced pages which are only accessed on demand.

While our project currently includes a technical proof of concept and a concept evaluation with the help of an expert group, we hope that these ideas will stimulate new interface explorations and device developments in the context of grid interfaces.

As future work, we consider the further development of touch resolution, with which more interactions can be detected and thus more expressive possibilities for manipulating and interacting with sounds in real time are available. Furthermore, combinations of touch interaction with simultaneous button presses are conceivable, opening up currently unconsidered interaction possibilities for future grid applications.

In summary we acknowledge that the past 15 years of grid developments were essential in understanding and learning how to play musical grid interfaces, and that the next phase will be determined in taking our play and the deduced learnings to inform new technologies and concepts to help to push grid devices to an even better user experience. Our experiments advocate for a holistic interface design approach which is open to adapt learnings and insights from related interface concepts. We are optimistic that our preliminary investigations in this design context inspires researchers from the NIME community and sparks an open-ended dialogue on future musical grid interfaces.

No comments here
Why not start the discussion?