Tech Blog

Databending with Web APIs

February 24, 2024

Image that has been databent with my custom databend application

An example of an image that was databent using my web-based databending application.

Databending with Web APIs

This repo hosts this blog post and a simplified example:

It’s based on the work for a web application that builds a UI around this concept:

Intro

Databending, like circuit bending, is a way of creatively breaking technology to generate unique and unexpected results. In circuit bending, the idea is to mess around with the underlying electronics: modding, jumping, and shorting to find interesting ways to break circuits. In databending the idea is to break things in the data layer.

Computers operate in 1s and 0s. We might see an “a” in a text file, but a computer will see “01100001”. The fact that we’re just dealing with numbers is important to databending: when we run an audio file through an EQ or an image through a filter, we might think the application has some sense for what a song or a picture is, but ultimately it’s just doing math. So what happens when we run an image through an EQ? Or an audio file through an image filter? That’s one sure-fire way to break things!

A common example of databending (and how I learned about it) is to take the free audio software Audacity, use its “raw data” import option to load in an image, run some audio effects over the image, and export the bits back out as an image file. Here’s an example of an image bent with Audacity:

Image that has been databent with Audacity

(Original image source; Audacity databending tutorial)

It’s fairly easy to do, but it’s also fairly easy to do irreparable damage to the image file. That’s because in most files there’s a section of the bits devoted to the data and there’s a section of the bits devoted to metadata (the header). Running the header through the databending process can result in a file that can’t be opened because at a certain point applications can’t understand enough of the metadata to open the file.

It’s also a long-ish iteration cycle. Audacity isn’t showing a preview of the image you’re destroying because it doesn’t know it’s working with an image; Audacity is just doing math with a bunch of bits you handed it.

The thing I made

As a kid I enjoyed breaking photos with Audacity to make pseudo-generative art and then I became a web developer. So after rediscovering databending, I decided to make my own application to speed up the process. In modern browsers we have a few tools that are relevant to this project:

  1. Canvas API: an API for working with images
  2. Web Audio API: an API for working with audio
  3. Tone.js: a library that takes the very low-level building blocks from the Web Audio API and abstracts them into higher-level audio tools (like BitCrusher and PitchShift).

With these tools, I made the databend project. It uses the Canvas API to convert an image into a bunch of numbers, runs the numbers through different audio effects provided by Tone.js, and then converts the resulting audio data back into an image.

It doesn’t muck with the headers because it’s just manipulating the actual image/audio data. It knows its job is to destroy images, so it provides some previews of how the image will look on the other side of the process.

How it works

The code here will be based on the demo in index.html (shown here). To see the full application code, check out its Github repo.

From image to audio

The first step is getting image data. We do this by creating a Canvas context, rendering an image with it, and then pulling out the image data.

// pretend you have an image object
// like by using `new Image()`
const { width, height } = image

// Create a canvas twice the height of the image
// so we can show before/after
const imageMount = document.getElementById('image-mount')
const canvas = document.createElement('canvas')
canvas.width = width
canvas.height = height * 2
imageMount.appendChild(canvas)

// Draw the original image
const context = canvas.getContext('2d')
context.drawImage(image, 0, 0, width, height)

// Get the image data
// this is confusing: getImageData returns an object
// containing a property called data
const imageDataContainer = context.getImageData(0, 0, width, height)
const imageData = imageDataContainer.data

(getContext, drawImage, getImageData)

So what do you have at this point? You’re now the proud owner of a Uint8ClampedArray representing the image. It’s a 1D array that describes each pixel with four values: red, green, blue, and alpha (transparency). Each value is represented as a number between 0 and 255. So one pixel of fully opaque steelblue and one pixel of fully transparent salmon would be:

// R, G, B, A, R, G, B, A
[70, 130, 180, 255, 250, 128, 114, 0]

Audio works with floats between -1 and 1, so we need to do a conversion:

function scale(number, inMin, inMax, outMin, outMax) {
    return (number - inMin) * (outMax - outMin) / (inMax - inMin) + outMin;
}

// Convert image (0 to 255) to audio (-1 to 1)
function convertImageDataToAudioData(imageData) {
    const audioData = new Float32Array(imageData.length)
    for (let i = 0; i < imageData.length; i++) {
        audioData[i] = scale(imageData[i], 0, 255, -1, 1)
    }
    return audioData
}

Through Tone.js

I didn’t say this earlier, but thank you to the maintainers of and contributors to Tone.js. It’s just a gem of a library.

At this point we have a Float32Array that looks enough like audio that we can pass it to Tone.js. In the application code, things gets a little convoluted here. I wrap each Tone.js module in a wrapper that allows me to apply a dry/wet mix.

I also add an option to split the color channels before running the data through Tone.js: remember that the array is sorted as [R, G, B, A], this option creates four arrays ([R], [G], [B], [A]), runs them individually through the audio effects, and then merges them again on the other side. It’s hard to explain this, but think about a delay: a delay takes information from one position in the array and applies it to another position. By processing all the colors together, information from the red color might affect the blue color or alpha values. By splitting them, red will only affect red. Also it gives me the ability to only apply the effect to one color at a time.

Anyway here’s what it looks like in the demo when we run the data through Tone.js (without all of that added complexity):

async function processAudioData(audioData) {
    const audioContext = Tone.getContext()

    // Create offline renderer so we don't have to wait for the "audio" to "play"
    // before we get a buffer back that we can either manipulate more
    // or convert to an image
    const rendered = await Tone.Offline(() => {
        // Create a one channel Web audio buffer to load audio data into
        const buffer = audioContext.createBuffer(1, audioData.length, audioContext.sampleRate)
        const buffering = buffer.getChannelData(0)

        // Fill the buffer with our data
        for (let i = 0; i < audioData.length; i++) {
            buffering[i] = audioData[i]
        }


        // Create the Tone buffer module to play the audio data
        const bufferNode = new Tone.ToneBufferSource(buffer)

        // Create some effects that will process the audio data
        const phaser = new Tone.Phaser({
            frequency: 20,
        })
        const chebyshev = new Tone.Chebyshev({
            order: 2
        })

        // Connect all the pieces together,
        // including sending the data to the destination (output)
        bufferNode.connect(phaser)
        phaser.connect(chebyshev)
        chebyshev.toDestination()

        // Start the buffer playing
        bufferNode.start()
    }, audioData.length / audioContext.sampleRate, 1, audioContext.sampleRate)

    // Pull the raw data out of the updated buffer
    const processedAudioData = rendered.getChannelData(0)
    return processedAudioData
}

(Tone.getContext, Tone.Offline, Tone.ToneBufferSource, Tone.Phaser, Tone.Chebyshev, Tone.toDestination, createBuffer, getChannelData, connect)

Steps:

  • Since we don’t actually want to listen to the audio, we create an offline Tone instance. Besides saving us from harsh noise, it also speeds up the process since it can run as fast as possible to render audio (rather than have to play the audio at a normal speed).
  • We create a Web Audio API buffer and populate it with the data we generated in the image-to-audio step.
  • We create a Tone.js ToneBufferSource (using the Web Audio buffer) and some effects, connect them, and start the buffer playing.
  • Tone.Offline returns a promise containing the buffer which contains the data we’ll convert back into an image.
  • Once it’s all processed, we pull out the raw data from the buffer: another Float32Array.

This is where the bulk of the experimentation happens. Here you can tweak the settings for the effects or replace them with any number of other Tone.js modules. For each new module, just make an instance using the constructor and add connections using .connect. Dig through the original application code to see how I added dry/wet mixes (although some effects have this built in) and the option to process individual colors.

From audio to image

From here we’ll scale the audio data (from -1 to 1) back to image data (from 0 to 255) and drop the updated pixels back into the Canvas context:

// Convert audio (-1 to 1) to image (0 to 255)
// (mutates the original imageData array with the new data)
function convertAudioDataToImageData(imageData, audioData) {
    for (let i = 0; i < audioData.length; i++) {
        imageData[i] = scale(audioData[i], -1, 1, 0, 255)
    }
}

// Draw the modified image data
// (offset by the height of the image,
// so it doesn't overlap the "before" image)
context.putImageData(imageDataContainer, 0, height)

(putImageData)

Here’s the same image from the Audacity experiment, databent by playing around with the demo code in this repo:

Image that has been databent with Audacity

Conclusion

This isn’t the most performant way to do this. It’s laggy and front-end JavaScript is probably the last place we want to be processing large numbers of bits. I’m sure WASM or Web Workers or C++ would be a better alternative. I am glad I made this though; it highlights how far web APIs have come and how many tools we have available to us on the front-end. Plus I used this to make art that landed me on the cover of a zine I admire!

Anyway, hope this helps someone. Happy hacking!


Written by Matthew Curtis - community organizer, artist, and developer in Fayetteville, AR. Here are my links.