wavelounge

Wednesday, April 8, 2009

from kabuki to dumb type

we had an interesting discussion on modern opera in japan in composition class today. our seminar started with talking about kabuki, bunarku and noh. kabuki is the popular entertainment art since 1603 in japan. bunraku is the puppet theatre. japanese puppet theatre is very fragile and detailed. noh is the most elit and sophisticated type of theatre in japan. it gets into spirit and depth of the person an it is extremely slow and needs a lot of concentration. if you don't know what these terms mean and you are still curious, check out some examples or read about it. i just wanted to introduce these art forms to you, for each of these topics i should write a whole dissertation to get into their depth.

what is more interesting for me is all the modern movements in recent decades. after second world war butoh movement started in japan. it involves taboo topics of the society performed in the form of dance by dancers in white moving their body in hyper controlled motions. if i had to translate the movements to music, i could say it has a lot of slow but exact microtones in it. check out the video and explore it yourself: butoh by sankai juku.

another interesting group in the new japan's theatre is dumb type. dumb type is an interdisciplinary group of artists in kyoto. they make amazing multimedia projects, dance and theatre performances. their most famous pieces are voyage, memorandum, and pH.

i like true as well.

Monday, April 6, 2009

gestonic

Specification

Gestonic is a video-based interface for the sonification of hand gestures for real-time timbre control. The central role of hand gestures in social and musical interaction (such as conducting) was the original motivation for this project. Gestonic is being used to make computer-based instruments more interactive. It also allows the musicians to create sonorous and visual
compositions in real time. Gestonic explores models for sonification of musical expression. It does not use the direct-mapping of gesture-to-sound such as is commonly applied in acoustic instruments. Instead, it employs an indirect mapping strategy by making use of the color and timbre of the sound. The system consists of a laptop's camera, the filtering of camera input via the open source software known as Processing, the sending of OSC control messages to the audio-processing program known as ChucK, and finally the parameter-mapping and sound synthesis enabled by ChucK.

Gestonic consists of two main components:

• Gesture and Image Processing: This part of the system consists of a laptop's video camera and an Open Source software called Processing to filter and calibrate data received from the camera. In the current prototype of Gestonic, the input screen is divided into four sections: each represents a different instrument. In each section, the relative and absolute brightness and
the amount of change compared to the previous frame in red, green and blue is measured. Furthermore, four different blobs each detecting a different color (white, red, green and blue) show up on the screen. By moving objects with the same color as a blob, color tracking those objects with blob tracking is possible, so there are four more possible parameters to map to sound. Chuck and Processing communicate via OSC messages sent from Processing to ChucK in order to manipulate sound and send the opposite direction to control the video output to make the instrument more expressive.

• Data Processing and Sound Synthesis: ChucK programs are used to manipulate data received from Processing to synthesize sound.

Work in Progress

Gestonic is a work-in-progress and there is a lot more to be done to formulate expressive sounds from expressive gestures. Each section on the video frame is mapped to a different instrument. So far, the modules for four types of instruments are implemented. One is a drone like sound. The second instrument is a randomly generated, particle-like sound. The timbre and reverb of this sound is manipulated with gestures. In the future, the density of these random sounds will be indirectly mapped to the density of motion in the image. The third instrument is a beat-detecting instrument tracking the beats in motion. The fourth instrument is a set of human voices. The voices are manipulated with a granular synthesizer and grain parameters are mapped to blob motions received from the video.

Progress week 2

I started reading on Neural Networks to train the instrument by making some basic gesture recognitions possible. I looked into Neural Networks in Processing and Neural Network toolbox in Matlab. Some Neural Networks related References are added below.

Progress week 3

After playing around with matlab's nn toolbox and learning about basic concepts of image recognition such as morphology I decided to use something more practical. Matlab is good for analyzing images, but not for real time performance.

I am finally using Wekinator a free package to facilitat rapid development with machine learning in live music performance. The big advantage of this package is that it is very chucK friendly and it helps me to do real time motion extraction from camera input and the implementation of learning methods in Wekinator and sound synthesis with chuck.

Progress week 4

This week I started to make a simple one layered Neural Network in processing. It gets input from mouse, I haven't mapped it to the video camera yet. So far I can read six different drawings from the screen and train the network with those input drawings. The longer the training the less the error of recognizing the proper drawing. The next step is to get input from the camera. Then the question is how can I proceed? How can I make the training work in real time?

Progress week 5

As we approached the middle of the quarter, we have to deliver the first draft of our paper for this project, so I started to read more and get a deeper understanding of gesture based systems using neural network. I ran into at least twenty different systems and each in a way similar to the others but also unique is certain ways.

- Glove Talker

- Japanese sign-language recognition system

- Japanese manual alphabet recognition system

- Musical conducting gesture recognition system

- handshape recognition system

- Given: a handshape(postures) and dynamic gestures recognition system

- Coverbal gesture recognition system

- Sign motion understanding system

I am going to explain some details about these systems and some of their similarities that are useful in my implementation. Some main structural components of gestures that were used in most of these systems are:

- motion path length

- gesture duration

- maximum hand velocity

- flex for thumb, index, middle and annular fingers

- hand orientations

Progress week 6

This week we are submitting the first draft of our paper. I will upload my paper here soon.

In addition I worked on some image processing stuff. I have approached the problem from two different ways:

- analyzing by brightness

- analyzing by pixelation

I am still working on feeding these values to the neural net.

Progress week 7

This week I worked on making new sounds to map to gestures. It is hard to make sounds interesting enough and map in a non- linear way to make it more EXPRESSIVE!

A good inspiration was that I met with Troika Ranch Dance company. They demonstrated their software, isadora which is totally what I want to want to achieve with my software but instead of their approach, I only use open source software.

Results

The final paper that I summarized all the findings of this project is submitted and published at IHCI conference 2009, San Diego. Paper is available upon request.

Links and References

1. Machover, T.: Instruments, Interactivity, and Inevitability. Proceedings of the NIME International Conference (2002)

2. Kurze, M.: TDraw: a Computer-based Tactile Drawing Tool for Blind People. Proceedings of 2nd Annual ACM Conference on Assistive technologies. ACM Press. Canada (1996) 131-138

3. Fels, S.S., Hinton, G.E.: Glove-Talk: A Neural Network Interface between a Data-glove and a Speech Synthesizer. IEEE Trans. On Neural Networks, Vol. 4, No. 1 (1993)

4. “Processing” website

5. Wright, M., Freed, A.: Open SoundControl: A New Protocol for Communicating with Sound Synthesizers. ICMC. Thessaloniki (1997)

6. Wang, G., Cook, P.R.: ChucK: A DAFx, Concurrent, On-the-fly Audio Programming Language. Proceedings of the ICMC (2003)

7. Carette, E.C., Kendall, R.A.: Comparative Music Perception and Cognition. Academic Press (1999)

Neural Network References

1. Hunt, A., Hermann, T. : The Importance of Interaction in Sonification, ICAD (2004).

2. Kolman, E., Margaliot, M. : A New Approach to Knowledge-Based Design of Recurrent Neural Networks. (2006)

3. Franklin, K., Roberts, J. : A Path Based Model for Sonification.

4. Boehm, K., Broll, W., Sokolewicz, M. : Dynamic Gesture Recognition Using Neural Networks; A Fundament for Advanced Interaction Construction, SPIE Conference Electronic Imaging Science & Technology, San Jose California. (1994)

Friday, April 3, 2009

rocco di pietro

we had the honor to have a guest composer during the winter quarter at ccrma. rocco was advising us on making music and he composed a lot himself too in this period. he even composed a piece for lap top orchestra and we performed it at ccrma and uc irvine. his piece for slork is called 'one stone flow'. it starts with some nature sounds played by laptops, then leading to drone like sounds played with laptops controlled by joysticks and the chords played at the piano. and finally laptops play interviews with the composers who have had influence on rocco's music, or were his teachers such as maderna, foss, boulez and finally john chowning. then the voices of the composers are manipulated with granular synthesis and build up a chaotic sound. the piece ends with a huge laugh played by all laptops.

if you want to hear rocco's interview with sica check out here.

Monday, March 9, 2009

more composers

last week we had a very interesting guest composer "alvin curran". his music is made of electronic and environmental sounds. he has compositions from lake concerts featuring musicians in row boats to ship horn concerts. one of my favourite is his "floor plan/notes from underground" which was a holocaust memorial installation at ars electronica in linz.

good news for san francisco and bay area residents: he has a concert coming up this sunday(march 15th) at contemporary jewish museum in san francisco.

Thursday, February 26, 2009

guest composers

we have had very interesting composers as guests at ccrma in the last couple of weeks. first "yinam leef" was here for pan asian music festival. we had the honor to have a class with him on composition seminar. i don't need to explain how rich and beautiful his music is, you can just listen to his music yourself but i just mention what moved me in his class was that he and lots of great composers have learned composition in a very classical western style and they are not sure if that's the best way to teach it to the next generations. on the one hand having the luxury of learning harmony and counterpoint in early ages help the composer to get to a deeper level in music, but doesn't it take some of her creativity? for example in "berlin hochschule der kuenste" the composition students don't go through this classical education and they don't limit themselves to it. what do you think?

Friday, February 6, 2009

tape festival

last weekend i we went to tape music festival in san francisco. it was in cellspace which is a very cool venue but not necessarily for tape music. the acoustic of the room adds another roughness to the sound, which depending on the sound could make it sound cooler or not.

i really enjoyed a piece "etude aux sons animes" by pierre schaeffer. he has made use of very unique fantastic sounds that i could feel myself floating in a metal bowl or dropping metal ball on my ears. another favourite piece of mine was a composition of thom blum which was especially clean and i enjoyed how rich the whole sound was and clean the it moved from one texture to another. and of course my very favourite wave that night was ligeti's artikulation. it was nice to hear them through eight speakers in a new environment.

Friday, January 23, 2009

waves of the week!

i have a very crazy schedule and never have time to listen to music qualitatively during the week. but every weekend i borrow five to ten cds from stanford music library and take time to listen to them and some times analyzing them. if you do the same thing could be great to start some listening discussions on this blog.

one of the cds i have got for this week is voices 1900/2000 a choral journey through the twentieth century. very beautiful sounds.

another set of waves i listened to this week were tons of gyoergy ligeti's music. having lived in vienna in the last decade of my life, i didn't hear as much ligeti music there as in the last week. but some of his masterpieces were performed in wien-modern (a yearly music festival in vienna on 20th century music.) my favourite ligeti's orchestral piece is atmospheres. this piece has such a thick texture with a huge variety of timbres. you might have heard it in the stanley kubrik's 2001, a space odyssey. enjoy listening to it while looking at the scores. the score is visually as rich and thick as the sound.