background image

4

5

with the same fundamental idea. This is the idea 
that by analyzing the structure and tone quality of 
the human voice, we can then attempt to simulate 
it. As a representative example, let’s look at 
“formant synthesis.” Formants are the spectral 
peaks of the sound spectrum (the distribution of 
the volume of each frequency band) of the voice. 
The idea is that you can simulate human 
pronunciation (the vocal cords and the movement 
of the mouth) by supplying these peak 
movements to a basic sound source.

“Concatenative synthesis” is another method 

that spread quickly due to the shrinking costs of 
digital technology. This method involves linking 
fragments of recorded (sampled) voices to 
synthesize vocals. Vocaloid’s system is basically a 
type of concatenative synthesis which produces 
more music like results. This system achieves this 
effect by similarly connecting vocal fragments, and 
at the same time making adjustments to each 
frequency zone.

Formant synthesis and the robot 
voice

As for an example of a device that is closer to the 

concept of “formant synthesis,” the “Vocoder” is a 
device that is familiar to many in the music world. 
The idea for this device was originally formed in the 
late 1920s at Bell Labs. At the time, it was used as a 
voice compression technology for sending a clear 
voice transmission through a telegraph cable’s small 
bandwidth. The technology was used mainly for 
purposes of military communication, due to the 
limitation of cost reduction with the technology at 
the time, as well as the fact that this was the period 
encompassing World War II. Production costs were 
reduced as semiconductor technology advanced in 
the late 1960s, and instruments and effect 
processors that gave the human singing voice a 
robot-like effect grew popular. Vocoder technology 
as a means of voice compression was later used to 
improve voice clarity in cellphones. This technology 
is still being developed today.

Similarly, a type of effects processor called a talk 

box, which uses the structure of the human mouth 
itself as a physical filter, has become very popular 
in musical genres such as rock and funk. These 
devices, however, are simply effects processors 
that process sound by using the movements of the 
human mouth. They don’t quite belong in the 
same category of “vocal synthesizers” as Vocaloid 
does, because they do not generate singing voices 
on their own.

The birth of Vocaloid

Starting with the Yamaha PLG100-SG in 1997, 

which mounted the formant singing sound source 
as a plug-in board for a desktop music sound 
module, there have been examples in the past of 
vocal synthesizers sold as instruments. However, in 
2000, a project called “Daisy” which payed homage 
to “Daisy Bell” started. In 2003, they released sound 
generating software called “Vocaloid” and 
everything changed. They adopted a unique 
concatenative synthesis system created by 
breaking down data of recorded voices into 
fragments (phonemes), then adjusting and editing 
these fragments to compile a database. In this way, 
they were able to achieve smooth vocal synthesis. 
Vocaloid was praised for its natural vocal 
expression and its user-friendly software. It 
became widely acknowledged, particularly by 
users dedicated to desktop music. In 2007, 
Vocaloid 2 was announced. In the same year, the 
more character oriented “Hatsune Miku” was 
developed by Crypton Future Media.

Pocket Miku’s built in “eVocaloid” 
technology

VOCALOID 3 was released in 2011. Its 

concatenative vocal synthesis engine made even 
more natural vocal expressions possible, and many 
character voices appeared in a library of singing 
voices stocked with vocal fragments. Meanwhile, 
sound chips used in hardware, such as those that 
produce ring tones in cell phones, have become 
widespread and continue to develop. Pocket Miku 
is equipped with the newest of such chips, the 
Yamaha NSX-1. In addition to functioning as a 
sound chip, NSX-1 is equipped with an “eVocaloid” 
sound generator. This sound generator puts to use 
Vocaloid technology which was previously only 
used as sound generating software for personal 
computers and similar devices. Pocket Miku 
brought one further modification to NSX-1. 
Whereas previous Vocaloid systems required 
programming on software called “score editors” 
beforehand, with this modification Pocket Miku is 
the first product in the world that enables you to 
perform real-time on Vocaloid. Pocket Miku is 
battery operated with a built in speaker. By simply 
sliding your stylus across the carbon keyboard, 
Hatsune Miku will sing for you anywhere. Go 
ahead and try it out!

Score information

Musical notes/lyrics/ 

musical expressions

Singer library

Synthesizer 

database

Expression 
database

Score editor

Synthesizer engine

Synthesized vocals

Score information 

input interface

Convert score 

information

into audio signals

Extract from recorded 

vocals of actual singers

WAHHA GO GO

“WAHHA GO GO,” a machine that laughs like a human, was developed in 2009 by Maywa 

Denki. Powered by a flywheel and bellows, the device imitates the movements of human 

vocal cords and the opening and closing of the mouth, resulting in changes in the 

formant (voice quality) and the amount of air (voice volume).

© Yoshimoto Kogyo co.,ltd. / Maywa Denki

Formants and the Vocoder

The peak movements of a formant have a significant relationship with the vocal chords 

and movements of the mouth when a person uses their voice. When similar sounds are 

produced, their formants peak near the same frequencies. The Vocoder is a development 

of audio compression technology that reproduces formants by generating them from the 

receiving end. It uses multiple bandpass filters to detect the extent of the peak of each 

frequency zone.

The construction of “Vocaloid”

“Vocaloid” uses the specialized software “Score Editor” to input score information. This 

made it easier than ever before to produce rich and natural vocal expressions.

Hatsune Miku

Going beyond the boundaries 

of instruments and 

synthesizers, this creation 

swept the charts and flooded 

cities with stunning visuals. 

“Hatsune Miku” is considered 

to be the first virtual idol to be 

recognized world wide. She 

sparked a social phenomenon, 

in which she stands center 

stage.

500

1000

1500

First Formant

 The “A” formant Multiple peaks can be confirmed.

Vocal cords

Second Formant

VOCALOID is a registered trademark of the Yamaha Corporation.

eVocaloid is a trademark of the Yamaha Corporation.

Illustration by KEI

Summary of Contents for Pocket Miku

Page 1: ...The Singing Keyboard POCKET MIKU USER S MANUAL Otona no Kagaku Magazine Special Edition Illustration by iXima ...

Page 2: ... there is a lot of dust Do not use the device if it has been damaged This device requires 3 AAA alkaline batteries Using the batteries incorrectly may result in the risk of heat generation rupture and leakage Please note the following Do not attempt to take apart a battery Do not attempt to short circuit the batteries Do not use chargeable NiCad batteries NiOx batteries or nickel batteries Short c...

Page 3: ...duced into the market such as Macintosh s MacinTalk function and Windows SAPI Various electronic devices such as vending machines were manufactured with built in speech synthesis LSI and the sound quality and functions of this technology made leaps and bounds Today music that utilizes Vocaloid the synthesized singing technology developed by Yamaha is sweeping the music charts Television programs a...

Page 4: ...ncompassing World War II Production costs were reduced as semiconductor technology advanced in the late 1960s and instruments and effect processors that gave the human singing voice a robot like effect grew popular Vocoder technology as a means of voice compression was later used to improve voice clarity in cellphones This technology is still being developed today Similarly a type of effects proce...

Page 5: ...amaha NSX 1 In addition to functioning as a sound chip NSX 1 is equipped with an eVocaloid sound generator This sound generator puts to use Vocaloid technology which was previously only used as sound generating software for personal computers and similar devices Pocket Miku brought one further modification to NSX 1 Whereas previous Vocaloid systems required programming on software called score edi...

Page 6: ...peakers and personal computers with line input When headphones are plugged into the audio out jack the speakers will not produce sound Audio out jack 3 5mm Stereo Mini USB port Micro B Type Power switch Speaker Control buttons Volume buttons Carbon keyboard Stylus LED While using the device pass your cord through the notch on the right hand side To put the stylus away restore it to the indented pa...

Page 7: ...sponse to the use of the button or the stylus may lag slightly Step by step basics Using batteries to power your device Open the back cover and place 3 AAA alkaline batteries in the correct and positions Put the power switch in the BATTERY position The LED will light up along with a voice announcing that your device is on Using a USB cable to power your device A USB cable is not included with this...

Page 8: ...er portions of the carbon keyboard and the positions for each note are roughly the same on either portion This allows for a smooth transition between the two portions during performance Returning to Do re mi mode To return to Do re mi mode after using another performance mode press the VIBRATO button the quadrant shaped button while holding the SHIFT button the thin button in the shape of an arc A...

Page 9: ...ssed again If you hold down both the SHIFT key and one of the A to O buttons simultaneously the preset lyrics of the selected key will be read to you The same preset lyrics from the command SHIFT A to O are also stored in VIBRATO A to O By linking your device with the application you will be able to freely replace the aforementioned preset lyrics See page 10 Applying vibrato By pressing the VIBRAT...

Page 10: ... application these button combinations will be used as empty banks to store the new lyrics Each bank allows you to input up to 64 character sounds What is MIDI Pocket Miku is able to send and receive data in MIDI format MIDI Musical Instrument Digital Interface is used to transmit performance data digitally between electronic instruments and computers Software standards are based on the requiremen...

Page 11: ...Miku s sound may become disordered In addition setting the MIDI input of any component other than Pocket Miku MIDI channel 1 to NSX 39 will cause pitch bend information to be applied to this component resulting in their pitches sounding abnormal Embedding Pocket Miku lyrics into MIDI data Using the Pocket Miku dedicated System Exclusive it is possible to use MIDI sequencers to embed Pocket Miku ly...

Page 12: ...intentionally overwritten by data from an external device Execute panic command to fix this condition Panic command reboots the sound generating chip on the interior of your device restores the MIDI settings to the conditions at startup and reenters Pocket Miku s interior character table data Panic command can be activated by holding down the volume up and volume down buttons simultaneously and th...

Page 13: ... once After touching every key the LED will stop flickering indicating that you have finished tuning Concerning NSX 1 application compatibility mode Although Pocket Miku is equipped with the sound chip NSX 1 by default it will not receive CC or SysEx for NSX 1 s character data If NSX 1 receives MIDI data or applications that use CC or SysEx character data it will not produce action or sound intend...

Page 14: ...our device in a number of ways including the functions assigned to the buttons the content of the preset lyrics the start up sound and the guiding voice and tone that is heard while operating the device For example you can assign internal effects to be operated by pressing a previously unassigned combination of buttons simultaneously You can also adjust the start up voice for a more natural sound ...

Page 15: ...anties or representations that the Licensed Programs are satisfactory to the user s expectations against interruptions or delays in the Licensed Programs for the safety accuracy or completeness of the Licensed Programs that the Licensed Programs are free of errors or for corrections to any faults or other shortcomings in the Licensed Programs 5 Limitation of Liability The Companies liability shall...

Page 16: ... 114 72 re 19 13 gyu 51 33 tyu 83 53 pu 115 73 ro 20 14 gyo 52 34 dyu 84 54 pe 116 74 rya 21 15 sa 53 35 cha 85 55 po 117 75 ryu 22 16 si 54 36 chi 86 56 hya 118 76 ryo 23 17 su 55 37 chu 87 57 hyu 119 77 wa 24 18 se 56 38 che 88 58 hyo 120 78 wi wi 25 19 so 57 39 cho 89 59 bya 121 79 we we 26 1A za dza 58 3A tsa 90 5A byu 122 7A wo wo 27 1B zi dzi 59 3B tsi 91 5B byo 123 7B n N 28 1C zu dzu 60 3C...

Reviews: