Invited Paper

AIM 2001(IEEE/ASME International Conference on Advanced Intelligent Mechatronics)
Teatro Sociale, Como, Italy

July 2001


Robotics and Machine Vision for the Future 
-- an Industrial View --

Masakazu Ejiri,
Fellow, IEEE, IAPR, and IEICE

Central Research Laboratory, Hitachi, Ltd.,
Kokubunji, Tokyo 185-8601, Japan


Abstract
Recent trends in industrial technology are to make things small, synergetic, intelligent, and environmentally friendly. Mechatronics is one research area on these trends, and its perspective is first introduced. The status of research in robotics and machine-vision technologies is then described as a typical example of mechatronics research. Expectations for the future of these technologies are also mentioned from the viewpoint of industry, emphasizing the importance of considering the reliability in robotics and of studying real-time color video processing in machine vision. These fields are becoming increasingly important for establishing a productive, efficient, secure, and stress-free society through factory, office, and social automation.


1 Introduction

Research on mechatronics has made great progress over the last 30 years. In particular, industrial robots and automatic machines equipped with machine vision have played an important role in our society and have contributed significantly in making it productive and efficient. In this paper, we briefly discuss the present status of robotics and machine vision technologies from the industrial viewpoint and suggest some directions of these fields in the future.

2 Trends in Industrial Technologies

Industrial technologies and products are being always innovated. And to improve their functions and performances, novel materials, parts, mechanisms, architectures, and algorithms are continuously being put into use. Higher precision, higher resolution, and higher speed are some examples of targets being aimed at. These targets form the main trend in the technology development underlying the daily activities, especially in the manufacturing industry. By observing the history of the technology over the last 30 years, we can see four more trends in addition to this fundamental one.

The first trend is to make things smaller. Downsizing in computer systems has made it possible to distribute computing power everywhere. In this background, there have always been drastic improvements in semiconductor technology. Size reduction, power reduction, and scale increase in integration are the main efforts made in the development of electronic parts. As a result, a large number of mobile phones and personal data assistants have also become available. By taking advantage of semiconductor technology, the effort to make mechanical parts smaller has also started, forming a new research area called micro machines. 

The second trend is to combine things for synergism. Variously advanced technologies are integrated or fused in order to achieve novel technologies and machines. The fusion of different technologies usually necessitates another technology that behaves as a catalyst or an adhesive. Mechatronics is one typical example of the fusion between mechanical engineering and electronic engineering, to which automatic control technology serves as the adhesive. In recent years, network technology is becoming a powerful adhesive to create new technologies.

The third trend is to make things more intelligent. The end users of most products are humans; thus, the products must have a friendly interface so they are easy to use without any perplexity. As the visual function is the most prominent function in humans, machines with image-based interfaces and multimedia handling capabilities are becoming more and more important.

The forth trend is to make things more environmentally friendly. Without harmony with the environment, no products will be validated from the viewpoint of product liability. It is thus being urged nowadays to employ designs that are easy to disassemble and destroy and to use materials and parts that are easy to reuse. 

These four trends in addition to the main underlying one will continue in the future; as a result, low-cost, high-performance, highly-functional, meaningful and tender products will be expected to appear.
 

3 Perspective of Mechatronic Products

Classical machines such as the windmills and clocks of centuries ago were purely mechanical; they used cams, cranks, and gears for changing the motive power to other modes of motion. With the lapse of time, more complicated and sophisticated motion became attainable by the combination of rather simple mechanisms each controlled at first electrically and afterwards electronically. In the 1970s, the importance of the boundary between mechanical and electronic engineering was recognized more clearly among researchers by being encouraged with the improvement of microcomputer and memory technologies. The compound word “mechatronics” was thus formed by enthusiastic Japanese researchers in control field, and was proposed to best describe this new area of the research. 

In addition to industrial robots, typical examples of mechatronic products are facsimiles, printers, video cameras, magnetic storage devices, optical storage devices, automatic teller machines, automatic ticketing machines, and automatic mail-sorting machines. A common feature of these products is the handling of thin and weak objects such as paper sheets and plastic tapes/sheets/disks. Handling of these objects is not easy or is almost impossible without using sophisticated electronic control. 

Another common feature of the mechatronic products is that they handle some sort of information. Massive amounts of data received are correctly recorded, are re-constructed as documents and pictures, or are analyzed and used as the information for classification of objects. In other words, dense data exchange with other electronic devices or with the outside world is essential in most of those products. 

Mechatronics has thus been an inevitable solution to the problems that electronics or mechanics alone could not solve. And nowadays, mechatronics can further be recognized as a concept representing “electronically controlled information-handling machines”, in which the electronic, control, information, and mechanical engineering are tightly integrated.

In terms of price-to-weight ratio and the ratio of electronic to mechanical parts, some existing mechatronic products are plotted in Fig. 1 to clarify their relative positions among other industrial products [1]. The mechatronic products have a fairly balanced number of electronic parts and mechanical parts, whose ratio roughly ranges from 0.5 to 2.0. However, as semiconductor technology progresses, the number of electronic parts is apt to be reduced through higher integration, so the ratio will shift to the more mechanical direction. Furthermore, since it is easier to reduce the size and add value to electronic parts compared to mechanical ones, mechatronic products will be shifted in the electronic direction. Thus, the ratio of the number of parts is merely a reference index and does not simply represent the degree of how electronic or how mechanical the product is. 

As shown in Fig.1, the price of all industrial products normalized by their own weight ranges between 1-1000 yen/gram (0.3-300 dollars/ounce). For example, both an office-use large-scale laser printer and home-use tiny ink-jet printer cost 10-20 yen/gram. Of course, products with a long history, high maturity, high popularity, and keen competition are apt to have a lower price per unit-weight. It is also interesting to note that, with exception of precious metals, artistic works, antique objects with a scarcity value, and recent computer software, data, and information contents, all industrial products that humankind can create to date range between the price of gold and grain (rice or wheat). In other words, our industrial activity has not produced anything more valuable than gold.




4 Robotics Technology

4.1 Robotics in Japan

Industrial robots are the typical mechatronic products to which, in our knowledge, the word “mechatronics” first began to be used. It has been more than three decades since robots turned from being fictional creatures to engineering reality; thus, mechatronics has a history of more or less 30 years.

The Japan’s first introductory paper on the robotics technology appeared in an academic journal in 1963. Five years later, two types of early industrial robots were imported from the USA. And two years later, in 1970, Japan’s original intelligent robots with computer-controlled vision systems were developed at two different laboratories and were demonstrated successfully. Since then, industry became aware of and interested in these novel technologies. So drastic efforts to create industrial robots began; as a result, the technology attained to its peak. For example, horizontally articulated high-precision assembly robots (SCARA-type, which stands for Selective Compliance Assembly Robot Arm) were invented and have gained worldwide popularity. Also, the enthusiasm of industry leaders to utilize robots in their factories made Japan one of the most advanced industrial societies by installing and operating, at present, more than 400,000 industrial robots in total.

4.2 Industrial Robots

Advances in the technology of industrial robots, however, seem to have stagnated in recent years. It is thus clearly the time to start new investigations to break through this stagnant situation in the present robotics industry. In addition to the continuous efforts to achieve higher speed, higher precision, lower cost, safer, and more reliable industrial robots, young robotics researchers face three challenges [2].

The first challenge is to shift attention from manufacturing to design processes. By attempting to effectively automate whole “production” processes, automation must be expanded from the product-manufacturing process to the more upstream processes; i.e., to product realization and further to product incubation. Total production efficiency will not be attained if robotics technology is restricted within the manufacturing process. Future robotics researchers are expected to play an important role in the automation of product incubation and realization processes in order to facilitate the smooth transition of design knowledge among these processes, thus achieving truly efficient production. 

The second challenge is to shift attention from single to multiple capabilities. With the progress of “individualization and sensuousness”, the variety of product types each being added with a different value may be required to appear. Production methods will thus be affected, and some factories are again starting to adopt a traditional assembly method, by which a worker is responsible for assembling the whole product from beginning to end. This means the return to a multi-functional operation from a distributed single operation used in conveyor-type assembly. Therefore, more flexible multi-functional robots may be needed as cooperative assistants to human workers. In these robots, a skill in performing a single task must be combined with other skills in different tasks in order to perform a complex operation, in which tasks are so dexterously and sometimes concurrently performed that no single task is separately noticeable. Research on skill acquisition of robots by experience-based self-learning and research on giving the learning incentives to robots are quite difficult but will be rewarding in the future. 

The third challenge is to shift attention from assembly to disassembly. Applications of robots to the disassembly of the used products for recycling seem to be important for the future. The concept of an adaptive robot for ecologically conscious industry has been proposed. It reads the identification number and the manufacturer of wasted products from a tag attached in advance when they were produced, and retrieves the product knowledge via a network so it can decide on the optimum disassembly sequence. In disassembly applications, objects can probably be handled quite differently. Robots may be able to throw objects or scrape up the destructed parts for efficient handling. Thus, a new technological paradigm may arise.

4.3 Autonomous Robots

Although recent research on autonomous robots is fairly active, what is missing seems to be a clear vision of possible applications and attention to reliability and power-related issues [1]. The control technology for legged robots has greatly progressed in recent years. However, the researchers are apt to focus only on information-related issues and energy-related issues are sometimes ignored. Thus, prototype intelligent robots often have long power-supply trails or carry heavy batteries on their backs, which last only 30 minutes or so, and have to rest for a couple of hours for re-charging. The past tendency of the power density increase in batteries is only twice in ten years, so it may take 50 more years to develop the right types of batteries for autonomous robots with appropriate energy density, size, and weight. However, the recent developmental efforts of battery-driven cars have given hope that the inconveniences of batteries will be overcome. Robotics researchers must also pay more attention to this problem and not merely wait until someone else solves it. In immediate applications, however, a systems-engineering approach may give an alternative solution to this problem until the right types of batteries are developed. 

Symbiotic robot usage may become more serious in a society envisioned by some researchers; namely one where robots wander around the street. However, this is not the type of society we are aiming for. Robots operating in the street should probably be made smaller, lighter, and less speedy than humans for the safety reasons. Then, what can these robots do for humans? Carrying objects for humans may probably be too slow to be realistic, and carrying information for transmitting to and from humans can probably be done by electronic post at street corners by means of wireless IC cards or wristwatch type novel electronic devices.

Recently, two-legged robots developed by Japanese companies have drawn considerable attention from the public. Also, a variety of animal-like pet robots with some sensor-based learning capabilities are being marketed. These products are fresh breezes to blow through the stagnant robotics industry, and they are welcome in the sense that such mind-stimulating robots will reactivate the robotics industry. The hidden problem is, however, the lack of future perspectives on the effects of these robots. A society in which the stimulation of the human mind can only be achieved by such artifacts will not be a sound society. We have to note that the artifacts also have a life span and will die. Too much emotion and reliance on these robotic artifacts may pose another unexpected problem in the future. 

Autonomous robots are in fact very interesting not only for technologically innocent people but also for engineers. Many robot applications can be conceived for the use in the power industry, in the construction industry, and in disaster prevention and rescue. Also, autonomous robots may be helpful in understanding human functions and behavior. Especially in the educational environment, prototyping of an autonomous robot will provide dreams for young students as well as the joy of learning mechanisms and creating new things. However, one anxiety is that this may bring up “day-dreamers”. Their prototype robots may only work once and may not have to work again once they are taped on video. That is, education based on reliability seems to be missing. Reliability is a very important concept for every industrial activity, especially for design work, which the students may engage in after graduation.
 

5 Machine Vision Technology 

5.1 Factory Applications

Machine vision is an important function that future robots must have. One of the first successful applications of machine-vision technology was to the automatic assembly of transistors in 1973 [3]. By using local-parallel-type image processor as a means of real-time pattern matching, two-dimensional positions of transistor chips in microscope images were automatically determined for wire bonding. Later on, this application was expanded to the assembly of ICs and LSIs and also to the inspection of semiconductor wafer patterns and printed circuit boards [3][4]. Consequently, the assembly and inspection of electronic parts became a significant market for the machine vision. 

Meanwhile, the performance of microprocessors improved greatly and the capacity of memory chips increased drastically. Also, special-purpose image-processing LSIs were developed and became available. These all facilitated the use of gray-level images instead of binary ones until then, and contributed to production of general-purpose, programmable machine-vision systems. Such systems are now being used in many industrial sectors such as electronics, machinery, medicine, and the food industry for the automation of manufacturing processes [5][6]. 

In brief, the machine-vision systems used in factory automation (FA) were aimed at realizing a productive society, and progress was made in such applications as:

・ Position detection for assembly,

・ Shape detection for classification, and

・ Defect detection for inspection.

Though these machine-vision systems do not look like robots, it is our belief that they can also be called robots, because their aim is to realize automatic execution of tasks that had, until then, only been done by skilled human workers.

5.2 Office Applications

In addition to the FA, applications of machine-vision technology to office automation (OA) have also progressed. For example, by fusing the data from optical and magnetic sensors, automatic recognition of monetary bills was achieved and led to automated teller machines (ATM) for banks. The development of technologies to analyze drawings, maps, and documents followed. The resulting geographic information systems (GIS) and document readers are now being utilized for facility management at public service companies, for customer management at banks, and for crisis management and disaster simulation at local governments. By using newly developed mail-sorting machines, postal automation systems are now being renovated. Such systems can automatically read hand-written Kanji addresses and convert them to transparent barcodes, which are later used for mail assembly in the order of the houses on delivery route [7].

In short, machine-vision systems were used for OA with the aim of realizing an efficient society, and progress has been made in such fields as:

・ Efficient handling of large-scale data, and

・ High-precision and high-speed recognition and handling for paper-based information.

In other words, OA has been a typical application field of not only machine vision but also other mechatronic products, and it will continue to be one in the future. The characteristics of the image processing being utilized in machine-vision applications are illustrated in Fig. 2. In addition to FA and OA, the next important target of application seems to be SA, i.e., social automation.
 

5.3 Social Applications

In general, the things that move in our society are humans, physical objects, money, and information. The mobility of the last two is becoming fairly smooth, quick and efficient, thanks to the progress of network technology. 

The mobility of humans is realized by traffic systems, and traffic control is a key technology for achieving a smooth flow of humans. The first application of machine vision to traffic use was probably the elevator-eye system in 1977 [7], At that time, however, the number of systems installed was very limited, so the business was not too successful. In this system, a video camera installed in each elevator hall recognized the number of persons waiting for the elevator at each floor, and designated an elevator to quickly serve the crowded floor. Nowadays, many studies are being done in the field of car traffic control, where the recognition of license number plates, traffic flows, the degree of congestion, and illegally parked cars, etc. is being tried. These approaches will probably be combined as an important function of the future intelligent transport system (ITS), which is a typical example of SA.

Physical distribution systems control the flow of physical objects. They usually identify and dispatch a variety of objects by the help of barcode systems. Conventional machine-vision systems are mainly for shape and position recognition, and will not simply solve the identification problem especially in large-scale physical distribution systems. Instead, as an extension or substitution of barcodes, a novel machine vision combined with wireless semiconductor-chip type “information carriers” may probably play an important role in future physical distribution systems. 

As a part of SA, machine-vision applications will also realize a more comfortable and secure human life. For example, a river-water monitoring system using fish behavior, an intruder-monitoring system based on image-change detection at railroad crossing, and a remote monitoring system of volcanic activities have been developed. A GIS-based restoration-assistance system after a disaster such as an earthquake and an intelligent on-line monitoring system during medical surgery have also been developed and are being planned for further improvement. As indicated in these examples, applications of machine-vision technology to human welfare, medicine, and environment improvements are very promising. 

The key concept representing the future seems to be the realization of a calm society; that is one in which security will be guaranteed and all uneasiness will be relieved through SA, more specifically, through “networked” social automation. The SA is a synonym of “networking technology” in one sense, and the “networked machine vision” may become an important key technology for realizing SA. In this sense, the most important objectives of machine vision may eventually be converged to the realization of the following two functions:

・ 24-hour/day abnormality monitoring via networks, and

・ Personal identification via networks.

To date, motion capturing, face recognition, facial expression understanding, gesture recognition, sign language understanding, and behavior understanding are keen interests among researchers. The progress of machine-vision technology to date and in the future is summarized in Table 1.
 

5.4 Dynamic Video Processing

In most SA applications, dynamic image processing, which analyzes and/or synthesizes color-video images in real-time, will be the key to success [7]. The technology already developed includes real-time scene-separation based on an algorithm for quickly finding changes between consecutive image frames [8]. This technology not only forms the basis of video editing but also forms the basis of abnormality finding. For example, a broadcasting company has put it into use for video inspection so that subliminal advertising can be detected before a video goes on the air. 

One interesting example of dynamic video analysis and synthesis is “Tour into the picture (TIP)” technology. A 2-D picture is scanned into a computer and is interpreted as a three-dimensional model by fitting vanishing lines on its display. The picture can then be looked at from different angles and distances [9], thus generating a motion video from a single picture. For example, viewers can feel as if they were taking a walk in an ancient city when an old picture of the city is available.

Another dynamic image processing technology is called “Cyber BUNRAKU”, in which human facial expressions are recognized by small infrared-sensitive reflectors put on a performer’s face. By combining the facial expressions thus obtained with the limb motions of a “Bunraku doll”, which is usually used in traditional Japanese theatrical performance, a 3-D character model in a computer can be choreographed in real-time to create video images [10]. This technology will be one of the fastest and most effective ways to produce multimedia animation programs.

These simple methods mentioned above can combine real images and computer graphics together and, thus, can easily create video contents for use in mechatronic products. For example, an ATM can be equipped with animation-based sign-language instructions, and a rehabilitation machine for walking-disabled persons can be fitted with a specific video for both instruction and enjoyment during the tedious training period.
 

6 Conclusion

An industrial view on robotics and machine-vision technologies, together with that on mechatronics, was briefly introduced. Expectations for the future of these technological fields were also mentioned, emphasizing the importance of the reliability consideration in robotics and the importance of studying real-time color video processing for novel machine vision. It was concluded that these fields would be important in establishing a productive, an efficient, as well as a calm society through networked social automation.
 

References

[1] Ejiri, M.: “Towards meaningful robotics for the future; Are we headed in the right direction?”, Robotics and Automation Systems, 18, pp.1-5 (1996)
[2] Ejiri, M: “Expectations of new robotic technology from an industrial viewpoint”, Journal of RSJ, 16, 1, pp.39-40 (1998) (in Japanese)
[3] Ejiri, M.: “Machine vision: A practical technology for advanced image processing”, Gordon & Breach Sci. Pub., New York (1989)
[4] Ejiri, M., Yoda, H., and Sakou, H.: “Knowledge-directed inspection for complex multi-layered patterns”, Machine Vision and Applications, 2, pp.155-166 (1989) 
[5] Ejiri, M.: “Recent image processing applications in industry”, Proc. 9th SCIA, Uppsala, pp. 1-13 (1995)
[6] Ejiri, M.: “Machine vision: A key technology for flexible automation”, Proc. of Japan-U.S.A. Symposium on Flexible Automation, Otsu, Japan, pp. 437-442 (1998) 
[7] Ejiri, M., et al.: “Evolution of real-time image processing in practical applications”, Proc. IAPR MVA, Tokyo, pp. 177-186 (2000)
[8] Nagasaka, A., Miyatake, T., and Ueda, H.: “Video retrieval method using a sequence of representative images in a scene”, Proc. IAPR MVA, Kawasaki, pp. 79-82 (1994)
[9] Horry, Y., Anjyo, K., and Arai, K.: “Tour into the picture: Using a spidery mesh interface to make animation from a single image”, Proc. ACM SIGGRAPH 97, pp. 225-232 (1997)
[10] Arai, K. and Sakamoto, H.: “Real-time animation of the upper half of the body using a facial expression tracker and an articulated input device”, Research Report 96-CG-83, Information Processing Society of Japan, 96, 125, pp. 1-6 (1996) (in Japanese)