You are on page 1of 7

How the Java Virtual Machine (JVM) Works - CodeProject

10,103,093 members (56,479 online) Sign in

home

articles

quick answers

discussions

features

community

help
Next

Articles Languages Java General

Article Browse Code Stats Revisions Alternatives Comments & Discussions (12)

How the Java Virtual Machine (JVM) Works


By Bikash Shaw, 25 Nov 2008
4.21 (25 votes)

About Article
How the Java Virtual Machine Works: Inside the JVM. Type Licence First Posted Views Article CPOL 25 Oct 2008 117,445

Introduction
In high-level programming languages such as C and C++, we write a program in a human-readable format, and a program called a compiler translates it to a binary format called executable code that the computer can understand and execute. The executable code depends upon the computer machine that we use to execute our program; it is machine dependent. In Java, this process of writing to executing a program is very similar, but with one important difference that allows us to write Java programs that are machine independent. Using an interpreter, all Java programs are compiled to an intermediate level called byte code. We can run the compiled byte code on any computer with the Java runtime environment installed on it. The runtime environment consists of a virtual machine and its supporting code.

Bookmarked 27 times Java Architect Dev Design Beginner , +

JVM is an Emulation
The difficult part of creating Java byte code is that the source code is compiled for a machine that does not exist. This machine is called the Java Virtual Machine, and it exists only in the memory of our computer. Fooling the Java compiler into creating byte code for a nonexistent machine is only one-half of the ingenious process that makes the Java architecture neutral. The Java interpreter must also make our computer and the byte code file believe they are running on a real machine. It does this by acting as the intermediary between the Virtual Machine and our real machine. (See figure below.)

Top News
Salaries for Developers and Designers across the world
Get the Insider News free each morning.

Related Videos

Related Articles
JVM Launcher The Common Language Runtime

http://www.codeproject.com/Articles/30422/How-the-Java-Virtual-Machine-JVM-Works[9/24/2013 4:49:14 PM]

How the Java Virtual Machine (JVM) Works - CodeProject (CLR) and Java Runtime Environment (JRE) Home Made Java Virtual Machine Understanding JVM Performance - It is Fast - Very Fast! Garbage Collections in Java Using the Java Native Interface in C# IKVM.NET in Details Espresso- Java & .NET Native Interoperability - Part I Intelligent Video Analytics Using Memory Mapped Files and JNI to communicate between Java and C++ programs Creating Your Own Freaking Awesome Programming Language Introduction to .NET How to create your own virtual machine Introduction to Terracotta DSO Java to JavaScript Communication Stack based vs Register based Virtual Machine Architecture, and the Dalvik VM Virtualization Renaissance Code .NET 2.0; Build Java; Run Linux .NET Platform Invoke Paradigm in Java (J/Invoke) Figure 2 - JVM handles translations Detect if your program is running inside a Virtual Machine

Figure 1 - JVM emulation run on a physical machine

The Java Virtual Machine is responsible for interpreting Java byte code and translating this into actions or Operating System calls. For example, a request to establish a socket connection to a remote machine will involve an Operating System call. Different Operating Systems handle sockets in different ways - but the programmer doesn't need to worry about such details. It is the responsibility of the JVM to handle these translations so that the Operating System and the CPU architecture on which the Java software is running is completely irrelevant to the developer. (See figure below.)

The Basic Parts of the Java Virtual Machine


Creating a Virtual Machine within our computer's memory requires building every major function of a real computer down to the very environment within which programs operate. These functions can be broken down into seven basic parts: A set of registers A stack An execution environment A garbage-collected heap A constant pool A method storage area An instruction set

Registers
The registers of the Java Virtual Machine are similar to the registers in our computer. However, because the Virtual Machine is stack based, its registers are not used for passing or receiving arguments. In Java, registers hold the machine's state, and are updated after each line of byte code is executed, to maintain that state. The following four registers hold the state of the virtual machine: frame , the reference frame, and contains a pointer to the execution environment of the current method. optop, the operand top, and contains a pointer to the top of the operand stack, and is used to evaluate arithmetic expressions. pc , the program counter, and contains the address of the next byte code to be executed. vars, the variable register, and contains a pointer to local variables.

http://www.codeproject.com/Articles/30422/How-the-Java-Virtual-Machine-JVM-Works[9/24/2013 4:49:14 PM]

How the Java Virtual Machine (JVM) Works - CodeProject

All these registers are 32 bits wide, and are allocated immediately. This is possible because the compiler knows the size of the local variables and the operand stack, and because the interpreter knows the size of the execution environment.

The Stack
The Java Virtual Machine uses an operand stack to supply parameters to methods and operations, and to receive results back from them. All byte code instructions take operands from the stack, operate on them, and return results to the stack. Like registers in the Virtual Machine, the operand stack is 32 bits wide. The operand stack follows the last-in first-out (LIFO) methodology, and expects the operands on the stack to be in a specific order. For example, the isub byte code instruction expects two integers to be stored on the top of the stack, which means that the operands must have been pushed there by the previous set of instructions. isub pops the operands off the stack, subtracts them, and then pushes the results back onto the stack. In Java, integers are a primitive data type. Each primitive data type has unique instructions that tell it how to operate on operands of that type. For example, the lsub byte code is used to perform long integer subtraction, the fsub byte code is used to perform floating-point subtraction, and the dsub byte code is used to perform long integer subtraction. Because of this, it is illegal to push two integers onto the stack and then treat them as a single long integer. However, it is legal to push a 64-bit long integer onto the stack and have it occupy two 32-bit slots. Each method in our Java program has a stack frame associated with it. The stack frame holds the state of the method with three sets of data: the method's local variables, the method's execution environment, and the method's operand stack. Although the sizes of the local variable and the execution environment data sets are always fixed at the start of the method call, the size of the operand stack changes as the method's byte code instructions are executed. Because the Java stack is 32 bits wide, 64-bit numbers are not guaranteed to be 64-bit aligned.

The Execution Environment


The execution environment is maintained within the stack as a data set, and is used to handle dynamic linking, normal method returns, and exception generation. To handle dynamic linking, the execution environment contains symbolic references to methods and variables for the current method and current class. These symbolic calls are translated into actual method calls through dynamic linking to a symbol table. Whenever a method completes normally, a value is returned to the calling method. The execution environment handles normal method returns by restoring the registers of the caller and incrementing the program counter of the caller to skip the method call instruction. Execution of the program then continues in the calling method's execution environment. If execution of the current method completes normally, a value is returned to the calling method. This occurs when the calling method executes a return instruction appropriate to the return type. If the calling method executes a return instruction that is not appropriate to the return type, the method throws an exception or an error. Errors that can occur include dynamic linkage failure, such as a failure to find a class file, or runtime errors, such as a reference outside the bounds of an array. When errors occur, the execution environment generates an exception.

The Garbage-Collected Heap


Each program running in the Java runtime environment has a garbage-collected heap assigned to it. Because instances of class objects are allocated from this heap, another word for the heap is memory allocation pool. By default, the heap size is set to 1MB on most systems. Although the heap is set to a specific size when we start a program, it can grow, for example, when new objects are allocated. To ensure that the heap does not get too large, objects that are no longer in use are automatically deallocated or garbage-collected by the Java Virtual Machine. Java performs automatic garbage collection as a background thread. Each thread running in the Java runtime environment has two stacks associated with it: the first stack is used for Java code; the second is used for C code. Memory used by these stacks is drawn from the total system memory pool. Whenever a new thread starts execution, it is assigned a maximum stack size for the Java code and for the C code. By default, on most systems, the maximum size of the Java code stack is 400KB, and the maximum size of the C code stack is 128KB.

http://www.codeproject.com/Articles/30422/How-the-Java-Virtual-Machine-JVM-Works[9/24/2013 4:49:14 PM]

How the Java Virtual Machine (JVM) Works - CodeProject

If our system has memory limitations, we can force Java to perform more aggressive cleanup and thus reduce the total amount of memory used. To do this, reduce the maximum size of the Java and C code stacks. If our system has lots of memory, we can force Java to perform less aggressive cleanup, thus reducing the amount of background processing. To do this, increase the maximum size of the Java and C code stacks.

The Constant Pool


Each class in the heap has a constant pool associated with it. Because constants do not change, they are usually created at compile time. Items in the constant pool encode all the names used by any method in a particular class. The class contains a count of how many constants exist, and an offset that specifies where a particular listing of constants begins within the class description. All information associated with a constant follows a specific format based on the type of the constant. For example, class-level constants are used to represent a class or an interface, and have the following format:
Collapse | Copy Code

CONSTANT_Class_info { u1 tag; u2 name_index; }

where tag is the value of CONSTANT_Class , and the name_index provides the string name of the class. The class name for int[][] is [[I. The class name for Thread[] is [Ljava.lang.Thread;.

The Method Area


Java's method area is similar to the compiled code areas of the runtime environments used by other programming languages. It stores byte code instructions that are associated with methods in the compiled code, and the symbol table the execution environment needs for dynamic linking. Any debugging or additional information that might need to be associated with a method is stored in this area as well.

The Byte Code Instruction Set


Although programmers prefer to write code in a high-level format, our computer cannot execute this code directly, which is why we must compile Java programs before we can run them. Generally, compiled code is either in a machine-readable format called machine language or in an intermediatelevel format such as the assembly language or Java byte code. The byte code instructions used by the Java Virtual Machine resemble Assembler instructions. If you have ever used Assembler, you know that the instruction set is streamlined to a minimum for the sake of efficiency, and that tasks, such as printing to the screen, are accomplished using a series of instructions. For example, the Java language allows us to print to the screen using a single line of code, such as:
Collapse | Copy Code

System.out.println("Hello world!");

At compile time, the Java compiler converts the single-line print statement to the following byte code:
Collapse | Copy Code

0 3 5 8

getstatic #6 <Field java.lang.System.out Ljava/io/PrintStream;> ldc #1 <String "Hello world!"> invokevirtual #7 <Method java.io.PrintStream.println(Ljava/lang/String;)V> return

The JDK provides a tool for examining byte code called the Java class file disassembler . We can run the disassembler by typing javap at the command line. Because the byte code instructions are in such a low-level format, our programs execute at nearly the speed of programs compiled to machine language. All instructions in machine language are represented by byte streams of 0s and 1s. In a low-level language, byte streams of 0s and 1s are replaced by suitable mnemonics, such as the byte code instruction isub. As with assembly language, the basic format of a byte code instruction is:
Collapse | Copy Code

http://www.codeproject.com/Articles/30422/How-the-Java-Virtual-Machine-JVM-Works[9/24/2013 4:49:14 PM]

How the Java Virtual Machine (JVM) Works - CodeProject <operation> <operands(s)>

Therefore, an instruction in the byte code instruction set consists of a 1-byte opcode specifying the operation to be performed, and zero or more operands that supply parameters or data that will be used by the operation.

Summary
The Java Virtual Machine exists only in the memory of our computer. Reproducing a machine within our computer's memory requires seven key objects: a set of registers, a stack, an execution environment, a garbage-collected heap, a constant pool, a method storage area, and a mechanism to tie it all together. This mechanism is the byte code instruction set. To examine byte code, we can use the Java class file disassembler, javap. By examining bytecode instructions in detail, we gain valuable insight into the inner workings of the Java Virtual Machine and Java itself. Each byte code instruction performs a specific function of extremely limited scope, such as pushing an object onto the stack or popping an object off the stack. Combinations of these basic functions represent the complex high-level tasks defined as statements in the Java programming language. As amazing as it seems, sometimes dozens of byte code instructions are used to carry out the operation specified by a single Java statement. When we use these byte code instructions with the seven key objects of the Virtual Machine, Java gains its platform independence and becomes the most powerful and versatile programming language in the world.

License
This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

About the Author


Bikash Shaw
Software Developer (Senior) Socit Gnrale India

Summary: 1. Extensive working experience in web and windows application development, database programming using Java and .Net technologies. 2. Good knowledge in SDLC and Processes like project planning, requirement gathering, development, test planning, release management and production support. 3. Good knowledge and working experience in following methodologies: a. Agile Development Approach b. Test Driven Development c. Behavior Driven Development d. Continuous Integration & Delivery Model. 4. Excellent communication & analytical skills, good team player, great mentoring capability. 5. Interaction with customer and team spread over different geographical area. Technologies / Languages: J2SE 5/6, J2EE, .Net 2.0/3.5, C#, ASP.NET, AJAX, XHTML, CSS, JavaScript, jQuery, PL/SQL, Web Services (SOAP based), Winforms, Hibernate, Spring, GWT, XML, XSD, XPath, SAX, JAXB, JUnit, JBehave, Mockito, Selenium, StringTemplate, Log4J, Apache Commons API Database: Oracle, SQL Server, MySQL, Sybase

http://www.codeproject.com/Articles/30422/How-the-Java-Virtual-Machine-JVM-Works[9/24/2013 4:49:14 PM]

How the Java Virtual Machine (JVM) Works - CodeProject

Application Server: Tomcat, Sun Application Server, JBoss, GlassFish, Jetty, IIS Development Environments: Eclipse, NetBeans, Visual Studio. Designing Tools: Enterprise Architect, Microsoft Visio Version Control: SVN, Perforce Build Management: Hudson & JCruisemonitor, TeamCity Bug Tracking Tools: HP Quality Center, Bugzilla, JIRA Specialties: 1. Agile Test and Behavior Driven Development 2. Continuous Delivery Approach 2. Spring IOC and MVC 3. Web Services Follow on
Twitter

Article Top

Sign Up to vote Poor

Excellent

Vote

Comments and Discussions

Hint: For improved responsiveness ensure Javascript is enabled and choose 'Normal' from the Layout dropdown and hit 'Update'. You must Sign In to use this message board. Search this forum


Go

Relaxed Medium Normal Profile popupsSpacing Relaxed Noise Medium Layout Normal

25 Update Per page 25

First Prev Next

AwesomE!!! My vote of 5 @jvm Thanks You forgot one important think -&gt; Invoke and BeginInvoke Re: You forgot one important think -&gt; Invoke and BeginInvoke Answer So what are the pitfalls of JVM? Re: So what are the pitfalls of JVM? [modified]

Sachin Verma sanchaman4 sonagara sanya.fesak jmanson

19-May-13 2:16 19-Feb-13 21:08 11-Apr-12 7:52 21-Sep-11 19:05 25-Nov-08 12:16

ashu fouzdar

1-Dec-08 23:40

Derek Bartram GriffinPeter User of Users Group

27-Oct-08 23:22 25-Oct-08 9:52 25-Oct-08 13:46

http://www.codeproject.com/Articles/30422/How-the-Java-Virtual-Machine-JVM-Works[9/24/2013 4:49:14 PM]

How the Java Virtual Machine (JVM) Works - CodeProject

Huh! JVM! And do you know how a large transformer works? Xcelent Re: Xcelent
Last Visit: 31-Dec-99 18:00 Last Update: 23-Sep-13 21:18 General News Suggestion Question Bug

bnk

25-Oct-08 3:07

Abhijit Jana Bikash Shaw

25-Oct-08 0:30 25-Oct-08 0:32 Refresh 1

Answer

Joke

Rant

Admin
Article Copyright 2008 by Bikash Shaw Everything else Copyright CodeProject, 1999-2013 Terms of Use

Permalink | Advertise | Privacy | Mobile Web03 | 2.6.1309023.2 | Last Updated 25 Nov 2008

Layout: fixed | fluid

http://www.codeproject.com/Articles/30422/How-the-Java-Virtual-Machine-JVM-Works[9/24/2013 4:49:14 PM]

You might also like