Preview

Limitations of a Superscalar Architecture

Good Essays
Open Document
Open Document
1541 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Limitations of a Superscalar Architecture
LIMITATION OF SUPERSCALAR MICROPROCESSOR PERFORMANCE
By: - Akshita Banthia (11BCE0475)
Abstract
In today’s world there is a new form of microprocessor called superscalar. In this several instructions can be initiated simultaneously and executed independently during the same clock cycle. The limitation of this feature is the handling of data dependencies. If not handled effectively, execution rate of more than one instruction per cycle is difficult to achieve. This case study uses multi bit scoreboard architecture to handle data conflicts for out of order execution and completion of instructions. The paper analyses the performance of the superscalar microprocessor by using two stimulation models which uses benchmark programs and one calculation model which uses queuing networks to derive the formula for data deficiencies from the peak performance.
Introduction
The single bit scoreboard is sufficient to detect dependency in processors with only one pipeline and also stop the flow of instructions until the line is cleared. Whereas a multi bit scoreboard is used in processors with multiple instructions. In this the multiple bit scoreboard in combination with temporary result registers will maintain the flow of the instructions and also to achieve peak performance a branch prediction unit is included.
Multi bit scoreboard architecture
Model
In this model the pipeline architecture has been implemented and it consists of four stages instruction fetch, instruction decodes, execution, and write-back. Instructions are fetched from the external memory or the cache memory to the instruction buffers and then transferred into the decoding units. The set of temporary registers are used as renaming registers for instructions with output and anti-dependencies. The branch prediction unit predicts the next stream of instructions. Data needed by the load/store instructions is handled by the data cache. In case of an interrupt the retire unit restores the proper

You May Also Find These Documents Helpful

  • Satisfactory Essays

    SD1230 Lab 1

    • 239 Words
    • 2 Pages

    2. What type of instruction is used to execute different code based on a condition? -Jumps…

    • 239 Words
    • 2 Pages
    Satisfactory Essays
  • Good Essays

    The operating systems job is managing the memory. The operating system is in charge of bringing this process into main memory (Stallings, 2012). However the processor must deal with references within the program. Branch instructions include an address to reference the order to be executed next. Data reference instructions include the address of the byte or word of data referenced. The processor hardware and operating system software must be able to translate the memory references found in the code of the program into actual physical memory addresses, reflecting the current location of the program in main memory. (Stallings, 2012).…

    • 573 Words
    • 3 Pages
    Good Essays
  • Better Essays

    Website Migration Project

    • 3004 Words
    • 13 Pages

    This project aims to produce a system that will adequately address Tony’s Chips system requirements. In light of this, the system’s architecture will consider all of the system’s requirements in its design. The system’s architecture will make use of the ideally performing applications. The project aims to create a cohesive system from the many available system components by putting emphasis on application compatibility. The project also aims at creating reliable recovery solutions for the system. This will be undertaken with the aim of enhancing system recoverability.…

    • 3004 Words
    • 13 Pages
    Better Essays
  • Good Essays

    Figure 1: Overall Block Diagram The goal of this project is to provide you with a more practical hands-on approach to computer architecture design problems. The processor complex you will be designing is a 32-bit version of the MIPS processor; however, the instruction set will be a small subset of the actual MIPS ISA. You should implement the end to end operation of the complex utilizing the VHDL hardware descriptive language. You may use any constructs within the VHDL language, however, the design must be of your own. Copying of any form from any other student or any internal or external sources is illegal and will not be accepted. The processor supports the three instruction formats: R-format, I-format, and J-format as described in the text book and lectures. Table I Summarizes the core set of instructions for your ISA. The memory is assumed to be byte addressable and each word is 32 bits.…

    • 1082 Words
    • 5 Pages
    Good Essays
  • Satisfactory Essays

    Biology Exam Paper

    • 2143 Words
    • 9 Pages

    Using a technology called ____, a processor can begin executing an instruction before it completes the previous instruction.…

    • 2143 Words
    • 9 Pages
    Satisfactory Essays
  • Good Essays

    The list of machine instructions that a typical CPU must be able to decode and execute is quite short. In fact, once a machine can perform certain elementary but well-chosen tasks, adding more features do not increase the machine’s theoretical capabilities. In other words, at a certain point, additional features may increase such things as convenience but add nothing to the machine’s fundamental capabilities. The degree to which machine designs should take advantage of this fact has lead two philosophies of CPU architecture.…

    • 458 Words
    • 2 Pages
    Good Essays
  • Satisfactory Essays

    The Little Man Computer

    • 752 Words
    • 4 Pages

    The LMC system architecture uses basic design elements to help illustrate the basic concepts of processing input and output. The basic architecture is comprised of a little man in a small room that performs basic tasks. The little man executes basic tasks by following a set of instructions. The main components of the LMC are mailboxes, calculator, inbox, and outbox. Like in a real computer, the memory in the LMC consists of mailboxes which are slots to hold stored information. The calculator is like the Arithmetic Logic Unit on a real computer, and the inbox and outbox are for receiving and outputting data similar to a keyboard and monitor.…

    • 752 Words
    • 4 Pages
    Satisfactory Essays
  • Satisfactory Essays

    Computer Architect

    • 744 Words
    • 3 Pages

    Instruction address calculation (iac): Determine the address of the next instruction to be executed. Instruction fetch (if): Read instruction from its memory location into the processor. Instruction operation decoding (iod): Analyze instruction to determine type of operation to be performed and operand(s) to be used. Operand address calculation (oac): If the operation involves reference to an operand in memory or available via I/O, then determine the address of the operand. Operand fetch (of): Fetch the operand from memory or read it in from I/O. Data operation (do): Perform the operation indicated in the instruction. Operand store (os): Write the result into memory or out to I/O.…

    • 744 Words
    • 3 Pages
    Satisfactory Essays
  • Satisfactory Essays

    Instruction set types

    • 431 Words
    • 2 Pages

    In this form of architecture, instructions are highly encoded in order to enhance the code density.Due to the way the instructions are packed together, results are in smaller program sizes, and they have slow memory access.…

    • 431 Words
    • 2 Pages
    Satisfactory Essays
  • Good Essays

    Branch Delay

    • 1747 Words
    • 7 Pages

    A technique for minimizing the effect of control dependencies is to separate the point where the branch operation takes effect from the branch tests. The branch instruction performs a test on a branch condition. If the test succeeds, the PC is modified, but the modification does not take effect immediately. This delayed branch allows one or more instructions following the branch to be executed in the pipeline whether the branch is taken or not.…

    • 1747 Words
    • 7 Pages
    Good Essays
  • Powerful Essays

    This paper explains the architecture development and challenges of multi-core processing, plus provides a glimpse into the upcoming Intel quad-core processors and the direction in which Intel is taking multi-core processors to the future. We discuss many of the benefits you will see as we continue to increase processor performance, energy efficiency, and capabilities. Multi-core processor capability is central to the Intel platform-centric approach. By enabling enhanced performance, reduced power consumption and more efficient simultaneous processing of multiple tasks, multi-core processors promise to improve the user experience in home and business environments.…

    • 2229 Words
    • 9 Pages
    Powerful Essays
  • Powerful Essays

    chip: Concepts, architectures, and implementations, IEEE Design and Test of Computers (2005), no. 5, 414–421.…

    • 15068 Words
    • 61 Pages
    Powerful Essays
  • Better Essays

    The MMX TM Technology extension to the Intel Architecture is designed to accelerate multimedia and communications software running on Intel Architecture processors (Peleg and Weiser). The technology introduces new data types and instructions that implement a SIMD architecture model and is defined in a way that maintains full compatibility with all existing Intel Architecture processors, operating systems, and applications. MMX technology on average delivers 1.5 to 2 times performance gains for multimedia and communications applications in comparison to running on the same processor but without using MMX technology. This extension is the most significant addition to the Intel Architecture since the Intel I386 and will be implemented on proliferation of the Pentium processor family and also appear on future Intel Architecture processors.…

    • 821 Words
    • 4 Pages
    Better Essays
  • Powerful Essays

    Mpmc on 8086

    • 2455 Words
    • 10 Pages

    • Architecture of 8086 microprocessor (8086 Internal Block Diagram) • Register organization • Special functions of General Purpose Registers • Memory Segmentation • Address Generation • Pin Description • Minimum & Maximum modes • Timing diagram.…

    • 2455 Words
    • 10 Pages
    Powerful Essays
  • Satisfactory Essays

    VI CSE Syllabus 010411052318

    • 2829 Words
    • 17 Pages

    The purpose of this subject is to cover the underlying concepts and techniques used in…

    • 2829 Words
    • 17 Pages
    Satisfactory Essays

Related Topics