• On The Insider: Robert Downey Jr Injured on the Set

August 24, 2004 1:42 PM PDT

Japan designers shoot for supercomputer on a chip

  • Print
PALO ALTO, Calif.--Chip designers at Japan's RIKEN say you can get a lot done by specializing.

RIKEN, an anglicized acronym for Japan's Research Institute of Physical and Chemical Research, described on Tuesday the MDGrape 3, a processor it thinks will become the cornerstone of a computer capable of operating at a petaflop, or a quadrillion operations per second--far faster than the 36 trillion ops supercomputers of today.

Samples of the chip, which was designed for life sciences research, can now perform 230 gigaflops, or 230 billion operations per second, while running at 350MHz, better than standard general-purpose chips. In a worst-case scenario, the chip performs 160 gigaflops at 250MHz, said Makoto Tanji, a researcher with RIKEN's high-performance computing group. Tanji spoke at the Hot Chips conference taking place at Stanford University.

The computational power comes, he said, because the chip is specialized for workloads that involve numerous, similar calculations on a comparatively small set of data. This sort of workload is common in the life sciences and bio-nanotechnology field, where researchers need to examine, for example, how a single protein interacts with thousands of different molecules. Consequently, the chip and the computers based on it can be directly compared with general purpose supercomputers only in a limited field, but the processor excels there.

"We can obtain about a 100 times better performance through specialization. The number of operations are more limited on a general purpose computer," Tanji said. For the MDGrape 3 to shine, "the amount of computation must be much larger than the data," he added.

The University of Tokyo initiated the MDGrape project 15 years ago to develop a chip for astrophysics. RIKEN, which is one of the world's largest biosciences institutes, has worked over the last several years to extend the chip's architecture to life sciences and molecular dynamics because the range of applications is wider, Tanji explained. The group will create computers based on the chip for its Protein 3000 project to determine the characteristics of 3,000 proteins. Those machines should appear sometime in 2007.

Commercial systems using the MDGrape 2, which can churn at 16 gigaflops and run at 100MHz, are currently on the market, Tanji said. Work on the MDGrape 3, also know as the Protein Explorer, began in 2002, and the chip should start to be used to run applications in 2006.

Research also continues at the University of Tokyo to develop a quasi general purpose chip capable of 1 teraflop, or a trillion calculations a second. IBM and the University of Texas have a similar teraflop-on-a-chip project.

Architecturally, the MDGrape 3 differs substantially from most other chips. It comes with 20 pipelines for calculations, the equivalent of an assembly line for a processor. Commercial chips typically have one or two. The chip also features what RIKEN calls a broadcast memory architecture, where data is force-fed to the different pipelines simultaneously. Parallelization, a design convention that aims to cut down on redundant or parallel calculations, is optimized in the chip's design.

Despite the differences from other chips, the MDGrape 3 is built on the 130-nanometer process, a manufacturing convention that has been in place for the past few years.

The enhancements lead to huge advantages over general purpose processors. Tanji said the 350MHz Grape 3 can provide a gigaflop of computing power for $15, compared with $400 per gigaflop for a Pentium 4, $640 per gigaflop for the chips inside IBM's Blue Gene/L and a whopping $4,000 per gigaflop from NEC's Earth Simulator, currently the world's most powerful supercomputer.

In terms of power consumption, the 350MHz MDGrape 3 consumers 14 watts of power, or 0.1 watts per gigaflop. A 3GHz Pentium 4 runs at 82 watts, or 14 watts per gigaflop, he said. The Blue Gene/L chip and Earth Simulator come in at 6 and 128 watts, he said.

RIKEN is also designing the computer that will house the MDGrape 3. Twelve chips will fit on a board, while two boards will fit into a 2U-high box (3.5 inches). The chips are all connected to each other through an 81-bit bus, and the boards are connected to the rest of the computer through PCI Express.

The petaflop computer will consist of 6,144 processors on 512 boards clustered together. In all, the system will fit into 32 boxes that will stand on 19-inch pedestals.

"It is very small," Tanji said.

See more CNET content tagged:
supercomputer, life science, calculation, Japan, designer

Add a Comment (Log in or register) 3 comments
comparison
by emc2 August 24, 2004 3:58 PM PDT
From: Vareck Bostrom (v.bostrom@attbi.com)
Subject: Re: 18 gigaflops?
Newsgroups: comp.sys.mac.advocacy
Date: 2003-01-09 22:06:28 PST

I ran the old flops 2.0 test some time ago on both of my own macs, a Pentium 4 1.7 GHz from work and an Itanium 733 Mhz (also from work). I got:

CPU flops(6) - was peak for all machines
Itanium 733 2022.8883 MFLOPS (2.2 GigaFlops)
P4-1700SSE 1332.1317 MFLOPS (1.3 GigaFlops)
P4-1700 475.3642 MFLOPS (0.475 GigaFlops)
G4-800DP 308.3056 MFLOPS (0.308 GigaFlops)
G3-700 311.9328 MFLOPS (0.311 GigaFlops)

In article <dmartinZZZ-0901032106520001@192.168.0.5>,
dmartinZZZ@ufl.edu (Danny Martin) wrote:

> Apple's web site touts the twin 1.25 ghz processor turning 18 gigaflops. Is it valid to compare gigaflops between PPC chips and Pentiums? If so, how many gigaflops does a 2 ghz Pentium turn?

A 2 GHz Pentium 4 would manage about 8 GFLOPS single precision peak. This has come and gone in this newsgroup a couple times and I'm not
really sure how apple comes up with their numbers.

This is all theoretical peak and it doesn't really have much to do with real performance, it's mostly just a matter of adding numbers together
Reply to this comment
its a special use processor
by emc2 August 24, 2004 4:07 PM PDT
Article says, "The computational power comes, he said, because the chip is specialized for workloads that involve numerous, similar calculations on a comparatively small set of data. This sort of workload is common in the life sciences and bio-nanotechnology field"

Another words the performance would really suck if you tried to run an normal desktop application on it, but its really good at adding and multiplying huge mega-sets of data. Sort of like an idiot savant.
Reply to this comment
small set of data
by emc2 August 24, 2004 4:23 PM PDT
"similar calculations...on small set of data".

This phrase is really interesting. This seems to suggest the perfermance test case is just holding the data in its CPU's registers and performaning the same calculations over and over again without needing to access the external memories that would slow it down.

Its fast, but its not a very useful thing for a general purpose processor to do. Its kind of silly to compare this processor with a general purpose computer. Great, it can calculate lots of stuff if it sits in a black box and doesn't communiate with the outside world and your program needs to repeat the same caluation over and over again.
advertisement

Latest tech news headlines

RSS Feeds

Add headlines from CNET News to your homepage or feedreader.

More feeds available in our RSS feed index.

Markets

Market news, charts, SEC filings, and more

Related quotes

Dow Jones Industrials (3.31%) 270.00 8,419.09
S&P 500 (3.99%) 32.60 848.81
NASDAQ (3.70%) 51.73 1,449.80
CNET TECH (3.64%) 36.93 1,051.13
  Symbol Lookup
advertisement

Inside CNET News

Scroll Left Scroll Right
  • Business Tech

    IPOs a thing of the past?

    At AlwaysOn Venture Summit West conference, investment bankers, venture capitalists, and private equity players weigh in on the prognosis for the IPO market.

  • Gallery

    Photos: Space station marks a decade aloft

    The first pieces of the International Space Station went into orbit 10 years ago. Now a full-fledged lab facility, it continues to grow.

  • Security

    Apple deletes Mac antivirus suggestion

    Apple removes statement to customers urging them to use antivirus software, saying that Macs are safe "out of the box."

  • Beyond Binary

    Microsoft expands Vista SP2 testing

    Starting on Thursday, the software maker will make public a test version of the service pack update to Vista.

  • Video

    A toast to online wine A toast to online wine
  • Digital Media

    EFF, Bush administration spar over telecom immunity

    Feds tell district judge government must be allowed to protect the heartland. EFF says that is fine, but don't strip away constitutional rights.

  • Video

    Wi-Fi while you fly Wi-Fi while you fly
  • Gaming and Culture

    From Cy Young to video game fame

    Tim Lincecum, one of the best pitchers in baseball, was chosen to be the cover athlete for 2K Sports' next baseball game. On Tuesday, he did a motion-capture session for the game.

  • Green Tech

    Ta ta, Tesla

    Are the Valley-based VCs and big-wigs who back Tesla Motors really serious about asking the federal government for low-interest loans?

  • Gallery

    Photos: Top-rated reviews of the week

    Here are a few of CNET Reviews' favorite items from the past week, including Adobe suites, laptop bags, and a Panasonic flat panel TV.

  • The Download Blog

    Music and browsing take flight in Songbird

    Music and browsing mashup Songbird has kicked the remnants of its shell to the curb, and the program's main emphasis as a music browser couldn't be more clear.

  • Green Tech

    Ford accelerates electric-vehicle plans

    In its turnaround plan presented to Congress, Ford says it will invest billions in fuel efficiency and introduce a family of hybrid-electric and all-electric cars.