Šta je novo?

Nvidias kepler(gf6xx) comes in to focus.

nVidia ima "bolji" dogovor,eto cisto to ako nisi znao,na neki nacin ima privilegije da se prvo njima isporuci odredjen broj
kartica koje potrazuju.

Citirao sam sebe da bi se nadovezao na ovo
Obratiti paznju na datum mog posta i vest koja je plasirana na netu ;)
A da li sam to mogao reci jos mnogo ranije? :p
P.S.Inace vesti nisam ni gledao,procitah Kostin post da je izasao neki clanak pa sam tek sad i pogledao i linkovao ovde.

Verujte PCBeastu kad vam kaze :D
 
Poslednja izmena:
Prdavac :)

8SXy6.jpg


It seems that the rumors regarding the price of mid-range GeForce GTX 660 card, reported back in January, are true after all. According to OBR-Hardware GTX 660 is coming soon for at least $299.

GeForce GTX 660 will likely be the first card based on brand new Kepler GK106 chip. First leaks said that it will hold up to 768 CUDA cores gathered in 4 SMX clusters (architecture may actually be different from GK104). But if the architecture is similar to high-end GK104 chip then it would hold 64 texture mapping units (TMUs) and 24 raster operating units (ROPs). This would be the card equipped with 1536 or 2048 MB of GDDR5 memory. GTX 660 will also have slightly shorter memory interface of 192-bit. It would consume around 130 Watts, from PCI-E slot and one 6-pin connector.

As for the PCB of this card. It’s possible that the same board from GTX 670 card will be used. NVIDIA did some calculations and find this much cheaper than designing a new one. That PCB is already short, so the only modification we are expecting is to the power connectors count.

SemiAccurate reported back in January that first mid-range card from GeForce 600 Series would cost around $299, and this seems to find confirmation in OBR post, who suggests that it will be available for $299 to $329.

If this is indeed the real price of this card then we should expect another high performance GPU. We all know what a great mistake did NVIDIA make by releasing GTX 670 card, which is almost as fast as GTX 680 card (some units are even faster). However, this mid-range product will surely not reach a performance of GTX 670, but most likely outperform Radeon HD 7870 which is available for $339.

Picture presents only my interpretation of how it may look.

Bice tu jos jedan model sa oznakom Ti koliko je meni poznato.
 
$299 sa samo 768 CUDA WTF!!!!!
 
Al opet mnogo je $299 zar nije trebala biti $249.
 
Sacekaj da vidis performanse i pogledaj cene danasnjih 7870/7850 kartica.
Po tome se nVidia i upravlja.P.S.I ovo nije pouzdana informacija tako da ne treba uzimati zdravo za gotovo.
 
GK106 je logično da bude pola GK104 čipa ali te performanse mogu biti dovoljne za neki GTX660, GTX660 Ti bi mogao da bude GK104 sa 6 aktivnih SMX-ova.
 
Da..Da..Jedan SMX manje u odnosu na GTX670.
Karta ce imati 1152 shader jedinice,256 bit mem interfejs sa nesto nizim defaultnim klokovima od
GTX670.E sad ono sto me brine jeste da ce takva karta biti poprilicno blizu GTX670.
Ili ce da spuste malo osetnije klokove ili ce da preseku mem magistralu.
Mozda ce biti 192 Bit sa tim karakteristikama.
 
Kepler mobile GTX 680M

DDtE9.jpg


NVIDIA is preparing their flagship mobile graphics card to be presented during Computex 2012 event in Taiwan on June 5th.

NVIDIA GeForce GTX 680M is not a full GK104 Kepler chip, it doesn’t even have half of the CUDA cores of its desktop variant. GeForce GTX 680M has only 744 CUDA cores (but some listings suggest it has 768 cores – this still needs to be verified). Surprisingly, it has much higher GDDR5 memory capacity of 4096 MB. It seems that the only similarity to its desktop model is memory interface of 256-bit width. Rumors say that it should have consume up to 100 Watts of power.

This is a second revision of N13E-GTX 680M chip — A2 silicon. Card will support SLI configurations and the latest DirectX 11.1 version.

As for the performance, it’s 37% faster than GeForce GTX 670M based on Fermi architecture. First leaked benchmark results has been posted on Chinese website. Card is reaching 4905 points in 3DMark 11 in Performance Preset. That’s in a tandem of Core i7-3720QM Ivy Bridge processor.

Card will be released during Computex 2012 in June, but some manufacturers are already offering notebooks with this model.

6mLNa.jpg


uWstT.jpg
 
Nebi valjalo da je previse unakaze da bude puno sporija od 670 50e jeftinija :(
 
GK110

The future of GTC General Assembly, GK110, the new version of development tools and CUDA open source

Distance from the GTC 2012 General Assembly, less than 12 hours, NVIDIA boss Jen-Hsun Huang,
was published on 10:30 on the day the opening speech, in addition to preaching the latest technologies of NVIDIA's GPU computing,
the most heavy message be released 70 million transistors The behemoth "GK110 graphics.

The GK110 concerned in the end is new architecture chip or dual-core GK104 Tesla version is not known,
the argument of the latter sort of credibility, because GK104 is just 3.54 billion transistors,
dual-core GTX 690 power consumption and heat control is also very good The GK104 floating point capability before exposure can be resolved.

NVIDIA will not be designed based on the GTX, 690 Tesla Accelerator?

NVIDIA released the real the GK110 chip of the new structure, then in accordance with the previously leaked data,
which has six groups of GPC unit, 24-SMX unit, SMX unit design and the GK104 obviously different, each group of 128 CUDA cores. a total of 3072 CUDA cores,
twice the total number is also GK104, the core area of ​​a further increase to around 550mm2, the memory interface is also upgraded to 384bit, to maintain the standard of GTX 580.

GTC eve of the opening, NVIDIA has also released a new version of the the Nsight development tool, it is a debug and performance analysis tools based on GPU acceleration There are two versions for different platforms.

Version of Visual Studio for the Windows platform, the release of the version number is 2.2, and finally support DX9 architecture graphics (the previous version limit can only be used in DX10/11 graphics), which also supports the latest Kepler architecture to optimize the kernel for the SASS (Source and Assembly) and of PTX (Parallel Thread eXecution, parallel thread execution), the single card is responsible for the display output can also use the CUDA-enabled the Debug.

Nsight the Eclipse Edition supports Linux and Mac OS platform, the Eclipse name happens to be IBM ten years for the same platform release debug software in the name of the same Eclipse version Nsight support for CUDA to accelerate software development.

To create a good atmosphere to the 2012 General Assembly of the GTC, NVIDIA GPU computing, frequent moves, a few days ago has just announced the LLVM compiler will support the CUDA architecture, the LLVM is an open source modular compiler support C / C + +, Objective-C, Fortran, , Ada, Haskell, Java bytecode, Python, Ruby, ActionScript, the GLSL and Rust languages, the CUDA architecture, is currently limited to C and C + + and Fortran.

nvidia_cuda_llvm.jpg


NVIDIA This move means that CUDA also go open source, the increase of the supported languages ​​CUDA to accelerate the popularity of the developers,
but also step great move, it seems GK104 lost GCN no influence on the GPU computing performance, Perhaps NVIDIA intends to re-GPU computing and GPU gaming performance separately.
 
Poslednja izmena:
NVIDIA Tesla K20 To Be Based on Kepler GK110 GPU

bOeWU.jpg


So this is where all the rumors end, GK110 is not GeForce card, it’s brand new Tesla K20 GPU, which is coming in fourth quarter of 2012 –
as announced today by NVIDIA during GTC 2012.New chip is built with 7.1 billion transistors.

NVIDIA is preparing two Tesla GPUs, one is Tesla K10, based on dual-GK104 chips, which is to be devoted to high efficient computing in oil and gas exploration and defense industry.
Card is based on dual-GK104 GPUs and has a memory bandwidth of 320 GB/s.

Next is Tesla K20, the GK110 GPU which is coming in Q4 of this year. It will deliver three times more double precision performance compared to Fermi architecture-based Tesla cards.

First GK110 GPUs will be incorporated into Titan Supercomputer at the Oak Ridge National Laboratory in Tennessee and the Blue Waters system at the National Center for Supercomputing Applications a the University of Illinois.

New Kepler GK110 feature new technologies:

SMX
Delivers more processing performance and efficiency through this new, innovative streaming multiprocessor design that allows a greater percentage of space to be applied to processing cores versus control logic

Dynamic Parallelism
Simplifies GPU programming by allowing programmers to easily accelerate all parallel nested loops – resulting in a GPU dynamically spawning new threads on its own without going back to the CPU

Hyper-Q
Slashes CPU idle time by allowing multiple CPU cores to simultaneously utilize a single Kepler GPU, dramatically advancing programmability and efficiency


“Fermi was a major step forward in computing,” said Bill Dally, chief scientist and senior vice president of research at NVIDIA. “It established GPU-accelerated computing in the top tier of high performance computing and attracted hundreds of thousands of developers to the GPU computing platform. Kepler will be equally disruptive, establishing GPUs broadly into technical computing, due to their ease of use, broad applicability and efficiency.”

NVIDIA Tesla K10

we’re announcing two products today, two kepler: Tesla K10 dedicated to seismic analysis where most valuable resource is bandwidth. it has 3x single precision of today’s fermi tesla and 1.8x the memory bandwidth

NVIDIA Tesla K20

second tesla, k20 is focused on double precision: 3x double precision of fermi, includes hyper q, dynamics parallelism, for stuff like physics, quantum chemistry, computational finance
 
NVIDIA Launches New Tesla K10 and K20 Graphics Cards

k9upd.jpg


GPU Technology Conference – NVIDIA today unveiled a new family of Tesla® GPUs based on the revolutionary NVIDIA® Kepler™ GPU computing architecture,
which makes GPU-accelerated computing easier and more accessible for a broader range of high performance computing (HPC) scientific and technical applications.

The new NVIDIA Tesla K10 and K20 GPUs are computing accelerators built to handle the most complex HPC problems in the world.
Designed with an intense focus on high performance and extreme power efficiency,
Kepler is three times as efficient as its predecessor, the NVIDIA Fermi™ architecture, which itself established a new standard for parallel computing when introduced two years ago.

“Fermi was a major step forward in computing,” said Bill Dally, chief scientist and senior vice president of research at NVIDIA.
“It established GPU-accelerated computing in the top tier of high performance computing and attracted hundreds of thousands of developers to the GPU computing platform.
Kepler will be equally disruptive, establishing GPUs broadly into technical computing, due to their ease of use, broad applicability and efficiency.”

The Tesla K10 and K20 GPUs were introduced at the GPU Technology Conference (GTC), as part of a series of announcements from NVIDIA, all of which can be accessed in the GTC online press room.

NVIDIA developed a set of innovative architectural technologies that make the Kepler GPUs high performing and highly energy efficient, as well as more applicable to a wider set of developers and applications. Among the major innovations are:

SMX Streaming Multiprocessor — The basic building block of every GPU, the SMX streaming multiprocessor was redesigned from the ground up for high performance and energy efficiency. It delivers up to three times more performance per watt than the Fermi streaming multiprocessor, making it possible to build a supercomputer that delivers one petaflop of computing performance in just 10 server racks. SMX’s energy efficiency was achieved by increasing its number of CUDA® architecture cores by four times, while reducing the clock speed of each core, power-gating parts of the GPU when idle and maximizing the GPU area devoted to parallel-processing cores instead of control logic.
Dynamic Parallelism — This capability enables GPU threads to dynamically spawn new threads, allowing the GPU to adapt dynamically to the data. It greatly simplifies parallel programming, enabling GPU acceleration of a broader set of popular algorithms, such as adaptive mesh refinement, fast multipole methods and multigrid methods.
Hyper-Q — This enables multiple CPU cores to simultaneously use the CUDA architecture cores on a single Kepler GPU. This dramatically increases GPU utilization, slashing CPU idle times and advancing programmability. Hyper-Q is ideal for cluster applications that use MPI.

“We designed Kepler with an eye towards three things: performance, efficiency and accessibility,” said Jonah Alben, senior vice president of GPU Engineering and principal architect of Kepler at NVIDIA. “It represents an important milestone in GPU-accelerated computing and should foster the next wave of breakthroughs in computational research.”

NVIDIA Tesla K10 and K20 GPUs
The NVIDIA Tesla K10 GPU delivers the world’s highest throughput for signal, image and seismic processing applications. Optimized for customers in oil and gas exploration and the defense industry, a single Tesla K10 accelerator board features two GK104 Kepler GPUs that deliver an aggregate performance of 4.58 teraflops of peak single-precision floating point and 320 GB per second memory bandwidth.

The NVIDIA Tesla K20 GPU is the new flagship of the Tesla GPU product family, designed for the most computationally intensive HPC environments. Expected to be the world’s highest-performance, most energy-efficient GPU, the Tesla K20 is planned to be available in the fourth quarter of 2012.

The Tesla K20 is based on the GK110 Kepler GPU. This GPU delivers three times more double precision compared to Fermi architecture-based Tesla products and it supports the Hyper-Q and dynamic parallelism capabilities. The GK110 GPU is expected to be incorporated into the new Titan supercomputer at the Oak Ridge National Laboratory in Tennessee and the Blue Waters system at the National Center for Supercomputing Applications at the University of Illinois at Urbana-Champaign.

“In the two years since Fermi was launched, hybrid computing has become a widely adopted way to achieve higher performance for a number of critical HPC applications,” said Earl C. Joseph, program vice president of High-Performance Computing at IDC. “Over the next two years, we expect that GPUs will be increasingly used to provide higher performance on many applications.”

Preview of CUDA 5 Parallel Programming Platform
In addition to the Kepler architecture, NVIDIA today released a preview of the CUDA 5 parallel programming platform. Available to more than 20,000 members of NVIDIA’s GPU Computing Registered Developer program, the platform will enable developers to begin exploring ways to take advantage of the new Kepler GPUs, including dynamic parallelism.

The CUDA 5 parallel programming model is planned to be widely available in the third quarter of 2012. Developers can get access to the preview release by signing up for the GPU Computing Registered Developer program on the CUDA website.

About NVIDIA Tesla GPUs
NVIDIA Tesla GPUs are massively parallel accelerators based on the NVIDIA CUDA parallel computing platform. Tesla GPUs are designed from the ground up for power-efficient, high performance computing, computational science and supercomputing, delivering dramatically higher application acceleration for a range of scientific and commercial applications than a CPU-only approach. Today, Tesla GPUs power three of the world’s top five supercomputers.
 
WHQbv.jpg


SGI, the trusted leader in technical computing, today announced the availability of a complete,
managed GPU solution of its SGI Rackable servers with the new high-performance NVIDIA Tesla K10 GPU Computing Accelerator.
Coupled with SGI Management and Performance Suite software, the solution is built,
completely integrated, and tested in SGI’s manufacturing facility so that it can be installed at the customer site and begin running application codes in less than a day.

The next-generation NVIDIA Tesla K10 GPU Accelerator is designed for throughput and performance per watt.
It features two ultra-efficient GK104 GPUs that provide up to 2X performance per watt for single precision applications than its predecessor, the M2090.
The NVIDIA Tesla K10 features two GK104 GPUs with 1536 CUDA cores each and 4 GB of memory operating at 160 GB/second per GPU.
These GPUs are shown to be particularly well suited to single high precision workloads, including seismic processing, signal, image and video analysis, and radio astronomy.

“The research team at the Swinburne University of Technology has benefited from significant improvements in processing power and research output since our SGI Rackable system was installed in July 2011,” said Dr. Jarrod Hurley, manager of Swinburne’s supercomputer. “That system was over ten times more powerful than its predecessor, and with the introduction of the new NVIDIA Tesla K10 GPU Computing Accelerator we’ll look forward to building even greater functionality and performance into our own system as our requirements evolve.”

The baseline server for the solution is the SGI Rackable C1104G-RP5, a 1U standard depth chassis, and the SGI 2110-RP5, a 2U standard depth chassis.
Each server features one dual-socket server powered by two Intel Xeon processor E5-2600 series, and 8 DIMM slots. The C1104 can accommodate up to four 2.5 inch hard disk drives,
three PCI-E 3.0 x 16 double-width expansion slots and dual-port GigE networking controllers, while the C2110 can accommodate up to ten 2.5 inch hard drives, four internal PCI-E 3.0 x 16 double-width expansion slots and dual-port GigE controllers. SGI is also announcing the availability of the SGI GPU Starter Kit with NVIDIA Tesla K10 GPU Accelerator offering a pre-configured bundle of 10 nodes.

“As a leader in design-to-order high performance computing (HPC) solutions with over three decades of experience in accelerators applied to high precision workloads, SGI is uniquely positioned to bring its customers hybrid solutions of industry-standard microprocessors with GPU accelerators,” said Bill Mannel, vice president of product marketing at SGI. “Our GPU clusters are all you need for a powerful deployment. We’re also announcing the SGI GPU Starter Kit, a 10-server cluster with 20 NVIDIA Tesla K10 GPUs for those who want to start enjoying the benefits of CUDA programming today in a cost-effective solution.”
 
Poslednja izmena:
Sudeći po vestima, piše da se čeka da 28 nm proces dovoljno sazri i tek onda će izbaciti GK110 za consumer korisnike. Ako je tačno da ovaj monstrum ima 3072 cuda cores i 384-bit memoriju i čak 7.1 milijardu tranzistora na površini od 550 mm2, onda će to zaista biti MONSTRUM.

Citat sa techpowerup-a, sa kojim se apsolutno slažem jer je to realno:

Launching the GK110 first as a Tesla part serves quite a few purposes. To mention a few, GK110 is launched to the market in a low-volume yet high-margin market; NVIDIA doesn't have to worry about volumes or pressure to launch the chip in its GeForce avatar, since it already plugged high-end graphics market with a splurgy $999 (read: $1200-ish) GeForce GTX 690. It's possible that GK110 makes its march to consumer platforms once 28 nm manufacturing has achieved maturity (in terms of volumes, it's a 13-year old).

http://www.techpowerup.com/166099/NVIDIA-GeForce-Kepler-110-(GK110)-Specs-Detailed.html
 
Poslednja izmena:
Znaci, ono sto se govorilo za 7 m. tranzistora je ipak GK110 :)

http://www.nvidia.com/content/PDF/kepler/NV_DS_Tesla_KCompute_Arch_May_2012_LR.pdf

Dakle, Tesla K10 je u stvari GTX 690 za HPC i njega ce prodavati za single-precision aplikacije, dok je Tesla K20 GK110 i on ide za double-precision:

http://www.nvidia.com/content/tesla/pdf/NV_DS_TeslaK_Family_May_2012_LR.pdf

With more than one teraflop peak double precision performance,
the Tesla K20 is ideal for a wide range of high performance
computing workloads including climate and weather modeling,
CFD, CAE, computational physics, biochemistry simulations, and
computational finance.

Spominje se 1 TFLOPS double-precision, sto i ne deluje preterano mnogo.

Ovde sam nasao navodno neke specifikacije za GK110:

GK110 - K20-specs or at least parts of it
The known facts at a glance GK110.
- 7.1 billion transistors
- (Each with 192 shader ALUs, Tesla products probably 13 or 14 active SMX), 15 Physical SMX
- 384-bit wide GDDR5 memory tailed (to accommodate as much as you can)
- As for the fourth quarter of 2012, Tesla announced that Quadro and GeForce yet undetermined
- "Dynamic Parallelism", "Hyper-Q" and "SMX"
- 6-pin & 8-pin power connector on the Tesla-K20 map
- DGEMM efficiency at 80-85% (Fermi at about 65%, Radeon> 90%)
- 3x higher DP peak throughput compared to the Fermi-based Tesla cards

http://translate.google.com/transla...Us-auf-GTC-2012-vorgestellt/Grafikkarte/News/

Ono sto se zna jeste da SMX ima 192 shader-a, isto kao kod GK104. Ostaje pitanje broja SMX-a. Gore spominju da fizicki GK110 ima 15, sto dovodi do cifre od ukupno 2880 shader-a. Ako je istina da bi Tesla K20 imala 13-14 aktivnih, uz dosta nize klokove Tesla karti (~750MHz) i uz odnos sinlgle-double recision performance 1/4, onda i moze da se dodje do onih ~1 TFLOPS double-precision.

geforcegtx680dieshot.jpg
GK104


kepler_dieshot.jpg
GK110

Mozda ce neko videti nesto iz slika iz cega ce moci da se izvuce jos neka informacija :)
 
Poslednja izmena:
Vec su ga "procenili" :p

beyKm.jpg

~520mm2

mVFb6.jpg
 

Holy crap! Znaci 2880 shader-a, 384-bit...


Izgleda da je tacno ono sto je Dusan K pricao - da dolazi vreme kada ce se GPU-ovi projektovati zasebno za gaming i za compute.
Ali gotovo sam siguran da ce se GK110 sledece godine pojaviti u consumer grafici. To znaci da nVidia u stvari samo menja strategiju izbacivanja chipova - ranije su uvek prvo izbacivali za gaming, a tek posle ~6 meseci za HPC. Sada je obrnuto (narano, pod uslovom da je ova pretpostavka tacna). Ovaj novi nacin ima mnogo vise smisla - na pocetku kad je yield manji i ne treba im puno ispravnih GPU-ova jer kostaju po nekoliko hiljada dolara/evra. Posle nekog vremena yield se popravi ili odrade nesto kao sto su odradili prelaskom sa GTX 400 na GTX 500 seriju i onda ga prodaju kao consumer proizvod.
Cak i kad se sagleda cela Kepler i cela Fermi serija, nema tu mnogo promena u strategiji - i kod Fermi-ja je GF100/110 bio okrenut na compute dok je GF104/114 ka gamingu, tako je i sada, samo sto je provo izbacen slabiji GPU.

EDIT: 520mm2, pa to je nista :D Obzirom da GK110 ima ravno 2x vise tranzistora od GK104, koji vec pakuje tranzistore daleko efikasnije od Fermi-ja, ocekivano bi bilo ~590mm2. Izgleda da su jos bolje 'kompresovani' kod GK110.
 
Poslednja izmena:
Mislim da je to "iznudjena" strategija. Da je hd7979 bio brzi i da gk104 nije mogao da se nosi sa njim, tesko da bi ovako nastupili...
 
Sve jedno bi prvo izbacili GK104 ,bio slabiji ili ne.Svakako je trebao da bude GTX660.
Obzirom da su imali prostora da mu koriguju klokove taman su podesili da bude dovoljno brzi od 7970
i samim tim su mu dodelili oznaku hi end karte..
 
Nazad
Vrh Dno