• Contact Us
  • Privacy Policy
  • Terms of Use
  • DMCA
  • Disclaimer
Saturday, September 21, 2024
CryptoBangs.com
Advertisement
  • Home
  • Live Crypto Prices
  • Crypto News
    • Bitcoin
    • Ethereum
    • Ripple
    • Altcoin
    • NFT News
  • DeFi
  • Blockchain
  • Regulation
  • Shop
  • Blog
  • Calculator
No Result
View All Result
  • Home
  • Live Crypto Prices
  • Crypto News
    • Bitcoin
    • Ethereum
    • Ripple
    • Altcoin
    • NFT News
  • DeFi
  • Blockchain
  • Regulation
  • Shop
  • Blog
  • Calculator
No Result
View All Result
CryptoBangs.com
No Result
View All Result

NVIDIA Unveils NCCL 2.22 with Enhanced Memory Efficiency and Faster Initialization

September 21, 2024
in Blockchain
Reading Time: 3 mins read
A A
NVIDIA Unveils NCCL 2.22 with Enhanced Memory Efficiency and Faster Initialization
ShareShareShareShareShare

Related articles

5 Best Cheap Cryptocurrencies to Buy Under 1 Dollar September 20 – Ondo, Cronos, The Graph, Jupiter

5 Best Cheap Cryptocurrencies to Buy Under 1 Dollar September 20 – Ondo, Cronos, The Graph, Jupiter

September 21, 2024
Top Trending Cryptos on Solana Chain Today – Dogs of Elon, CLYDE, World Liberty Financial

Top Trending Cryptos on Solana Chain Today – Dogs of Elon, CLYDE, World Liberty Financial

September 20, 2024


Caroline Bishop
Sep 21, 2024 13:38

NVIDIA introduces NCCL 2.22, focusing on memory efficiency, faster initialization, and cost estimation for improved HPC and AI applications.





The NVIDIA Collective Communications Library (NCCL) has released its latest version, NCCL 2.22, bringing significant enhancements aimed at optimizing memory usage, accelerating initialization times, and introducing a cost estimation API. These updates are crucial for high-performance computing (HPC) and artificial intelligence (AI) applications, according to the NVIDIA Technical Blog.

Release Highlights

NVIDIA Magnum IO NCCL is designed to optimize inter-GPU and multi-node communication, which is essential for efficient parallel computing. Key features of the NCCL 2.22 release include:

  • Lazy Connection Establishment: This feature delays the creation of connections until they are needed, significantly reducing GPU memory overhead.
  • New API for Cost Estimation: A new API helps optimize compute and communication overlap or research the NCCL cost model.
  • Optimizations for ncclCommInitRank: Redundant topology queries are eliminated, speeding up initialization by up to 90% for applications creating multiple communicators.
  • Support for Multiple Subnets with IB Router: Adds support for communication in jobs spanning multiple InfiniBand subnets, enabling larger DL training jobs.

Features in Detail

Lazy Connection Establishment

NCCL 2.22 introduces lazy connection establishment, which significantly reduces GPU memory usage by delaying the creation of connections until they are actually needed. This feature is particularly beneficial for applications that use a narrow scope, such as running the same algorithm repeatedly. The feature is enabled by default but can be disabled by setting NCCL_RUNTIME_CONNECT=0.

New Cost Model API

The new API, ncclGroupSimulateEnd, allows developers to estimate the time required for operations, aiding in the optimization of compute and communication overlap. While the estimates may not perfectly align with reality, they provide a useful guideline for performance tuning.

Initialization Optimizations

To minimize initialization overhead, the NCCL team has introduced several optimizations, including lazy connection establishment and intra-node topology fusion. These improvements can reduce ncclCommInitRank execution time by up to 90%, making it significantly faster for applications that create multiple communicators.

New Tuner Plugin Interface

The new tuner plugin interface (v3) provides a per-collective 2D cost table, reporting the estimated time needed for operations. This allows external tuners to optimize algorithm and protocol combinations for better performance.

Static Plugin Linking

For convenience and to avoid loading issues, NCCL 2.22 supports static linking of network or tuner plugins. Applications can specify this by setting NCCL_NET_PLUGIN or NCCL_TUNER_PLUGIN to STATIC_PLUGIN.

Group Semantics for Abort or Destroy

NCCL 2.22 introduces group semantics for ncclCommDestroy and ncclCommAbort, allowing multiple communicators to be destroyed simultaneously. This feature aims to prevent deadlocks and improve user experience.

IB Router Support

With this release, NCCL can operate across different InfiniBand subnets, enhancing communication for larger networks. The library automatically detects and establishes connections between endpoints on different subnets, using FLID for higher performance and adaptive routing.

Bug Fixes and Minor Updates

The NCCL 2.22 release also includes several bug fixes and minor updates:

  • Support for the allreduce tree algorithm on DGX Google Cloud.
  • Logging of NIC names in IB async errors.
  • Improved performance of registered send and receive operations.
  • Added infrastructure code for NVIDIA Trusted Computing Solutions.
  • Separate traffic class for IB and RoCE control messages to enable advanced QoS.
  • Support for PCI peer-to-peer communications across partitioned Broadcom PCI switches.

Summary

The NCCL 2.22 release introduces several significant features and optimizations aimed at improving performance and efficiency for HPC and AI applications. The improvements include a new tuner plugin interface, support for static linking of plugins, and enhanced group semantics to prevent deadlocks.

Image source: Shutterstock


Credit: Source link

ShareTweetSendPinShare
Previous Post

The Top 5 Metaverse Games of 2024

Next Post

Bitcoin Indicator Signals ‘Shift To Bullish Territory’ – Can BTC Break Past $65,000?

Related Posts

5 Best Cheap Cryptocurrencies to Buy Under 1 Dollar September 20 – Ondo, Cronos, The Graph, Jupiter

5 Best Cheap Cryptocurrencies to Buy Under 1 Dollar September 20 – Ondo, Cronos, The Graph, Jupiter

September 21, 2024

Join Our Telegram channel to stay up to date on breaking news coverage Amanda Tuminelli, Chief Legal Officer of the...

Top Trending Cryptos on Solana Chain Today – Dogs of Elon, CLYDE, World Liberty Financial

Top Trending Cryptos on Solana Chain Today – Dogs of Elon, CLYDE, World Liberty Financial

September 20, 2024

Join Our Telegram channel to stay up to date on breaking news coverage BNB Chain is collaborating with Telegram to...

Top Cryptocurrencies to Invest in Now September 19 – Sei, Algorand, Hedera

Top Cryptocurrencies to Invest in Now September 19 – Sei, Algorand, Hedera

September 20, 2024

Join Our Telegram channel to stay up to date on breaking news coverage The cryptocurrency market is developing significantly, with...

FINAL FANTASY XVI Launches on GeForce NOW, Expanding Cloud Gaming Offerings

FINAL FANTASY XVI Launches on GeForce NOW, Expanding Cloud Gaming Offerings

September 20, 2024

Peter Zhang Sep 20, 2024 01:37 FINAL FANTASY XVI and Frostpunk 2 headline seven new games...

SLB and NVIDIA Team Up to Enhance Energy Sector with Generative AI

SLB and NVIDIA Team Up to Enhance Energy Sector with Generative AI

September 19, 2024

Joerg Hiller Sep 19, 2024 18:00 SLB and NVIDIA collaborate to develop generative AI solutions for...

Load More
Next Post
Bitcoin Indicator Signals ‘Shift To Bullish Territory’ – Can BTC Break Past $65,000?

Bitcoin Indicator Signals ‘Shift To Bullish Territory’ – Can BTC Break Past $65,000?

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Bitcoin (BTC) to follow US stock market all-time high

Bitcoin (BTC) to follow US stock market all-time high

September 20, 2024
How To Become A Millionaire When DOGE Hits $1.2

How To Become A Millionaire When DOGE Hits $1.2

September 19, 2024
Best Crypto to Buy Right Now September 17 – Pendle, BNB, Litecoin

Best Crypto to Buy Right Now September 17 – Pendle, BNB, Litecoin

September 17, 2024
NVIDIA Showcases AI Security Innovations at Major Cybersecurity Conferences

NVIDIA Showcases AI Security Innovations at Major Cybersecurity Conferences

September 19, 2024
Top Cryptocurrencies to Invest in Now September 19 – Sei, Algorand, Hedera

Top Cryptocurrencies to Invest in Now September 19 – Sei, Algorand, Hedera

September 20, 2024
CryptoBangs.com

CryptoBangs.com is an online news portal that aims to share the latest crypto news, bitcoin, altcoin, blockchain, nft news and much more stuff like that.

What’s New Here!

  • Bitcoin Indicator Signals ‘Shift To Bullish Territory’ – Can BTC Break Past $65,000?
  • NVIDIA Unveils NCCL 2.22 with Enhanced Memory Efficiency and Faster Initialization
  • The Top 5 Metaverse Games of 2024
  • Ripple Whales Buy 390M XRP as Price Eyes $0.60 in October

Newsletter

Don't miss a beat and stay up to date with our Newsletter!
Loading

  • Contact Us
  • Privacy Policy
  • Terms of Use
  • DMCA
  • Disclaimer

© 2023 - CryptoBangs.com - All Rights Reserved!

No Result
View All Result
  • Home
  • Live Crypto Prices
  • Crypto News
    • Bitcoin
    • Ethereum
    • Ripple
    • Altcoin
    • NFT News
  • DeFi
  • Blockchain
  • Regulation
  • Shop
  • Blog
  • Calculator

© 2018 JNews by Jegtheme.

  • bitcoinBitcoin(BTC)$62,935.00-0.45%
  • ethereumEthereum(ETH)$2,545.223.47%
  • tetherTether(USDT)$1.00-0.03%
  • binancecoinBNB(BNB)$568.980.65%
  • solanaSolana(SOL)$145.792.00%
  • usd-coinUSDC(USDC)$1.00-0.06%
  • rippleXRP(XRP)$0.58-0.10%
  • staked-etherLido Staked Ether(STETH)$2,543.923.44%
  • dogecoinDogecoin(DOGE)$0.1049460.10%
  • the-open-networkToncoin(TON)$5.51-3.16%
  • tronTRON(TRX)$0.151776-0.27%
  • cardanoCardano(ADA)$0.352027-0.99%
  • avalanche-2Avalanche(AVAX)$27.250.14%
  • Wrapped stETHWrapped stETH(WSTETH)$2,998.343.48%
  • wrapped-bitcoinWrapped Bitcoin(WBTC)$62,831.00-0.45%
  • shiba-inuShiba Inu(SHIB)$0.0000141.15%
  • WETHWETH(WETH)$2,544.863.45%
  • chainlinkChainlink(LINK)$11.300.38%
  • bitcoin-cashBitcoin Cash(BCH)$334.08-1.79%
  • polkadotPolkadot(DOT)$4.330.81%
  • leo-tokenLEO Token(LEO)$5.780.59%
  • daiDai(DAI)$1.000.05%
  • uniswapUniswap(UNI)$6.780.91%
  • litecoinLitecoin(LTC)$65.18-0.54%
  • nearNEAR Protocol(NEAR)$4.340.16%
  • Wrapped eETHWrapped eETH(WEETH)$2,665.983.50%
  • kaspaKaspa(KAS)$0.169038-1.19%
  • fetch-aiArtificial Superintelligence Alliance(FET)$1.595.38%
  • suiSui(SUI)$1.483.32%
  • internet-computerInternet Computer(ICP)$8.353.42%
  • aptosAptos(APT)$7.297.92%
  • PepePepe(PEPE)$0.0000082.75%
  • moneroMonero(XMR)$175.14-0.61%
  • BittensorBittensor(TAO)$407.525.20%
  • First Digital USDFirst Digital USD(FDUSD)$1.00-0.16%
  • POL (ex-MATIC)POL (ex-MATIC)(POL)$0.3990210.07%
  • stellarStellar(XLM)$0.0963200.07%
  • ethereum-classicEthereum Classic(ETC)$18.991.20%
  • blockstackStacks(STX)$1.750.43%
  • Ethena USDeEthena USDe(USDE)$1.00-0.06%
  • immutable-xImmutable(IMX)$1.552.28%
  • okbOKB(OKB)$39.58-1.02%
  • crypto-com-chainCronos(CRO)$0.0855683.20%
  • aaveAave(AAVE)$151.861.43%
  • filecoinFilecoin(FIL)$3.751.34%
  • render-tokenRender(RENDER)$5.302.80%
  • arbitrumArbitrum(ARB)$0.572.37%
  • injective-protocolInjective(INJ)$20.740.64%
  • mantleMantle(MNT)$0.600.77%
  • optimismOptimism(OP)$1.652.83%
WP Twitter Auto Publish Powered By : XYZScripts.com