• Contact Us
  • Privacy Policy
  • Terms of Use
  • DMCA
  • Disclaimer
Thursday, July 18, 2024
CryptoBangs.com
Advertisement
  • Home
  • Live Crypto Prices
  • Crypto News
    • Bitcoin
    • Ethereum
    • Ripple
    • Altcoin
    • NFT News
  • DeFi
  • Blockchain
  • Regulation
  • Shop
  • Blog
  • Calculator
No Result
View All Result
  • Home
  • Live Crypto Prices
  • Crypto News
    • Bitcoin
    • Ethereum
    • Ripple
    • Altcoin
    • NFT News
  • DeFi
  • Blockchain
  • Regulation
  • Shop
  • Blog
  • Calculator
No Result
View All Result
CryptoBangs.com
No Result
View All Result

NVIDIA Unveils Generative AI-Powered Visual AI Agents for Edge Deployment

July 17, 2024
in Blockchain
Reading Time: 3 mins read
A A
NVIDIA Unveils Generative AI-Powered Visual AI Agents for Edge Deployment
ShareShareShareShareShare

Related articles

Cardano Founder Offers Free Solution To Help X (Twitter) Combat Hacks

Cardano Founder Offers Free Solution To Help X (Twitter) Combat Hacks

July 17, 2024
Bitfinex Derivatives Launches Public Beta Integration with Thalex

Bitfinex Derivatives Launches Public Beta Integration with Thalex

July 17, 2024


Timothy Morano
Jul 17, 2024 18:22

NVIDIA introduces Vision Language Models (VLMs) for dynamic video analysis, enhancing AI capabilities at the edge with Jetson Orin platform.





An exciting breakthrough in AI technology—Vision Language Models (VLMs)—offers a more dynamic and flexible method for video analysis, according to NVIDIA Technical Blog. VLMs enable users to interact with image and video input using natural language, making the technology more accessible and adaptable. These models can run on the NVIDIA Jetson Orin edge AI platform or discrete GPUs through NIMs.

What is a Visual AI Agent?

A visual AI agent is powered by a VLM where users can ask a broad range of questions in natural language and get insights that reflect true intent and context in a recorded or live video. These agents can be interacted with through easy-to-use REST APIs and integrated with other services and mobile apps. This new generation of visual AI agents helps to summarize scenes, create a wide range of alerts, and extract actionable insights from videos using natural language.

NVIDIA Metropolis brings visual AI agent workflows, which are reference solutions that accelerate the development of AI applications powered by VLMs, to extract insights with contextual understanding from videos, whether deployed at the edge or cloud.

For cloud deployment, developers can use NVIDIA NIM, a set of inference microservices that include industry-standard APIs, domain-specific code, optimized inference engines, and enterprise runtime, to power the visual AI Agents. Get started by visiting the API catalog to explore and try the foundation models directly from a browser.

Building Visual AI Agents for the Edge

Jetson Platform Services is a suite of prebuilt microservices that provide essential out-of-the-box functionality for building computer vision solutions on NVIDIA Jetson Orin. Included in these microservices are AI services with support for generative AI models such as zero-shot detection and state-of-the-art VLMs. VLMs combine a large language model with a vision transformer, enabling complex reasoning on text and visual input.

The VLM of choice on Jetson is VILA, given its state-of-the-art reasoning capabilities and speed by optimizing the tokens per image. By combining VLMs with Jetson Platform Services, a VLM-based visual AI agent application can be created that detects events on a live-streaming camera and sends notifications to the user through a mobile app.

Integration with Mobile App

The full end-to-end system can now integrate with a mobile app to build the VLM-powered Visual AI Agent. To get video input for the VLM, the Jetson Platform Services networking service and VST automatically discover and serve IP cameras connected to the network. These are made available to the VLM service and mobile app through the VST REST APIs.

From the app, users can set custom alerts in natural language such as “Is there a fire” on their selected live stream. Once the alert rules are set, the VLM will evaluate the live stream and notify the user in real-time through a WebSocket connected to the mobile app. This will trigger a popup notification on the mobile device, allowing users to ask follow-up questions in chat mode.

Conclusion

This development highlights the potential of VLMs combined with Jetson Platform Services to build advanced Visual AI Agents. The full source code for the VLM AI service is available on GitHub, providing a reference for developers to learn how to use VLMs and build their own microservices.

For more information, visit the NVIDIA Technical Blog.

Image source: Shutterstock


Credit: Source link

ShareTweetSendPinShare
Previous Post

Ripple Up 40% as XRP Rally Reignites $1 Hopes

Next Post

$25 Monthly Since Launch is Worth $34 Million Today

Related Posts

Cardano Founder Offers Free Solution To Help X (Twitter) Combat Hacks

Cardano Founder Offers Free Solution To Help X (Twitter) Combat Hacks

July 17, 2024

Charles Hoskinson asked Elon Musk to allow the IOHK developer team to integrate DIDs in X’s software to prohibit hacking...

Bitfinex Derivatives Launches Public Beta Integration with Thalex

Bitfinex Derivatives Launches Public Beta Integration with Thalex

July 17, 2024

Felix Pinkston Jul 17, 2024 10:57 Bitfinex Derivatives has announced the public beta integration with Thalex,...

Bitcoin Price Prediction: BTC Breaks $65K As Craig Wright Admits He’s Not Satoshi Nakamoto And Investors Flock To This Learn-To-Earn Crypto For Its 696% APY

Bitcoin Price Prediction: BTC Breaks $65K As Craig Wright Admits He’s Not Satoshi Nakamoto And Investors Flock To This Learn-To-Earn Crypto For Its 696% APY

July 17, 2024

Join Our Telegram channel to stay up to date on breaking news coverage The Bitcoin price soared 2% in the...

Injective’s First Builder House Event Garners Industry Acclaim

Injective’s First Builder House Event Garners Industry Acclaim

July 17, 2024

Luisa Crawford Jul 17, 2024 02:10 The inaugural Injective (INJ)Builder House in Brussels during EthCC united...

5 Best Cheap Crypto to Buy Now Under 1 Dollar July 16 – SuperVerse, Bubba, Ankr Network

5 Best Cheap Crypto to Buy Now Under 1 Dollar July 16 – SuperVerse, Bubba, Ankr Network

July 16, 2024

Join Our Telegram channel to stay up to date on breaking news coverage The search for the best cheap crypto...

Load More
Next Post
$25 Monthly Since Launch is Worth $34 Million Today

$25 Monthly Since Launch is Worth $34 Million Today

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Will It Happen Before 2030?

Will It Happen Before 2030?

July 12, 2024
Highest Price SHIB Will Trade in 2024

Highest Price SHIB Will Trade in 2024

July 17, 2024
Is Pepe Heading Toward A Crash?

Is Pepe Heading Toward A Crash?

July 12, 2024
Azarus Powers Evo’s Interactive Live Streams for Massive Esports Competition

Azarus Powers Evo’s Interactive Live Streams for Massive Esports Competition

July 17, 2024
Uganda Presents Purchase Plan to Return to the Gold Standard

Uganda Presents Purchase Plan to Return to the Gold Standard

July 16, 2024
CryptoBangs.com

CryptoBangs.com is an online news portal that aims to share the latest crypto news, bitcoin, altcoin, blockchain, nft news and much more stuff like that.

What’s New Here!

  • The Rise of a New Meme Coin
  • Putin warns of power shortages from Bitcoin mining, calls for expansion of CBDC
  • DTX Affiliate Program Going Live in August, Pushing Presale to $950K While Bullish Dogwifhat and Dogecoin News
  • $25 Monthly Since Launch is Worth $34 Million Today

Newsletter

Don't miss a beat and stay up to date with our Newsletter!
Loading

  • Contact Us
  • Privacy Policy
  • Terms of Use
  • DMCA
  • Disclaimer

© 2023 - CryptoBangs.com - All Rights Reserved!

No Result
View All Result
  • Home
  • Live Crypto Prices
  • Crypto News
    • Bitcoin
    • Ethereum
    • Ripple
    • Altcoin
    • NFT News
  • DeFi
  • Blockchain
  • Regulation
  • Shop
  • Blog
  • Calculator

© 2018 JNews by Jegtheme.

  • bitcoinBitcoin(BTC)$64,887.002.04%
  • ethereumEthereum(ETH)$3,454.721.61%
  • tetherTether(USDT)$1.000.02%
  • binancecoinBNB(BNB)$575.931.39%
  • solanaSolana(SOL)$160.763.26%
  • rippleXRP(XRP)$0.616.97%
  • usd-coinUSDC(USDC)$1.000.07%
  • staked-etherLido Staked Ether(STETH)$3,456.111.51%
  • the-open-networkToncoin(TON)$7.26-0.77%
  • dogecoinDogecoin(DOGE)$0.1247192.02%
  • cardanoCardano(ADA)$0.4537143.49%
  • tronTRON(TRX)$0.1346640.83%
  • shiba-inuShiba Inu(SHIB)$0.0000191.62%
  • avalanche-2Avalanche(AVAX)$28.253.13%
  • wrapped-bitcoinWrapped Bitcoin(WBTC)$64,971.002.19%
  • polkadotPolkadot(DOT)$6.482.71%
  • chainlinkChainlink(LINK)$14.371.88%
  • bitcoin-cashBitcoin Cash(BCH)$380.74-0.46%
  • nearNEAR Protocol(NEAR)$6.171.40%
  • uniswapUniswap(UNI)$8.150.96%
  • litecoinLitecoin(LTC)$73.01-0.06%
  • leo-tokenLEO Token(LEO)$5.830.04%
  • daiDai(DAI)$1.00-0.32%
  • matic-networkPolygon(MATIC)$0.553.74%
  • Wrapped eETHWrapped eETH(WEETH)$3,610.021.95%
  • PepePepe(PEPE)$0.0000121.36%
  • internet-computerInternet Computer(ICP)$10.045.26%
  • kaspaKaspa(KAS)$0.177558-0.31%
  • fetch-aiArtificial Superintelligence Alliance(FET)$1.5310.95%
  • ethereum-classicEthereum Classic(ETC)$23.601.08%
  • aptosAptos(APT)$7.265.93%
  • Ethena USDeEthena USDe(USDE)$1.00-0.59%
  • stellarStellar(XLM)$0.1100273.91%
  • moneroMonero(XMR)$161.511.30%
  • blockstackStacks(STX)$1.945.84%
  • hedera-hashgraphHedera(HBAR)$0.0777922.53%
  • makerMaker(MKR)$2,915.030.33%
  • filecoinFilecoin(FIL)$4.666.56%
  • render-tokenRender(RNDR)$6.805.70%
  • vechainVeChain(VET)$0.0321111.68%
  • okbOKB(OKB)$43.011.94%
  • cosmosCosmos Hub(ATOM)$6.612.49%
  • mantleMantle(MNT)$0.783.15%
  • injective-protocolInjective(INJ)$26.0312.08%
  • crypto-com-chainCronos(CRO)$0.0941830.36%
  • immutable-xImmutable(IMX)$1.6111.66%
  • arbitrumArbitrum(ARB)$0.762.92%
  • BittensorBittensor(TAO)$323.0811.23%
  • suiSui(SUI)$0.884.46%
  • dogwifhatdogwifhat(WIF)$2.18-0.08%
WP Twitter Auto Publish Powered By : XYZScripts.com