Technologyinternetadminbrad - TECHNOLOGY INTERNET ADMIN BRAD

technologyinternetadminbrad - TECHNOLOGY INTERNET ADMIN BRAD

technologyinternetadminbrad - TECHNOLOGY INTERNET ADMIN BRAD

technologyinternetadminbrad - TECHNOLOGY INTERNET ADMIN BRAD

technologyinternetadminbrad - TECHNOLOGY INTERNET ADMIN BRAD

More Posts from Technologyinternetadminbrad and Others

LMM Large Multimodal Models: Beyond Text And Images

LMM Large Multimodal Models: Beyond Text And Images

Multimodal AI

Digital assistants can learn more about you and the environment around you by utilizing multimodal AI, which gains even more power when it can operate on your device and processes various inputs such as text, photos, and video.

Large Multimodal Models(LMM)

Even with its infinite intelligence, generative artificial intelligence (AI) can only do so much because of how well it perceives its environment. Large multimodal models (LMMs) are able to examine text, photos, videos, radio frequency data, and even voice searches in order to offer more precise and pertinent responses.

It’s an important step in the development of generative AI after the widely used Large Language Models (LLMs), which included the ChatGPT initial model, which could only process text. Your PC, smartphone, and productivity apps will all benefit greatly from this improved ability to comprehend what you see and hear. Digital assistants and productivity tools will also become much more helpful. And the procedure will be quicker, more private, and power-efficient if the device can manage these processes.

LLaVA: Large Language and Vision Assistant

Qualcomm Technologies is dedicated on making multimodal AI available on devices. Large Language and Vision Assistant (LLaVA), a community-driven LMM with over seven billion parameters, was initially demonstrated by us back in February on an Android phone powered by the Snapdragon 8 Gen 3 Mobile Platform. In this demonstration, the phone could “recognize” images, such as a dish of fruits and vegetables or a dog in an open environment, and carry on a conversation with them. One may ask to have a recipe made with the things on the platter, or they could ask for an estimate of how many calories the recipe will include overall. Take a look at it:

The AI of the future is multimodal

Multimodal AI 2024

Given the increased clamor surrounding multimodal, this work is crucial. Microsoft unveiled the Phi-3.5 family of devices last week, which offers visual and multilingual support. This came after Google touted LMMs during its Made by Google event, wherein the multimodal input model Gemini Nano was unveiled. GPT-4 Omni, an original multimodal model from OpenAI, was unveiled in May. This comes after comparable research from Meta and community-developed models like LLaVA.

When combined, these developments show the direction that artificial intelligence is taking. It goes beyond simply having you type questions at a prompt. Qualcomm’s goal is to make these AI experiences available on billions of phones worldwide.

Qualcomm Technologies is collaborating with Google to enable the next generation of Gemini on Snapdragon, and it is working with a wide range of firms that are producing LMMs and LLMs, such as Meta’s Llama series. With the help of their partners, these models operate seamlessly on Snapdragon, and they can’t wait to surprise customers with more on-device AI features this year and the next.

While an Android phone is a great place to start when utilizing multimodal inputs, other categories will soon reap the benefits as well. For example, smart glasses that can scan your food and provide nutritional information, or cars that can comprehend your voice commands and help you while driving, are just a few examples of how multimodal inputs will benefit you.

Numerous difficult jobs can be completed via multimodal AI

These are just the beginning for multimodal AI, which may use a mix of cameras, microphones, and vehicle sensors to identify disinterested passengers in the back of an automobile and provide entertaining activities to pass the time. Additionally, it might make it possible for smart glasses to identify exercise equipment at a health club and generate a personalized training schedule for you.

The precision facilitated by multimodal AI will be important in aiding a field technician to diagnose problems with your household appliances or in guiding a farmer to pinpoint the root cause of crop-related problems.

The concept is that by utilizing cameras, microphones, and other sensors, these devices which start with phones, PCs, automobiles, and smart glasses can enable the AI assistant to “see” and “hear” in order to provide more insightful contextual responses.

The significance of on a device

Your phone or car must have sufficient processing capacity to handle those requests in order for all those added capabilities to function optimally. Since the battery on your phone must last the entire day, trillions of operations must occur quickly and effectively when using it. By using the device, you can avoid waiting for servers to react when they are too busy to ping the cloud. They’re also more private because you keep your device and the answers you receive with you.

That has been Qualcomm Technologies’ top concern. Handsets can handle a lot of processing on the phone itself because to the Snapdragon 8 Gen 3 processor’s Hexagon NPU. Likewise, the Snapdragon X Elite and Snapdragon X Plus Platforms enable more than 20 Copilot+ PCs on the market today to manage complex AI functions on the device.

Read more on govindhtech.com

I wish non-tech people would get into open source. You should post your crochet pattern on github. I want to see your wip novel on a webpage running Wikimedia.

Really awesome command-line programs I've discovered on GitHub

I'm a data hoarder, so all of these have to do with downloading and saving media. I also use a Mac, so all of these are MacOS-compliant via Homebrew. If you also use a Mac (or Linux), do yourself a favor and install Homebrew via Terminal right now- all you need to do is copy-paste a line of code into Terminal and it will open you up to tons of awesome programs that normally only run on Windows.

Anyway, onto the list!

Yt-dlp (Youtube/Video downloader): On top of letting you rip Youtube videos directly from the site, this program supports a huge array of other video/media websites. The program is highly customizable as well; I would highly recommend at least installing FFmpeg, which allows you to download videos in quality higher than the default 720p. Here's a guide on how to do that for Windows and I wrote a guide here for Mac (I forgot to write in the guide that you should install FFmpeg via Homebrew).

Mangadex-dl (Mangadex downloader): Allows you to download manga directly from Mangadex, the hub of scanlation. Like yt-dlp it is customizable and you can pick which chapters you want to download (useful if you only want to download current chapters you haven't gotten before).

Gallery-dl (Bulk image downloading): A godsend for an art-hoarder like me, this program allows you to bulk download things like Pixiv pages, Twitter galleries, Deviantart galleries, Instagram pages, etc. Like yt-dlp it is highly customizable. Some websites (like Pixiv) may require user authentication; the GitHub page outlines the steps each authentication process requires.

List of other command line programs you might find interesting on GitHub

I'm sure I'll add to this list as I find more cool stuff on GitHub!

VOTER REGISTRATION


Tags
  • timetransportersbuiltbyoneperson
    timetransportersbuiltbyoneperson reblogged this · 8 months ago
  • 3wives1setofidentitydocuments
    3wives1setofidentitydocuments reblogged this · 8 months ago
  • technologyinternetadminbrad
    technologyinternetadminbrad reblogged this · 9 months ago
  • technologyinternetadminbrad
    technologyinternetadminbrad reblogged this · 9 months ago
  • machinesforinternationalbusiness
    machinesforinternationalbusiness reblogged this · 9 months ago
  • securityforcesshould
    securityforcesshould reblogged this · 9 months ago
  • cellphoneradiotowerswhitetailln
    cellphoneradiotowerswhitetailln reblogged this · 9 months ago
  • samanthasaintclarissajoanhartpr
    samanthasaintclarissajoanhartpr reblogged this · 9 months ago
  • hermanlowelillyrobertchamberlain
    hermanlowelillyrobertchamberlain reblogged this · 9 months ago
  • bonjovisaddlecrestbabynameearly
    bonjovisaddlecrestbabynameearly reblogged this · 9 months ago
  • stormtroopertakinganaltoearncum
    stormtroopertakinganaltoearncum reblogged this · 9 months ago
  • analsquaremilitaryrankinsigniaaf
    analsquaremilitaryrankinsigniaaf reblogged this · 9 months ago
  • dumbsofsuperficialtextonlysearch
    dumbsofsuperficialtextonlysearch reblogged this · 9 months ago
  • tertiarypredatorlinkageforspirit
    tertiarypredatorlinkageforspirit reblogged this · 9 months ago
  • tertiaryancestrylinksforspirit
    tertiaryancestrylinksforspirit reblogged this · 9 months ago
  • tertiarydietarylinkagesforspirit
    tertiarydietarylinkagesforspirit reblogged this · 9 months ago
  • tertiarysexuallinkagesforspirit
    tertiarysexuallinkagesforspirit reblogged this · 9 months ago
  • eightoverfifteenforpiesquarer5
    eightoverfifteenforpiesquarer5 reblogged this · 9 months ago
  • handjobbehmbasementmathematics
    handjobbehmbasementmathematics reblogged this · 9 months ago
  • moscowbeijingpyongyangintel
    moscowbeijingpyongyangintel reblogged this · 9 months ago
  • blogyourneuralsforbackupycollab
    blogyourneuralsforbackupycollab reblogged this · 9 months ago
  • machinelearninggoogleaicopilot
    machinelearninggoogleaicopilot reblogged this · 9 months ago
  • llamamilitaryintelligencegoogle
    llamamilitaryintelligencegoogle reblogged this · 9 months ago
  • homemaderadiationdetectorkits
    homemaderadiationdetectorkits reblogged this · 9 months ago
  • walkedbelowdeckussdallasactiveor
    walkedbelowdeckussdallasactiveor reblogged this · 9 months ago
  • llamaartificialintelligencelearn
    llamaartificialintelligencelearn reblogged this · 9 months ago
  • askforbehmforaccesscontactpitt
    askforbehmforaccesscontactpitt reblogged this · 9 months ago
  • hearingthingsabovetheirlevel
    hearingthingsabovetheirlevel reblogged this · 9 months ago
  • vimedforalltimenorthsacramento
    vimedforalltimenorthsacramento reblogged this · 9 months ago
  • startrekenterprisetelevisionshow
    startrekenterprisetelevisionshow reblogged this · 9 months ago
  • sexcrimesagainstbradleygeigermom
    sexcrimesagainstbradleygeigermom reblogged this · 9 months ago
  • stanandfrancinesmithandroger
    stanandfrancinesmithandroger reblogged this · 9 months ago
  • technologiessheridanwyoming
    technologiessheridanwyoming reblogged this · 9 months ago
  • walessovutube
    walessovutube reblogged this · 9 months ago
  • princeofwmusicyoutube
    princeofwmusicyoutube reblogged this · 9 months ago
  • britisharmymilitaryintelligence
    britisharmymilitaryintelligence reblogged this · 9 months ago
  • whitetailbathroomtoilets
    whitetailbathroomtoilets reblogged this · 9 months ago
  • foxwolfobriencooper
    foxwolfobriencooper reblogged this · 9 months ago
  • hatersusingsensoryreplacement
    hatersusingsensoryreplacement reblogged this · 9 months ago
  • googlecopilotbinggemini
    googlecopilotbinggemini reblogged this · 9 months ago
  • psychiatricmentalhealthfacility
    psychiatricmentalhealthfacility reblogged this · 9 months ago
  • donaldtrump2024news
    donaldtrump2024news reblogged this · 9 months ago
  • sexcrimesagainstancestorsofearth
    sexcrimesagainstancestorsofearth reblogged this · 9 months ago
  • searchslavecollarsbottomofocean
    searchslavecollarsbottomofocean reblogged this · 9 months ago
  • sexualcrimesbradgeigerancestors
    sexualcrimesbradgeigerancestors reblogged this · 9 months ago
  • fabergeeggsfabrikastarshini
    fabergeeggsfabrikastarshini reblogged this · 9 months ago
  • onlyfanscollarviolinplayerdotcom
    onlyfanscollarviolinplayerdotcom reblogged this · 9 months ago
  • passportsfortraveltoplanetearth
    passportsfortraveltoplanetearth reblogged this · 9 months ago
  • technologiessheridanwyoming
    technologiessheridanwyoming reblogged this · 9 months ago
  • intelcoreultracomputerprocessors
    intelcoreultracomputerprocessors reblogged this · 9 months ago
technologyinternetadminbrad - TECHNOLOGY INTERNET ADMIN BRAD
TECHNOLOGY INTERNET ADMIN BRAD

ADMINISTRATOR

246 posts

Explore Tumblr Blog
Search Through Tumblr Tags