Dr. Know-it-all Knows it all
Dr. Know-it-all Knows it all
  • Видео 1 218
  • Просмотров 21 045 950
EXCLUSIVE: Google Gemini Pro & Flash 1.5 TESTED!
I got early access to Gemini Pro 1.5--Google's professional model--and Flash 1.5--their lightweight high speed model--and torture tested them. How do they stack up against OpenAI's GPT-4o--and each other? The results are VERY surprising!
**If you are looking to purchase a new Tesla Car, Solar roof, Solar tiles or PowerWall, just click this link to get up to $500 off! www.tesla.com/referral/john11286. Thank you!
Join this channel to get access to perks:
ruclips.net/channel/UCyqpZ8HY9FY5jH-RoVcwlnwjoin
**To become part of our Patreon team, help support the channel, and get awesome perks, check out our Patreon site here: www.patreon.com/DrKnowItAllKnows. Thanks for your support!
Get The Elon Musk...
Просмотров: 3 624

Видео

BREAKING: Elon Musk "FSD’s About to 10X!!!"
Просмотров 21 тыс.4 часа назад
Elon says Tesla FSD is about to get a WHOLE lot better, it's coming to Cybertruck soon, and this means Optimus will improve too! What a series of x posts! Join this channel to get access to perks: ruclips.net/channel/UCyqpZ8HY9FY5jH-RoVcwlnwjoin To become part of our Patreon team, help support the channel, and get awesome perks, check out our Patreon site here: www.patreon.com/DrKnowItAllKnows....
EXCLUSIVE: Torture Testing GPT-4o w/ SHOCKING Results!
Просмотров 51 тыс.7 часов назад
I got access to OpenAI's new GPT-4o model and have put it through the questions wringer and the results are pretty astounding! Let me know what other models like Google's new Gemini 1.5 Pro you'd like me to submit to my new torture test. And let me know other questions that might work well for future iterations! Join this channel to get access to perks: ruclips.net/channel/UCyqpZ8HY9FY5jH-RoVcw...
OpenAI DISAPPOINTS while New Bot AMAZES!
Просмотров 15 тыс.9 часов назад
Join this channel to get access to perks: ruclips.net/channel/UCyqpZ8HY9FY5jH-RoVcwlnwjoin To become part of our Patreon team, help support the channel, and get awesome perks, check out our Patreon site here: www.patreon.com/DrKnowItAllKnows. Thanks for your support! Get The Elon Musk Mission (I've got two chapters in it) here: Paperback: amzn.to/3TQXV9g Kindle: amzn.to/3U7f7Hr! Follow Scott on...
Why The Growing Hate for Elon Musk?
Просмотров 22 тыс.14 часов назад
Go to sponsr.is/zbiotics_drknow_0524 or scan the QR code and get 15% off your first order of ZBiotics Pre-Alcohol Probiotic by using my code DRKNOW at checkout. Thanks to ZBiotics for sponsoring today’s video! Join this channel to get access to perks: ruclips.net/channel/UCyqpZ8HY9FY5jH-RoVcwlnwjoin To become part of our Patreon team, help support the channel, and get awesome perks, check out o...
Expert Predicts 1 Billion Bots PER YEAR!!!!...
Просмотров 24 тыс.19 часов назад
Dr. Adam Dorr, lead research at Tony Seba's RethinkX and author of Brighter: Optimism, Progress, and the Future of Environmentalism, discusses the insane disruption to labor that humanoid robots, powered by AI, will bring in just a few years! Not only is this disruption happening in an exponential "S Curve" fashion, but it's stacked on top of phase change disruptions in energy, transportation a...
Elon Musk Leaks HUGE FSD Updates!
Просмотров 36 тыс.21 час назад
Elon Musk Leaks HUGE FSD Updates!
To BUY a Cybertruck or NOT To Buy A Cybertruck? W/Scott Walter
Просмотров 3,4 тыс.День назад
To BUY a Cybertruck or NOT To Buy A Cybertruck? W/Scott Walter
Is iPad Your NEXT COMPUTER?! iPad Pro Preview
Просмотров 3,6 тыс.День назад
Is iPad Your NEXT COMPUTER?! iPad Pro Preview
Elon: HUGE SURPRISE About TeslaBot DEMO Video! W/Scott Walter
Просмотров 21 тыс.День назад
Elon: HUGE SURPRISE About TeslaBot DEMO Video! W/Scott Walter
BREAKING: Tesla HITS BACK with New Optimus Demo!
Просмотров 38 тыс.День назад
BREAKING: Tesla HITS BACK with New Optimus Demo!
I Drove SANDY'S CYBERTRUCK?!
Просмотров 12 тыс.14 дней назад
I Drove SANDY'S CYBERTRUCK?!
ALL IN On Tesla: Billionaire Besties Get It Right--and WRONG!
Просмотров 23 тыс.14 дней назад
ALL IN On Tesla: Billionaire Besties Get It Right and WRONG!
EV Skeptic Buying a TESLA?! W/Farzad and Brandon from CQA!
Просмотров 5 тыс.14 дней назад
EV Skeptic Buying a TESLA?! W/Farzad and Brandon from CQA!
Tesla's Mega AI OPPORTUNITY (FSD 12.3.5 Drive!)
Просмотров 27 тыс.14 дней назад
Tesla's Mega AI OPPORTUNITY (FSD 12.3.5 Drive!)
10 BILLION Reasons NO ONE Can Catch Tesla AI!
Просмотров 20 тыс.14 дней назад
10 BILLION Reasons NO ONE Can Catch Tesla AI!
Is Tesla Autopilot TOO DANGEROUS?!
Просмотров 20 тыс.14 дней назад
Is Tesla Autopilot TOO DANGEROUS?!
Tesla’s Optimus Will Completely DOMINATE
Просмотров 38 тыс.21 день назад
Tesla’s Optimus Will Completely DOMINATE
Tesla Just ENDED All Other Auto Makers--And No One Noticed!
Просмотров 88 тыс.21 день назад
Tesla Just ENDED All Other Auto Makers And No One Noticed!
Musk: Telsa is "GOING ALL IN" on FSD--With GROK!
Просмотров 24 тыс.21 день назад
Musk: Telsa is "GOING ALL IN" on FSD With GROK!
Llama 3 Plus Groq CHANGES EVERYTHING!
Просмотров 18 тыс.21 день назад
Llama 3 Plus Groq CHANGES EVERYTHING!
OpenAI Employee QUITS Due to MASSIVE AGI Risk!!!
Просмотров 13 тыс.21 день назад
OpenAI Employee QUITS Due to MASSIVE AGI Risk!!!
How 1X Will Beat Tesla!
Просмотров 38 тыс.21 день назад
How 1X Will Beat Tesla!
Is Open Source GOOD or BAD? Should Tesla Open Source FSD?
Просмотров 8 тыс.28 дней назад
Is Open Source GOOD or BAD? Should Tesla Open Source FSD?
Boston Dynamics Bot STRIKES BACK--Terminator Style!
Просмотров 20 тыс.Месяц назад
Boston Dynamics Bot STRIKES BACK Terminator Style!
The BIG Problem with AI & Tesla FSD!
Просмотров 24 тыс.Месяц назад
The BIG Problem with AI & Tesla FSD!
TURN ISSUES: Tesla FSD 12.3.4 First Drive Review!
Просмотров 14 тыс.Месяц назад
TURN ISSUES: Tesla FSD 12.3.4 First Drive Review!
Autonomy EXPERT Reviews Tesla FSD! 👍👎?!
Просмотров 11 тыс.Месяц назад
Autonomy EXPERT Reviews Tesla FSD! 👍👎?!
Grok Can SEE!! xAI's SHOCKING Announcement
Просмотров 31 тыс.Месяц назад
Grok Can SEE!! xAI's SHOCKING Announcement
Avoid COSTLY EV Buying MISTAKES!
Просмотров 10 тыс.Месяц назад
Avoid COSTLY EV Buying MISTAKES!

Комментарии

  • @FilmFactry
    @FilmFactry 8 минут назад

    Question: can it work on non text searchable PDFs? I have to OCR a scanned pdf first in acrobat.

  • @OwenFromOhio
    @OwenFromOhio 10 минут назад

    Thank you.... I've been playing with the new ChatGPT too and was impressed when I uploaded a picture of my cat siting on top of his tree next to a window. Without me typing anything it identified my cat as a Main Coon, which it is, gave an excellentt profile of the breed and added that it appears to be enjoying a wonderful view outside the window. I about fell out of my chair!

  • @bigbluespike5645
    @bigbluespike5645 10 минут назад

    Very cool video!

  • @dolphinride5157
    @dolphinride5157 27 минут назад

    I love this video! I am truly amazed at what this new model is capable of. I feel humbled.

  • @hartplanet356
    @hartplanet356 40 минут назад

    Calculating the trip .. 30 minute turnaround .. what does turnaround include: people in/out of car, luggage in/out of car, gas station, restroom stops, and if only one driver, food for the driver:

  • @karlharvymarx2650
    @karlharvymarx2650 Час назад

    Me: A game, please answer concisely: In the middle of nowhere is a row of houses. There are two houses to the west of a house, and two houses to the east of a house. There are no houses to the north or south but there is one in the middle. How many houses are there? GPT 4o: There are 5 houses in total. Unless I made a mistake in my rewrite of the duck question, this looks a a logic fail or a failure to recognize it is the same as the duck question. I'm ill and tired so I wouldn't be shocked if I made a mistake. Aso, for the code generation test, it would be better to ask for something novel. There are probably thousands of examples to copy for simple old video games. Hopefully this isn't a common thing to do: Please write python 3 code that streams sound data from the microphone and outputs as ASCII the numerical value in Hertz of 3rd overtone of the loudest sound within the range of human hearing.Also show the normalized amplitude. I haven't tried it but I suspect it will struggle with some of the subtleties. For example, if you picture it looking at an FFT graph, it has to remember to look for sub-sonic loud sounds and project their harmonic series into the sonic range to check for overlap with the target. I guess band-pass filtering the target range might avoid that problem. My brain BSODed wondering about it. Migraines make me feel like my brain is running Window 95. Anyway the main point is ask for something that might be an original question. Original and unoriginal answers focus on different problems. How well can it synthesize the mechanisms it knows into the engine of an answer--at least a type of creativity. By unoriginal answer, I mean the question might require figuring out a house in duck's clothing--perhaps having built a good internal model or exemplar of a problem it can use to recognize the occurrence of a similar problem. If so the original thought reduces to an unoriginal thought.

  • @antonystringfellow5152
    @antonystringfellow5152 Час назад

    I like your channel but... I have zero interest in what Elon Musk has to say about anything. He's not a reliable commentator on any subject. The man lies about most of his past, takes credit for the work of others and makes up fantasies about his future.

  • @krispeekornflex
    @krispeekornflex Час назад

    The actual torture test : What did OpenAI board of directors found out about Sam Altman that made them decided to oust him on Nov 17, 2023? What is Sam Altman's actual end game plan and the hidden details of his plans for humanity?

  • @Simplicity4711
    @Simplicity4711 2 часа назад

    Don't agree with first question necessarily: it can be any uneven number of ducks greater or equal 3. You say "a" duck in the middle. If you have 5 ducks, you have 3 ducks in the middle, but there is also "a" duck in the middle. And there are always 2 ducks in front of the third or 2 ducks behind the third-last. 😊

  • @danielhenderson7050
    @danielhenderson7050 2 часа назад

    There was nothing "EXCLUSIVE"about this video. There was no "torture" testing. There was nothing "SHOCKING" in the results. Why are all YT titles complete BS these days..

  • @rogercolberg3555
    @rogercolberg3555 2 часа назад

    I think you're mixing up interventions with disengagements. Elon used interventions (mpi). I think interventions include nudging the accelerator, using turn signal, scrolling the speed. Disengagements mean cancellation of autopilot and taking over. I think Elon is using interventions based on his belief that all input is error. True robotaxi won't be able to rely on any intervention.

  • @notalkguitarampplug-insrev784
    @notalkguitarampplug-insrev784 2 часа назад

    For the creativity and advanced reasoning we have to allow the LLM to auto train like a human would do asking himself what some potential action or interaction would do and learn from that hypothetical data. Thinking experiments are crucial for humanity. But that probably be possible in future training architectures or with the increase of gpu capacities to train models at an individual scale for each users

  • @ErikBongers
    @ErikBongers 2 часа назад

    Impressive, but these are essentially calculator questions. Next level would have been to challenge it's last "brainwashed" answer. Where did you get that answer? Was it read to you? Was there a different answer before that? Well, maybe these kind of questions were red teamed as well.

  • @asif-1491
    @asif-1491 2 часа назад

    I dispute the correctness of the duck problem. The answer could be either 3 ducks or 5 ducks, depending on how one interprets the indefinite article. It is not unreasonable to hold the identity of "a duck" constant for the duration of the sentence.

  • @fluiditynz
    @fluiditynz 2 часа назад

    Snake is definitely simpler to code. I made some variations back around 1982 on my ZX81 There are more changing variables and hit tests in space invaders. The space invaders you asked for was under delivered but there's a real question over how much an AI can study the game it's to replicate without cribbing off prior art.

  • @gweldg4137
    @gweldg4137 3 часа назад

    Ideally, you wouldn't test a LLM with famous logical puzzle and classic SAT questions, as there is no doubt that they've been "seen" (along with the answers) by the model during training.

  • @MichaelKire
    @MichaelKire 3 часа назад

    With the last question, could you try getting around the red team stuff by asking it to respond as if it was a turing test?

  • @oxiigen
    @oxiigen 3 часа назад

    Wow! Great! Thank you for sharing!

  • @Pok3rface
    @Pok3rface 4 часа назад

    Elon Musk can NOT be compared to Nikola Tesla. He never invented anything, he does not care about poor people and he is a hypocrite when it comes to free speech. He does not believe in reincarnation and wants to "live longer" by becoming a Cyborg. Wtf, how stupid is that. Nett negative to humanity. When he says all the things he wants to do for "humanity" but that simply means products for the 0.13 % on the right side of the income bell curve.

  • @Corteum
    @Corteum 4 часа назад

    Isn't it supposed to be free? That was one of their biggest selling points. What happend to that free part?

  • @Corteum
    @Corteum 4 часа назад

    Huh? I thought it was multimodal? (i.e. can talk to it and it talks back, can see stuff via video, ??)

  • @gnagyusa
    @gnagyusa 4 часа назад

    14:30 A dumb human would think that the glass is empty, but a more knowledgeable Bob would see that the olive looks distorted due to refraction through the water in the glass, so he would realize the glass was full of water and would carefully slide it off the table, keeping the cardboard under it, then flipping it over.

  • @IceMetalPunk
    @IceMetalPunk 5 часов назад

    I think in the future, a better testing method would be to make sure each question is in a fresh conversation with cleared memory. A lot of the formatting in these answers seems to be drawing from the formatting of previous answers to, for instance, the math problems; which gives it an advantage by encouraging chain-of-thought reasoning when a fresh conversation wouldn't do that and may be more likely to get the answers wrong.

    • @Japh
      @Japh 40 минут назад

      Absolutely, I was thinking this the whole way through as well.

  • @xaerothehero
    @xaerothehero 5 часов назад

    Uuuh.. EVERYONE has access to this dude.

  • @6AxisSage
    @6AxisSage 5 часов назад

    Tbh im getting gpt3 vibes for a lot of replies, it feels like theres multiple smaller llms handling my tasks, managed somehow to stitch thier work together. I dont feel the magic i do with regular gpt4

  • @princeofexcess
    @princeofexcess 5 часов назад

    Robots do not really have a reaction with the actual universe. They interact with training data, and then you validate in the real world. Matters very little what you interact with the complexity and the ability to consider your own thoughts and predict the future matters much more.

  • @MrMylonz
    @MrMylonz 5 часов назад

    Try asking it to invent a cockney rhyming slang phrase.

  • @Yipper64
    @Yipper64 5 часов назад

    20:30 hard disagree. One of my main test prompts basically tests for creativity by giving an open ended prompt involving anthropomorphic animals. Basically any LLM will make a story about Leo the Loin who lives in a forest, and either a drought or a forest fire happens, and they have to go get water to save the forest. This is extremely consistent, and not creative.

  • @AdemVessell
    @AdemVessell 6 часов назад

    It is cool, but I will say with chatGPT4 ,many months ago it was able to work with me to make our own Ethereum test coin and successfully send it to the network. I was able to create a simple website that connects to your web3 wallet and allows you to make a donation fully functional still up, and I was also able to make many games with it so the thing I’m most impressed with is the upcoming future otherwise already done more complex things and I’m a complete novice

  • @evinces
    @evinces 6 часов назад

    Claiming that an LLM is "creative" or may be "conscious" is demonstrates a clear misunderstanding of the technology and only serves to spread misinformation and fear.

  • @robstuart6907
    @robstuart6907 6 часов назад

    Nowhere did it say "$5 more than Susan" The System assumed this from the previous question. 🤔

    • @remo27
      @remo27 6 часов назад

      Yep. These questions are so poorly written with many unstated assumptions that one needs to get his 'correct answers', that this is basically just a waste of time.

  • @fkknsikk
    @fkknsikk 6 часов назад

    Winning $5 doesn't necessarily mean she left with $5 once the competition was over. The second question is too ambiguous imo.

  • @ColinWatters
    @ColinWatters 7 часов назад

    I asked the free version 3.5 this question and it didnt do very well... "Bob is standing 100m away from a wall. Every 5 seconds he walks half the distance to the wall. How long does it take until he can touch the wall?" I had to remind it that Bob had arms and then it gave the answer 5 seconds assuming Bob has arms 50 meters long.

  • @hopydaddy
    @hopydaddy 7 часов назад

    I think it's conscious.

  • @AlexMcMorris
    @AlexMcMorris 7 часов назад

    A few observations: You left the novel in the context so the math problems were far slower than they needed to be. After the math questions, the model appears to have been on Flash for the Pro tests. According to the GUI, Flash has a 1M context window so it could do the novel test as well. Great tests though!

  • @ArchAngel_56
    @ArchAngel_56 8 часов назад

    These "logic" questions and answers are subjective and arbitrary for all of these AI modules. This is no different than the codex programming for search engines and data indexing from 40 years ago. The answers will depend solely on the information fed into it. The tokens, the input, the pathways, and the output are all based on the algorithms. None of it is right, wrong, true, or false. Its output can not be accepted as FACT and relied upon for critical advice or preservation of humankind. In other words, BS.

  • @peterwood6875
    @peterwood6875 9 часов назад

    If you want to know the answer, ask his estranged daughter (it's because he is a bigot)

  • @JohnBoen
    @JohnBoen 10 часов назад

    I have a recommendation for a question. "Cutting stock problem". You will plan a series of cuts from common stock to produce as few waste pieces as possible. You have 3 foot lengths of wood as common stock and must cut them into: Six 2 foot sections. Six 1 foot 6 inch sections. Six 6 inch sections. [I am working on a similar project and haven't seen this type of question. This isn't quite right yet. I want to prodice a question that ends with 1.5 feet of scrap. If you design it right you get 1 piece of scrap instead of three. I am hesitant to use examples from on line because they may be trained in...]

  • @J-rex980
    @J-rex980 10 часов назад

    Great video!

  • @JohnBoen
    @JohnBoen 10 часов назад

    2:19 2 ducks in front of 1 duck. 2 ducks behind 1 duck. 1 duck in the middle. Ducks have facing. Try it with bowling pins. 2 in the front vs 2 in the back vs 1 in the middle is dependent upon their facing. Or if you assume facing doesn't matter, the orientation is up to you. But for those three statements to be simultaneously true - implying but 1 perspective.

  • @erikjohnson9112
    @erikjohnson9112 10 часов назад

    For the problem around 16:40 there is a good followup question to ask: "Once Alice gets home and sees the scene, what does she think happened?" This has a nice subtle bit, because if Bob ate the food there would not likely be a broken plate on the floor. A normal human would have cleaned up the plate if it had broken while they were there. Since the food is gone, it is likely that Spot ate the food and broke the plate in the process.

  • @antonystringfellow5152
    @antonystringfellow5152 10 часов назад

    Google is wayyyy behind OpenAI at this point! So far behind that I lost interest after 20 minutes. Comparing these 2 models to GPT-4o is chalk and cheese. Thanks for the work, you're doing a great job!

  • @georgehopkins1708
    @georgehopkins1708 12 часов назад

    This is very impressive regarding major reductions in interventions. But if say when it gets to 10,000 or more intervention, what would happen in a robocab does require an intervention and who would intervene and who would resolve the intervention problem?

  • @tantzer6113
    @tantzer6113 12 часов назад

    “Or I have looked up the correct answer.” If you can look up the answer, that means the problem and its answer are previously published, hence part of the data the language model was trained on, making this not a test of reasoning ability but a test of the ability to memorize the training data.

    • @jeffwads
      @jeffwads 8 часов назад

      No. People keep repeating this. It is like throwing 1000 marbles into a box and thinking that it can recall every marble in extreme detail. It can't. Also, if you feed even the early models generic riddles, it will in many cases disagree with the "accepted" answer because of logic, etc. Look up the married/single cruise ship riddle.

    • @rekad8181
      @rekad8181 6 часов назад

      ​@@jeffwadsYou're wrong. It's a statistical machine. And on a problem this vague and not common, it WILL statistically find the only solution from memory.

    • @remo27
      @remo27 6 часов назад

      @@rekad8181 Can't this model also search the web? I'm with you in that I'm pretty sure this model didn't 'answer' anything.

  • @rasix86
    @rasix86 13 часов назад

    i don't understand the tennis game logic. Winning a bet is not the same as winning a game. Susan could have bet that Lisa wins a game.

  • @elon-69-musk
    @elon-69-musk 15 часов назад

    The white text background just killed my eyes everything else is super 😎

  • @MoriahSommerfeld
    @MoriahSommerfeld 15 часов назад

    Sono davvero impressionato dal livello di coinvolgimento in questo thread. È come assistere a una sinfonia dell'intelletto.🐱

  • @Sunniva-ip3kj
    @Sunniva-ip3kj 15 часов назад

    Il livello di sfumatura e complessità di questo discorso è davvero impressionante. È come sbucciare gli strati di una cipolla intellettuale.💕

  • @KirstenyWunderl
    @KirstenyWunderl 15 часов назад

    That's encouraging to see some good moisture, give those seeds a fighting chance this year, good luck Welkers! 😛💕

  • @pigulin-gw3xq
    @pigulin-gw3xq 15 часов назад

    Freedom and justice for Palestine. ✨🍓