Those of you who have watched the live stream, do come over and let's talk about how incredible the demo seemed. And also about how much of it would translate into actual usable stuff. And also what this means for accessibility and Assistive AI, if I may. And those of you who haven't, do go watch the stream. It's incredible! (in all caps)
Comments
Ah.
Yeah, I thought that might be the case.
So I'd need to pay if I wanted to use the seeing version of ChatGPT with the voices. That makes sense. I'll delete this account then, as it won't have any use for me.
Man, this will be the third account I remove from this service. I'm surprised they let me make them. Oh well, I don't mind; this stuff will come to an app soon that isn't GPT.
@Gokul
Here's a webpage snippet: What features are available today from GPT-4o?
For now what you’ll get when you sign in to ChatGPT is access to the chat version of GPT-4o, not any of the more advanced voice or video functionality as that is “gradually rolling out” over the coming weeks starting with Plus and Team accounts.
So yeah for me, I think I'll delete my account and wait for it to come to apps.
Gokul
As Brad said, you just get the text box and the ability to use the 4O AI. You are also rate limited to 1/5 of a paid account if you switched to a free account.
Some more videos with this model and vision
Hello Andy. Can you record more videos with this model about some other use cases? Like from everyday life?
Deniz Sincar
Sorry but I can’t. I only had the magical phone for a day. I don’t have access to this version anymore. As soon as I do though I’d be happy to.
if i pay
How long is "rolling out over the coming weeks"? I'd have thought it would be out now for paid subscribers using ChatGPT.
I can’t even
I am so impatient, waiting for this!
Oh boyyyy i’m just gonna leave this here.
https://youtu.be/wfAYBdaGVxs?si=iOJUSYXg-TQdtBpS
Steven.
I’m so calling mine Samantha. Along with about a billion other people. Wow, I didn’t think to try and make her laugh. I wish I had.
Damn!
the way she laughs. Slightly spooky!
I just hope the expressive…
I just hope the expressive/emotive voice can be easily toned down, as I'd imagine, for me at least, that the experience of interacting with an AI tool that mimics human social and emotional behaviors could become quite fatiguing over time. The fatigue may be especially profound when attempting to complete basic or mundane tasks, which is my primary use case for voice assistants at this point.
Down the rabbit hole I go
https://youtu.be/MirzFk_DSiI?si=N4BnjMqFDZvHFb83
Where can I find it?
I have a voice mode in my Chat GPT Plus, but I don't seem to have this magic vision thing.
Also, when I use the voice mode, it appears to hear the TV and any other voices around me, and responds to those.
My last issue is that when I finish any kind of voice chat, it goes into a rating screen, and there's no way out except to kill the app and restart. I must be doing something wrong.
Karen
If they add a sexy Australian female voice, I will be sold.
Fact. 😆
Louise
It seems I don’t have the voice mode anymore, even though I’m on premium. Interesting. Hopefully they’re updating my account?
The expressiveness is great!
The voice still sounds... I don't know the word, tinny isn't it, but there's a buzz there. If they can get rid of that, that would be amazing!
I'm sure for those that want it you'll be able to tone it down, you'll probably have to explain that you'd like it to tell you stuff with limited emotion, but don't sound bored.
I'm looking forward to this in an app.
I had a chat gpt account... or 3, but deleted them as I didn't think I'd use them that much so hope this comes to a navigation app in the future.
Oh, by the way, I really don't like OpenAI's policy where once you've deleted an account you actually can't use the same email address to sign up again in the future. Something feels illegal there but I'm sure it's not. It is, however, frustrating.
@ Lottie
I told you I went down the rabbit hole lol.
RE: Stephen
Same here: premium and no voice mode anymore.
Louise
If you're doing something wrong, I too am making the same mistake. It's the same situation with me.
Is there any way to get this premium mode for paid subscribers? or do we wait?
You just need to wait.
You youngens, no patience. Why back in my day...
RE: I still have Voice Mode
Indeed, I updated my ChatGPT app yesterday and now I have no "voice mode" available.
voice mode is back and layout of app is more accessible
Hi,
The voice mode is back, but since I speak Dutch, I notice that the names of the buttons like "voice mode" and "send" are now translated into Dutch... Also, the interface of the chat window is more accessible in general.
to Andy
Wow, that's quite an impressive demo! Thank you for sharing it with us! Do you know, by chance, if the Be My Eyes app will support this through some kind of text output? I have no issue speaking into my phone if I must, but being able to hear the speech is another story. Again, thanks for sharing this with the AppleVis community! I also understand if you may not have noticed, it's an entirely different consideration and use case from what you were doing, but I thought I'd ask.
When do you think this will be available to plus subscribers
When do you think this, more specifically the voice chat feature, will be available to Plus subscribers? I myself do not have an account but would seriously consider subscribing to the Plus model.
Scott Davert
Hi Scott, you are very welcome. As for the UI, nothing has yet been discussed as the API still isn’t available. In the coming days and weeks I may be asking for ideas and wishes that the community would like considered in the design of the new UI, as I think it's important to have blind people's voices heard when a product for blind people is being designed.
SSWFTW
Hi, Open AI said it will be rolling out in the next few weeks to plus subscribers. I haven’t got it yet and as far as I know, nobody else has either.
API availability
In the email yesterday, it said a small group of trusted partners would be getting access first.
I hope and assume Be My Eyes is one of these partners?
This thing really does have me so excited!
Seeing AI
Yeah, this is truly incredible technology. Our team is hard at work seeing how we can bring this to the community, and how to make the experience even better! 😊
Omg!
The wait is killing me!
I felt this so much!
If it helps any (which I know it doesn't), you're not alone in this.
I want so badly to have access to this!
Don’t think this is really appropriate
How about we just call it AI. No need to make a mountain out of molehills. What are we trying to do, cancel open AI? Omg.
Not only that,
not only that, but you can also change the voices on any sort of Text to Speech. You know what, this conversation is getting ridiculous. I’m out of it. 😂 🤣
Reminder: Respectful and Inclusive Discussions
We would like to remind everyone of our commitment to maintaining an inclusive and welcoming environment for all members. A recent comment in this discussion has been removed as it did not adhere to our community guidelines, which emphasize respect and inclusivity.
In addition, a couple of other comments have also been removed to prevent the discussion from being taken in an undesirable direction. Although not breaching the guidelines themselves, these comments had the potential to trigger responses similar to the one that did breach our guidelines.
We encourage all members to contribute thoughtfully and ensure that their comments do not alienate or marginalize any individuals within our community.
Thank you for your cooperation and understanding.
AI learns, don't forget
Whatever the oncoming AI in your device might be and do, it will be more and do better over time, based on what you do and on what you instruct it to do. AI learns. Your own AI probably won't tell you what it learns, but you will be able to tell based on what your own device's AI does.
Down the road a ways, I wonder how many people will still be alive.
Wow. So amazing
That's truly awesome stuff. Life-changing for so many of us. I will no longer have to feel if my dog is sitting or not, I would be able to walk around the shops and look at shop prices, etc. I don’t even care if I have to pay for it, it would be worthwhile.
This would be an AI I'd not mind paying for.
I don't mind paying for this on Seeing AI, I'll see how it works on BeMyEyes and I might decide to pay for it there too.
The potential...
While this is all very fascinating, and I, too, am excited to see where AI and technology, in general, takes us, might I suggest that we all take a step back and let the programmers, engineers, and designers take their time in working out all the nooks and crannies of potential failure; so that we, the consumer, can have a wonderfully innovative, life fulfilling, and ultimately joyfully engaging experience with our new AI companions?
Ya know, rather than the tired and true new software every year that is full of plot twists and pot holes, that Apple has become synonymous with?
Just a thought. 🙂
Brian
While I absolutely agree with you there, I think it's also important to keep constantly engaging with the techies so that, 1: they realise how life-altering this kind of thing is for the community, thereby aligning product development that way, and 2: an experiential perspective is incorporated so that we get a practically usable product rather than some fancy thing which is of no day-to-day value.
Not yet.
I think someone else was talking about how it might become a thing, I'd not mind that if the chat gpt4o stuff is included.
Described Youtube and video games!
So, just imagine: ChatGPT gets a mode where you can share your screen to it, and you're playing retro games on your phone, through emulators like Delta or RetroArch. So, you start the game, like Final Fantasy 6, and then share your screen to ChatGPT. It tells you what menu options you're on, describes scenes, and tells you how to get to objects. Reads dialog and everything.
Or, you're watching old TV shows that won't get audio description, like Dark Shadows from the late 60's. No, I'm not that old but my grandma got me into it. And I think it's on Tubi TV. Anyway, you may have to tell it to use brief descriptions, but I imagine it could easily do it. Or maybe you're watching a YouTube video about old game consoles from Modern Vintage Gamer. GPT could easily describe the consoles, snippets of gameplay, all that.
Or, let's say it comes to Windows! Do you have a pair of headphones, like the Corsair gaming headphones that have a mostly inaccessible app? Just share your screen, tab around, and it'll tell you what you're on! Just some ideas, of course.
Omg
Am I the only one who has been incessantly checking the chat gpt app seeing if my account was updated yet? lol!
why say this, when i go to upgrade Access to GPT-4, GPT-4o, GPT-
so it should be out then?
when i go to upgrade to plus, I get:
Access to GPT-4, GPT-4o, GPT-3.5
That is very misleading. You'd have thought it would have been in their app by now lol, and other apps like Seeing AI and Be My Eyes will have to implement their own model of it. Also, I noted that when the OpenAI video channel showed someone today pointing at objects and asking what they are in Spanish, it wasn't the same voice. Why would I pay just to get it, if it's going free? If I subscribe today, right now, does that mean in theory I should have it?
when select gpt 4o
Hello. When you go into the popup menu in a chat, as a paid member you can now select ChatGPT 4 point 0 (I spelt it out for you), ChatGPT 3.5, and the new 4o. But in 4o the new voice and the image features are not there, so how do we know when we have it? If you have both options, 4 with the letter o as well as 4 point and the number 0, shouldn't the letter o version let you use the images and speak directly to the AI, as nothing is changing?
It’s been said time and time again
It’s coming in the coming weeks. You have access to the text, but not the video and voice features yet.
Lottie.
I’ve felt exactly the same for the last couple of weeks. I think OpenAI had an iPhone moment and the world changed a little bit, and I don’t think it’s going to be the same again. The iPhone changed everything and I think having non-human intelligence that's this usable and approachable changes everything too. I really wish I still had that phone lol. To start with, this is likely to be a mixture of novelty and tool for us, but over time I think it just gets woven deeper into our lives, our culture and everything else. Once non-human intelligence is this human, how can anything be the same?
Lottie.
Yep, I’m fine. I wasn’t even aware that was going on. I’d be interested to know what they were saying. I don’t use Mastodon. Well, I have an account but I kind of stopped using social media when Twitter went bad and didn’t make the switch. Maybe I’ll check it out tomorrow. Thanks for defending my honour. Did you challenge any of them to a duel? I so need a blind duel to happen. I’ll referee.
Two weeks to release
Hi, I asked GPT Plus when the voice and imaging features will be available and how I will know, and I got this: "The new video, voice, and image features for ChatGPT are being gradually rolled out. Initially, Plus and Enterprise users will gain access over the next two weeks. If you are a Plus user, you can check for these features by going to Settings → New Features on the mobile app and opting in. If you don’t see the features immediately, they should become available soon as the rollout progresses. For more details, visit the OpenAI blog."
@ Will
It lied to you. There are no new features under settings lol.
Question regarding video description
Wait, can someone please clarify this for me? So, if we want videos to be described on YouTube, in theory, would we share the screen with the artificial intelligence, or would we copy and paste the link? I actually tried this with Google Gemini, and it was able to give me a brief overview of the video if I posted the link. I was able to get an overview of the video, as well as scenes, outfit changes, background, and any text elements on screen. Of course, I had to ask the right questions in order to get the answers. Also, there were a few videos that I tried to be described, but it told me that it contains sensitive material, or that there was no metadata for it to tell me what was in the video, so I'm guessing that there were captions that the artificial intelligence couldn't read. Or, maybe they were not included. Either way, the potential is there, but it's not quite up to standards as yet.
I have some of the text features now. I also see a list of generators created by the ChatGPT team, but it's not working as yet. I can also attach files and access memory across the conversations.
@Gokul
It is absolutely acceptable to hype up new technology through advertising and word of mouth. I was more referring to the plethora of replies in this thread that can be summed up in 3 words: "Gimme, gimme, gimme!!"