Hello guys,
I have created the free app PiccyBot that speaks out the description of the photo/image you give it. And you can then ask detailed questions about it.
I have adjusted the app to make it as low vision friendly as I could, but I would love to receive feedback on how to improve it further!
The App Store link can be found here:
https://apps.apple.com/us/app/piccybot/id6476859317
I am really hoping it will be of use to some. I have earlier created the app 'Talking Goggles' which was well received by the low vision community, but PiccyBot is a lot more powerful and hopefully useful!
Thanks and best regards,
Martijn van der Spek
Comments
Re: clipboard
If I am in Facebook and tap an image, there is a More icon. Under that is an option to copy the image.
I presume this is going onto the clipboard but I could be wrong. Knowing me and my lack of social media skills, I'll find I've been posting them to my timeline or something stupid.
Voices are working again
Hi Brad, thanks for the heads up. There was an issue with the server handling the voice requests. It should all be working fine now.
Re: Voices
Ah I was wondering why everything was silent. Then it suddenly started working just now.
Great!
Thanks for such a quick response.
Copying images to clipboard
If the image is in your camera roll on your iPhone, when you go down to the share sheet, there is an option that says, copy. Not copy iCloud link, but, copy. I also know that it's possible to copy images directly on Google Chrome on the iPhone. When you find the image, if you long press on it, it will give you an option to copy the image. If you don't see the option there, if you go down to the share sheet, you should see copy image as an option. You should be able to paste the image to your notepad and interact with it from there. Sometimes, it's not always possible to paste the image directly into an artificial intelligence app for descriptions, such as this one, for some strange reason, even though the image has been copied to your clipboard. Letting you know this information, so that you can keep it in mind, in case it doesn't work. The copy image option isn't available on the Safari browser. Hope this helps.
wow!
The personality thing is just superb, especially if you're looking through your old pictures or are just killing time. Does anyone know if there's any way to use these voices from Open AI to read, say, an ebook?
Re: Personality
I love the option of the personality but I personally couldn't stomach it for long. Literally every single photo is the most wondrous and magical thing that's ever been shared by anyone ever. The little laughs, breaths or gasps are realistic but doing that for every photo it just becomes a bit silly.
One time I had a photo and asked it to tell me which animals were in it. The voice waffled on and on about how amazing and wondrous this scene was and how special and magic the animals were. I swiped to the description and VoiceOver said "dog and a cat" which made me laugh quite a lot.
Anyway it's a good option but I can't see myself using it again but good if others are enjoying it.
I like the app, though. The AI voices are a really nice touch and being able to swap between the AI models is great.
I did have a couple of instances where I asked for a detailed description of my dog (yes that's more or less all my photos are of) and it refused saying it couldn't do that for known people or animals or something. Which is a shame. I recently lost one of my dogs and would love it to be able to describe her in a way that really brought her to life. I miss being able to look at photos myself and good though the AI is, it doesn't come anywhere near close to replacing that.
If it could make it easier for me to describe Facebook images then I would get a lot of use out of it as my sister is always posting images without saying what they are about.
One other very minor thing - I found the settings where you choose the voice a bit confusing in VoiceOver. Each voice is listed twice, the second one says button after it. It's not clear that the first time is where you select it and the second time is the preview. Well, it wasn't to me anyway. But that's only a small thing as I probably won't change voice much. I'd also love some British voices but I don't think OpenAI has any yet.
What camera does the app use?
Does this app use the front or back facing camera on the iPhone? I took a picture of something laying on my bed, and it gave me a description of my bedroom.
Re: Which camera
When you take a photo, there is a switch camera button at the top which presumably toggles between front and back.
However, it doesn't seem to tell you which one it is using with VoiceOver unless I am missing something?
New update: camera switch and language support
Hi Lisa and Mr Grieves,
A new update is available to PiccyBot that should support VoiceOver on the camera direction. Please try it out?
The update also supports several more languages, it will speak out the image in the language your iPhone has been set at. The settings screen options will also be presented in that language. Still finalizing full language support, should have that ready for the next update.
disable the waiting sound?
well, we just can't please everybody, can we? the waiting sound is probably a good idea, but personally i'd rather wait in silence. if it's not too much trouble, an option to disable the waiting sound would be great.
@Martijn
Thanks - that works great and I can now tell which camera is active.
I also noticed that you added a speaking rate option which is also really appreciated. I tried it at the max of 150% and it still sounds very convincing and human.
It still seems a little random as to how much detail I'm going to get out of it. For example I used the default prompt and was told there was a sofa and a couple of other things. I then asked if there were any animals and it gave me a more detailed answer about the room as well as telling me the answer. Whereas I think I find Be My AI a bit more consistent in the level of detail I get.
I like the fact you have sliders to increase the verbosity and I went straight in and set them all to max for now, but I can see some situations where less is more.
I think having all the customisations does set it apart. I can almost see a time when it might be handy to be able to set up different profiles (which could include the settings plus custom prompts) and then I can swipe up and down on the main screen to cycle between them. For example, let's say I always lose my keys, I might want a fast speaking rate, low verbosity and a prompt "where are my keys?". If I'm just wanting a description I might want it slower, with more detail. Or I might want a couple of different personality options. Anyway, that might be a terrible idea so please ignore if it makes thins too complicated.
descriptions are very brief
hello the descriptions in my language are very brief eventhou i set the option for 100 procent. i do not appreaciate the opinion of the application. i appreaciate 3 engines althou the hird one does not work for me. can you add option descri,bing pictures that are instantly changing such in a movie or youtube? anyway keep up a good work and yes i, bought life time subscription to support you
longer answer got cut off
i don't mind if the ai voice can't read the whole thing, but the text also got cut off, so there isn't other option, such as review it with voice over. some example here"
The image shows a person's neck with several red spots. There are a number of possible explanations for red spots on the skin, and without a proper medical diagnosis, it's hard to ascertain the exact cause. However, I can provide some general possibilities:
1. Acne or pimples: Red spots can be the result of inflammation due to acne, particularly if the person has been wearing something that rubs against the neck, causing irritation and breakouts.
2.
well, you see, there is definitely number 2, and possibly 3 or 4, but it got cut off at number 2. is this some kind of limitation or a bug?
pdf files?
Many of the images we get come as scanned pdf files. But PiccyBot doesn't appear in the sharing sheet of PDFs. I don't know if it's me or if it's that way. In case it's designed to not work with PDFs, including that option would be great.
Beeping setting
In reaction to the request from LaBoheme, I'll include a setting to switch the beeping off, in the next update.
What model are you using that is cut off? Thing is, I have to match the model's tokens with the duration setting of PiccyBot. But for each model that works out differently. Still tweaking that.
claud 3 was the model being used
i forgot to try it with other models. also, panorama photo can't be submitted using the share sheet. i can submit pano using the photo picker in the app, but not from the share sheet.
Added processing sound setting and more localization
A new update is available. I made the processing sound optional and fixed the iOS 17 mic crash issue. It also has better language support.
Keep the comments coming!
Loving this APP!
I really am loving this APP allot and I find this is better than Be My Eyes especially on the Claude 3 setting. I collect action figures and I do a bit of action figure photography, and with Be My Eyes, it has issues describing allot of the most... shall we say, "adult" looking figures and statues. (Try running photos of characters from Lady Death or Hellwitch, and you'd know what I mean LOL) This is where Piccybot on Claude 3 setting really comes through. It still has snags, some times it's acting self righteous and refuses to describe some things, but I find that if I keep at it it will generally work after rephrasing the questions a few times.
I wish that there's a website version of this APP we can access through a windows PC since most of my photos are stored on my PC and labeling/renaming them is way faster on the PC than having to swipe and click 6 or 7 things on the Iphone.
Another feedback is the "personalities" of these voices are really rather annoying at time :) I eventually just turned personality off cause they came across as condescending or snarky and tended to piss me off when all I wanted was detailed descriptions of a character.
I ahve also encountered the voiced descriptions cutting off half way quite often. Even the text shown on the screen would not show the intire description at times.
Overall, this is one of the best photo describing APPs I've used. Other than Be My Eyes and, to a lesser extent, Seeing AI, nothing else can compare at this point. Thank you for creating this little wonder of an APP :)
Android?
Hi, is this on android?
any chance to add preset prompts?
Still really enjoying the hell out of this APP. As an artist and action figure collector, I interact with lots and lots of photos and have to sort through them. This APP has been a god-send. Yes, Be My Eyes was my go-to before, but it is so sensered that it's useless when dealing with some artistic subjects.
I want to put in a feature request and suggestion for pre-set prompts. Often I am on the phone or meetings while sorting photos using the app- kinda like my little fidget toy :) but I can't dictate anything while a phone call or zoom meeting is going on, and typing by hand is slow and tedius after awhile. I sort through lots of action figures, statues and artworks, and I do have a set of questions I usually ask "Hone in on face." "Rotate to the left" "Hone in on background, ignore the figure" etc etc. It would really be useful if we can pre-program say, a set of 10/20 prompts that we can just tap on for descriptions when dictation is not possible.
Also, not sure if this is possible at all, but while I am on the phone it's really hard to hear the APP cause during a phone call other things like voiceover and sounds are lowered in volume. Is it possible to have a "phone call" mode that I can toggle on and the APP can speak louder?
Thanks for such a wonderful APP!
Click pictures with button click?
Would be great if pictures could be taken with, say, a double click of the side button, especially if you're trying to click some written content or when you're using the front cam.
re: click to take photo
"Would be great if pictures could be taken with, say, a double click of the side button, especially if you're trying to click some written content or when you're using the front cam."
Totally agreed. I actually submitted that suggestion to Seeing AI but they've not put it in. On Iphone's own camera app, you can start/stop video, take photos, just by clicking the volume button. I find that makes thing so much easier cause then you can take photos using the bluetooth keyfob remotes that come with selfi sticks and some tripods. With seeing AI and other apps, I find that I may have the shot lined up, but when I double tap on "take picture," it shakes the camera and things get blurred.
New update available
Hi guys,
Gokul and Privatetai: thanks for the suggestion, the new update allows you to take a picture using the volume button. It will then straight away process and speak it out using your current settings. I also added a photo click sound.
I will look into preset prompts. This update does include a history drop down of previous questions. Hope that is useful in some way.
The PiccyBot update has a few more fixes, the duration setting is now scalable from 10 to 100 and should work better than before.
Joshua, there will be an Android version soon. Will let you know when it is on google play.
Thanks for the feedback as always, please let me know what works and what needs further working on!
re: new update
Thanks for the update! Took me awhile to find the "history questions" lol, it's beside the keyboard after you double tap to go to text field. Now I'm going to spend the next few days building up the history prompts, then it'll be smooth sailing!
Wow!
the click with volume buttons should be a blessing and a time-saver. Will try it out!
Re: volume button
Thanks for the update.
The volume shortcut works well from a quick play last night. I like how it skips the intermediary step of orientating etc the image which always felt a bit unnecessary to me. Not sure i that is still there if you use the camera the normal way but it's nice to skip past it.
share screenshots with the app?
hi,
It is not possible for me to share a screenshot to the app? When I tried it and press send on the prompt I see a quick dialogue to add a picture?
Secondly, where can I find the option to add the most recent picture?
Compare 2/more pictures?
There are situations when we might need to take one picture, get the describtion, take the same picture from a different angle or may be take a new picture, and the actual thing we need might come from a comparison. so an add picture option or something like that will be nice to have.
New update
A new update is available. Cleaned up the settings screen a bit, making it more logical. Functionality is still the same. I did add a language selection option.
The voices all sound good in English, but some of them are better for different foreign languages. Fable for example is sounding perfect as a native Dutch speaker, while Alloy has a heavy US accent.
Hope you guys can play around with it and let me know what you think.
Ollie and Gokul, thanks for the feedback, looking into that. The album and metadata options are interesting to add as a batch process (OpenAI gives 50% discount on that so that is helpful). May consider a separate app for that to keep PiccyBot focused, functionality wise. The personal info is another good point but it may result in Apple objecting, privacy wise. Facing some related issues with Google about the Android PiccyBot release currently, and don't want to rock the boat with Apple too much. Thinking!
Android version now available
Joshua, Google has finally accepted PiccyBot for Android. It is available on Google Play now. The same link works for both iOS and Android versions: https://sparklingapps.com/PiccyBot
Ollie, thanks for the suggestion about the image meta data. I have now added that to the iOS version. When you share the image, it will add the description into the metadata of the image. So you can later access it, index/search for it, and so on. Will include that with the next update of PiccyBot.
One more thing I forgot to mention is that the PiccyBot camera in the latest update has a flash option as well. So if you want to take a quick picture, use the volume control button, but if you want to select the front or rear camera, add flash, or want to zoom into a part of the image, use the regular camera button.
Different language, different result?
I fed it a photo of a woman sitting on the bed, wearing a form fitting dress. I then asked it to "generate a 20 words descriptive title for this photo based on outfit, posture and body type." When I had the language set to Mandarin Chinese, it absolutely refused to generate the title. It kept returning something like "I am sorry, but I can't do that because it's not respectful of the person." I changed language back to English, and suddenly it's working every time LOL.
BTW, while having the prompt history is pretty good and acts as a good alternative to having pre-programable prompts, I noticed that once I rebooted my phone, it cleared all my prompt histories and that was a bit frustrating.
New update
Apple accepted my latest update just now. I have added more AI models to choose from, all flavors of Claude 3 as well as the new Reka AI. Give them a shot and let me know what you think? Personally I feel Claude Opus is still the best overall.
Apart from that, the image description tag (thanks again for the suggestion Ollie!) has now been implemented. Any time you save an image through PiccyBot using the share option, it will include the description with the image. This works for both the basic version of PiccyBot as well as the pro version. Let me know what you think about this feature please. If this is deemed useful I am considering creating a separate app that processes image folders and adds these descriptions to each image in a folder, as a batch process.
I don't care much for photos but...
I'm so glad I supported you, I don't need the app anymore but you're so commited to making this app the best it can be that I'm willing to donate again, is there a donate button in the app or is it a one time thing?
I have to say this
This app is turning out to be a game-changer in so many ways, particularly because you are continuously listening to the user feedback and getting user suggestions implemented. That also makes this practically the most useful app in day-to-day life. What would be so nice will be to see this app being integrated to something like the Meta Glasses as a subscribtion service or something.
One suggestion: it'll be nice to have a 'module' where you can ask the app to look for a specific thing or item at the first instance itself rather than first getting a general discribtion, and then asking follo-ups etc. There are instances where I perfectly know where I am, and what I need to find. For example, let's say I am in a department store. I need to just say something like "tell me my position with respect to X section" and then the app can click the picture and give the info.
Exactly
@Ollie. Exactly that's what I wanted to have. instead of asking it for a general describtion, the user should be able to define the exact info they need in the first instance itself.
Also, while we do have apps like BeMyEyes and Seeing AI, it is always good to have alternatives out there; for instance, when you need to compare results and make sure things are exactly the way they are being described.
adding descriptions to images
This is such a great new feature, thank you.
After upgrading to the latest release, Share was broken until I went in and selected an AI Model (probably because the list has changed), which confused me a bit. But now it works very well - the description is now just sat there under the photo in the Photos app.
Awesome work, thank you.
I love this over other AI models...
Hey developer, I have been using other AI models to get descriptions for the last couple of months and stumbled upon this a few months back so I decided to get a paid version of it and I'm loving it. There's a lot of good features and options so that made me want to pay for it. I do have one suggestion though, the share sheet situation is a bit aggravating because I choose to share photos from X which is formally known as Twitter and when I click on this app to get a description it basically just brings me to the app like I was opening it for the first time and I'm not able to get any descriptions unless I would have to save the photo to my photos app and then hit it from there to get the description. Not all the time I want to save photos to my album just to get descriptions. What's going on with that? And sometimes when I hit share from my Photos app it basically brings the app open but it does not get the description I would have to actually open the photo from the album through the app and select it from there. Is there a way for you to fix that?
You are doing an amazing job at this. I gave you kudos for all your work and then helping us by listening to our suggestions. Even if this doesn't get fixed I'm still going to continue to support you and the development of this app. I'm loving how I can save photos now with the description in my albums! Keep up the good work and mini blind/visually impaired people are loving what you do because it's unlike any other AI models.
First of all, thanks a lot…
First of all, thanks a lot Ollie, Gokul, Brad, Mr. Grieves, Martin, for all the positivity about PiccyBot. I really appreciate it and I especially value you jumping to the defense of the app, Ollie.
Lottie, you have a point about the speed. I am working on it, the main pain point is the voice output. I could stream audio with ElevenLabs but that would cost the earth..
To improve speed and remove the personality effect, you can already take the following steps with the current version:
1. Turn voice off
2. Turn personality off
3. Set length to 10 out of 100
4. Choose either Google Gemini or Claude 3 Sonnet as models, they are faster than GPT4 or Reka
This will result in responses that are up to 5x faster. The description should still be good, and you can share it as metadata with your image as usual.
You do need the pro version to access these settings.
My current focus is on performance improvements without sacrificing the quality of the description and voice. I will also check the share issue mentioned by Martin.
Re: voices
I think the voices are one of my favourite things about the app. I am a sucker for that sort of thing.
I wonder if Martin's problem was the same one I had. I presumed I just needed to switch model but maybe that was a coincidence. I was finding that the app would start but not speak anything.
One related thing I've noticed - when I come out of Settings, sometimes it seems to then re-process the current image and sometimes it doesn't. If I wanted to try the same image with a different model, and I go to settings and switch, it doesn't always have another try. And if I go to the text prompt and tell it to go then it won't because I've not changed anything. Not a particularly big deal, just a tiny bit fiddly.
claud opus is very intelligent
it identified my coffee roaster. no other ai model or any image recognition program have ever been able to recognize it as a coffee roaster, they just called it a machine, some contraption or something that was completely wrong, like a oven or a vacuum. opus also stated the name of the roaster, a detail description of the machine, so much so i suspect it searched the web to gather info about this particular roaster?
ran into a bug
The APP is getting hung up on the previously described photo. For example, I had a photo described last night. This morning I shared another photo with it, and it gave me the same description of the photo last night. I thought it was me selecting the wrong photo, so I shared that photo with another app, it described correctly. I then shared it with Piccy again, and again, it described the photo from last night. I even prompted it with "That's not the correct photo" and it came back with "I am sorry for any confusion" and then described last night's photo again... I had to physically use the camera button in the app to select the new photo for it to finally describe it properly, but now it's stuck on that one and won't move on to the next shared photo LOL.
It's like the cache is not getting cleared. I even restarted the APP but still no good.
Also, not sure if this would be doable or not, but when we export/share from the APP to say, "save to dropbox" it would be nice if the App can put a descriptive title in the file name. Right now how I been doing it is, I'll ask the APP "generate a 30 words descriptive title for the photo." and once I got the title I want, I copy it, go export the photo, save to dropbox, and manually paste the title into file name. It's a bit lengthy and tedius, so if the APP has a function to do that it'd be really nice.
Image description too far from the image itself.
I tried this app, but the thing I don't like is that the description is too poetic and imaginative, that is, it doesn't describe what's in the image, but it makes all its personal considerations from the image which don't help.
Turn off personality
It's been said here multiple times, but again, you can turn the personality off and it'll give you accurate describtions.
re: Image description too far from the image itself.
If you haven't, turn off personality in the settings will make it less ... mouthy and more factual. Also, if that's still not good enough, I usually prompt it with "Give me a 200 words factual description. Do not insert speculation or opinion, don't be poetic, give me facts only."
Turn off personality
Do I need to subscribe to premium to deactivate the personality? Because in my app I don't see any way to access the settings. It just gives me the subscription button, then takes photos etc. Nothing about settings.
Re: personality
Whereas I really like having the personality option available as it is fun to play with, it also pretty much renders the app useless in my opinion. So I appreciate it being there, but think it really should default to off. It seems like it is rightly confusing people, particularly if the option isn't available for non-subscribers.
Re: personality
"Whereas I really like having the personality option available as it is fun to play with, it also pretty much renders the app useless in my opinion. So I appreciate it being there, but think it really should default to off. It seems like it is rightly confusing people, particularly if the option isn't available for non-subscribers."
Totally agree. While it is fun to play with. I generally keep it off cause its accuracy suffers too much with personality is on- not to mention it's really quite annoying if you're having an off day :) So defaulting to being off for new users would make sense and just make it kind of a perk of subscribers.
Although, and I'm smiling as I type this, from a business point of view it almost makes sense to default it to be on. Like, it's so annoying that, when you pay for membership and can finally switch it off, in your head you go "Oh thank god..." But if it's off by default and I pay, and I find out what the "personality" is like, I may feel like "Ugh, I paid cause I thought its cool!" I want my money back!
Re: business sense
I don't think anyone is going to try an app, think "this is terrible, I'd better subscribe to see if that makes it good". But maybe that's just me.
Re: voices
I don't disagree that the voices can be thought of as a bit of a gimmick as they don't add any specific extra utility to the app. However, for me personally I find listening to more natural, human voices is just more pleasing so I enjoy using the app more because of them.
It's a bit like if I am online and have a long article to read, I will switch over to Edge and ask it to read it instead of using VoiceOver.
The VO voices are fine and do the job well enough, but I wouldn't say I actually like listening to any of them.
You can switch it off and just use VoiceOver - maybe this also needs the subscription.
I think if you do like the personality option then then they would make even more sense.
I'm assuming that now we have them, there isn't a huge amount of dev work going into that area but I might be wrong.