Aaron C Wacker PRO

awacke1

AI & ML interests

AGI and ML Pipelines, Ambient IoT AI, Behavior Cognitive and Memory AI, Clinical Medical and Nursing AI, Genomics AI, GAN Gaming GAIL AR VR XR and Simulation AI, Graph Ontology KR KE AI, Languages and NLP AI, Quantum Compute GPU TPU NPU AI, Vision Image Document and Audio/Video AI

Organizations

awacke1's activity

posted an update 15 days ago
view post
Post
433
I am integrating Azure Cosmos DB, the database system that backs GPT conversations into my workflow, and experimenting with new patterns to accelerate dataset evolution for evaluation and training of AI.

While initially using it for research prompts and research outputs using my GPT-4o client here which can interface and search ArXiv, I am excited to try out some new features specifically for AI at scale. Research on memory augmentation is shown. awacke1/GPT-4o-omni-text-audio-image-video

awacke1/AzureCosmosDBUI
posted an update about 2 months ago
view post
Post
1263
I just launched an exciting new multiplayer app powered by GPT-4o, enabling collaborative AI-driven queries in a single shared session!

### 🔗 Try It Out! 👉 Check out the GPT-4o Multiplayer App
Experience the future of collaborative AI by visiting our space on Hugging Face: awacke1/ChatStreamlitMultiplayer

🎉 This innovative tool lets you and your team reason over:

###📝 Text
###🖼️ Image
###🎵 Audio
###🎥 Video

## 🔍 Key Features

### Shared Contributions
Collaborate in real-time, seeing each other's inputs and contributions.
Enhances teamwork and fosters a collective approach to problem-solving.

### Diverse Media Integration
Seamlessly analyze and reason with text, images, audio, and video.
Breakthrough capabilities in handling complex media types, including air traffic control images and audio.

## 🛠️ Real-World Testing
This morning, we tested the app using images and audio from air traffic control—a challenge that was nearly impossible to handle with ease just a few years ago. 🚁💬

🌱 The Future of AI Collaboration
We believe AI Pair Programming is evolving into a new era of intelligence through shared contributions and teamwork. As we continue to develop, this app will enable groups to:

Generate detailed text responses 📝
Collaborate on code responses 💻
Develop new AI programs together 🤖
replied to Wauplin's post 2 months ago
view reply

Such good news thanks! With this we can now create AI pipelines with much greater simplicity to make models interchangeable service parts. I think for cutting edge techniques like MoE gating networks, Self Reward and Comparison across models, Memory across AI pipelines, etc this becomes the differentiator to make it all much easier. I hope that by operating key models like GPT-4o, Claude 3.5 Sonnet, Gemma, Llama, and other front runners in this open pattern unlocks better more powerful AI coding patterns.

posted an update 3 months ago
view post
Post
2383
✨🚀 Claude Sonnet 3.5 API. It's already weaving digital magic!
🧠💻 Try it at my space: 🔗 awacke1/AnthropicClaude3.5Sonnet-ACW

Kudos to @AnthropicAI for this elegant API! 👏 #AI #CodeMagic #AnthropicAI Thanks Huggingface for hosting the best hub in the world for AI development!

replied to their post 3 months ago
view reply

It uses my openai key and org id and is hard to run in an open fashion due to usage. It uses the billed model.

replied to their post 3 months ago
view reply

You can use whisper-1 for now and that pattern works great. The speech wav stream recorder is not in the code for openai yet. I use a streamlit recorder in order to get speech in which is working but I am looking for a better speech in/out technique. The audio to text is used as well and is how the video modality inputs its transcript for additive data input with the image slices from video. One thing I also did not see yet was the image generator inside the client api. That would be nice to add as well and also the speech synthesis.

replied to radames's post 4 months ago
view reply

Oh wow that is so cool! I'll have to read up on this. Thanks for sharing source.

I saw the a16z moniker in the code, is that one of Marc Andreessen Horowitz's group inventions? I think they were investing in Roblox and the tech behind the multiplayer aspects with now multiagent systems is pretty neat.

Convex looks interesting too w combo of vector search, db, ts, AI and realtime services with a great stack of client libraries. I was able to double click into it with github perms and get started.

Its pretty amazing to me that HF can host something so cool. Your Dockerfile implementations are simply amazing.

replied to prabhatkr's post 4 months ago
view reply

As the root of the proto-indo-European language tree I thought all of our European languages came from it but the Sanskrit Heritage Dictionary has around 180k words, so it might be something smaller like token representations counting inter-combinations of symbols. If you count compound words and infleccted forms then you could probably say Sanskrit has maybe millions of words but probably not billions.. Looks like a language model hallucination to me :)

posted an update 4 months ago