LLM Prompting Goldmine

Remember the story about the McDonald’s at the top of the volcano?

Well, it erupted. (…and I sincerely hope that you didn’t quit your job to make custom GPTs for a living.)

A few people figured out how to trick the custom GPTs that are starting to roll out to the app store into revealing their prompt instructions.

The result is hands-down the most comprehensive and high quality repository of prompts that I have ever found: https://github.com/linexjlin/GPTs

For additional context, many of these are “professionally created,” bringing in thousands of dollars a month in revenue sharing to their creators. I’ve shared elsewhere that I think it’s a terrible idea to try to sell AI products right now, and this is a case in point. It has a nice parallel with the “AI as electricity” riff - once you know how to do the magic trick, it’s pretty hard to keep other people from copying you! (And very quickly everyone will expect it to be essentially free.)

P.S. - If you want to learn prompting more formally, this is the best course I’ve found: https://learn.deeplearning.ai/chatgpt-prompt-eng/

P.P.S. - Hat tip to Mayo as usual. Sign up for his newsletter and you won’t need me.

I struggle to create clear instructions for AI.

I struggle to create clear instructions for humans.

I struggle to create clear instructions for AI to create clear instructions for humans.

But we move forwards, step by step.

Try out Human Instruct Turbo today!

In this video, we go behind the scenes of the creation of the revolutionary new GPT, Human Instruct Turbo. This is unedited, unplanned, completely raw footage of creating a custom OpenAI GPT from start to finish. Watch me fumble so you don’t have to.

In this video, I delve into the world of AI transcription, specifically focusing on MacWhisper, a leading tool for AI-driven voice-to-text transcription on Mac. We explore how to enhance MacWhisper with a custom glossary for accurate transcription of unique words and proper nouns, and share tips from the OpenAI Cookbook to refine your MacWhisper settings.

I also demonstrate real-life application by adding custom product names to our vocabulary, troubleshoot common transcription mistakes using find-and-replace, and explore the benefits of using larger AI models for improved accuracy.

Whether for professional or personal use, MacWhisper adapts to your specific language needs, offering privacy-focused and highly accurate transcriptions.
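
(Sidebar for the tinkerers: as I understand it, the Cookbook trick behind the glossary feature is Whisper’s prompt parameter, which biases the decoder toward the words you feed it. Here’s a minimal sketch of that same idea against OpenAI’s hosted Whisper endpoint - the file name and glossary terms are made-up placeholders, and MacWhisper’s own glossary field handles the equivalent for you inside the app.)

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in your environment

# Hypothetical glossary of product names the model keeps mishearing
glossary = "polySpectra, COR Alpha, MacWhisper, Omneky"

with open("meeting.m4a", "rb") as audio_file:
    transcript = client.audio.transcriptions.create(
        model="whisper-1",
        file=audio_file,
        # Whisper leans toward spellings that appear in the prompt,
        # which is how the Cookbook suggests handling proper nouns.
        prompt=f"The transcript may mention: {glossary}.",
    )

print(transcript.text)
```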

Today I attended Builder’s Roundtable: Generative AI for eCommerce. It was pretty good. 2x it, or - better yet - get your AI to watch the replay for you.

There were a few interesting ideas on customizing generative AI for eCommerce. Unfortunately, I think the OctoAI product still has a long way to go; frankly, I was not impressed with the onboarding experience after getting amped up by this webinar.

But one quote stuck with me all day. It has nothing to do with AI:

Find something that you’re really passionate about, find something that you can become the best in the world at, and find something that you can make money doing.

- Hikari Senju, Omneky

I think this is fantastic advice for anyone, not just entrepreneurs. A trifecta of how to decide what to work on. (58:39 in the video)

DALL-E 3 made this diagram. I don't know what the symbol in the middle is, but I think you get the point.

Incentive Structures

“Show me the incentive, and I will show you the outcome.” - Charlie Munger

This weekend I’ve been thinking a lot about incentive structures. (RIP Charlie Munger)

I don’t think I have any new “crispy realizations”, but here are a few related items:

The Pace Is Exhausting

The pace of AI development over the last few months has been simply exhausting. Exhilarating, but exhausting.

Just look at this chart. This is just the open source LLMs.

I don’t check this leaderboard very often, but the Mistral models that were winning two weeks ago aren’t even in the top 20.

Here’s some cool stuff I found since I ate dinner an hour ago. (Sorry, I literally don’t know what else to do…there’s too much cool stuff.)

  • WikiChat on GitHub: WikiChat enhances the factuality of large language models by retrieving data from Wikipedia.
  • LLMCompiler on GitHub: LLMCompiler is a framework for efficient parallel function calling with both open-source and closed-source large language models.
  • Flowise on GitHub: Flowise offers a drag & drop user interface to build customized flows for large language models.

For those who follow along, you’ll know I’m currently obsessed with AI, voice-to-text transcription, and the intersection of AI and voice-to-text transcription.

I wrote some thoughts about “the perfect voice transcription tool” - which unfortunately still doesn’t exist.

But what did happen this week is that MacWhisper found a way to 3x the speed of their transcription model. And that boost in speed is enough to make it better than Otter or HappyScribe for my use case.

So yesterday I unsubscribed from HappyScribe.

MacWhisper transcribes locally on your machine. You can trade accuracy for speed, and it has a free tier. I’ve paid for it because I want to support Jordi, and because I want to run batches of audio files through it.

The quality isn’t quite as good as HappyScribe, but since it’s local I can quickly get ChatGPT (or Jordi’s MacGPT) to fix it up. The added time to fix those extra errors is less than the time it takes to upload to HappyScribe, wait for the transcription, and download the file.
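
If you’d rather script that cleanup than paste into ChatGPT, here’s a rough sketch of what I mean: a deterministic find-and-replace pass for the predictable mistakes, then one chat call for the fuzzier ones. The replacement pairs, glossary, and model name below are placeholder assumptions, not a prescription.

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set

def fix_transcript(raw: str, glossary: list[str]) -> str:
    """Repair recurring speech-to-text errors in a raw transcript."""
    # Cheap deterministic fixes first (classic find-and-replace)
    replacements = {"poly spectra": "polySpectra", "mac whisper": "MacWhisper"}
    for wrong, right in replacements.items():
        raw = raw.replace(wrong, right)

    # Then let a chat model handle the fuzzier errors
    response = client.chat.completions.create(
        model="gpt-4",  # any capable chat model works; use what you have access to
        messages=[
            {
                "role": "system",
                "content": "You fix speech-to-text errors without changing the meaning. "
                           f"Prefer these spellings: {', '.join(glossary)}.",
            },
            {"role": "user", "content": raw},
        ],
    )
    return response.choices[0].message.content

print(fix_transcript(open("transcript.txt").read(), ["polySpectra", "MacWhisper"]))
```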

Modular Frankenstein

I’ve been working on my AI “content machine” for polySpectra. As I wrote before, quantity is very easy, but quality is a challenge.

It is very easy to output nonsense. I have been able to achieve a high throughput of “SEO-optimized nonsense,” but I have been having a hard time getting technical content that isn’t more work to edit than it would have been to just write it myself (or to write it step by step while “holding the AI’s hand”).

I also got “greedy” and was trying to build a system that would go “all the way” from ideas/keywords to fully written articles. This was too ambitious.

So now I’m taking a more modular approach: first building up the foundational concepts, research, and structure, which will later serve as the training data for AI-assisted writing. My modular approach also involves a human-in-the-loop at every stage - because nothing is more annoying than propagating errors with AI.

But my eye is on scalability, so I’m making sure that each stage of the process can run concurrently. In other words, the non-human steps should take the same amount of time for one report or one thousand.

My big “it’s alive” moment today was getting GPTResearcher from Tavily to run concurrently:

🧟‍♂️ It's alive!

This did about 15 reports in about 3 minutes. I haven’t pushed to see when I hit my OpenAI API limit.
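
For the curious, here’s roughly what the concurrent version looks like, sketched from the interface in the GPT Researcher README (the queries and report type are placeholders - double-check the current docs, because this project moves fast):

```python
import asyncio
from gpt_researcher import GPTResearcher

async def one_report(query: str) -> str:
    # "resource_report" is one of the documented report types; swap as needed
    researcher = GPTResearcher(query=query, report_type="resource_report")
    await researcher.conduct_research()
    return await researcher.write_report()

async def main(queries: list[str]) -> list[str]:
    # Fire every report at once: total wall-clock time is roughly the slowest
    # single report, not the sum - the "same time for one or one thousand" idea
    # (until you hit your OpenAI rate limits, anyway).
    return await asyncio.gather(*(one_report(q) for q in queries))

if __name__ == "__main__":
    queries = [
        "heat deflection temperature of 3D printing photopolymers",
        "ceramic-like printed parts for harsh environments",
    ]
    reports = asyncio.run(main(queries))
    for query, report in zip(queries, reports):
        print(f"{query}: {len(report)} characters")
```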

As breadcrumbs, here are the “resource reports” that I generated: https://polyspectra.com/tags/resource-reports/. These are not very engaging, nor are they meant to be, but they will serve as the foundation for the next step…

In addition to the human oversight at each step, this modular approach also lets me mix and match the best tools for the job. Tavily is great for research, but the writing style is pretty rigid, and I don’t feel like rewriting its guts. So I use it just for the step that it excels at.

Big Companies Love Big Data

There are three reasons why big companies are so obsessed with big data.

One, there are very few people inside the company that actually have a clue about what’s really important. (Or at least a low density of people who have a clue.)

Two, the people who do have a clue unfortunately have to justify every activity and expense to people who don’t have a clue. (And the people who don’t have a clue are usually the ones who are in charge of the budget.)

Three, big data makes it easier to draw spurious correlations. The people who don’t have a clue (and maybe even the ones who do have a clue but don’t understand statistics) have no idea that the projection they’re looking at, the extrapolation that justifies the decision, has no basis in reality.

Big hug for big data.