Cold Hard Cache 💾
"colloquially called the apply model"
"We believe that CoT monitoring may be one of few tools we will have to oversee superhuman models of the future."
"...We recommend against applying strong optimization pressure directly to the CoTs of frontier reasoning models, leaving CoTs unrestricted for monitoring."
https://openai.com/index/chain-of-thought-monitoring/
Long-Horizon Execution: A Key To AI Progress
Lately I keep hearing that AI has “slowed down.” It comes from two camps: casual users who aren't even using the latest and greatest, and savvy folks repeating a fashionable take. Either way, it’s wrong.
Recent op-eds claim progress is stalling. But look closer at what actually matters for agents, coding copilots, and enterprise workflows: long-horizon execution.
A new preprint, The Illusion of Diminishing Returns: Measuring Long-Horizon Execution in LLMs, nails the point: tiny gains in a model's per-step reliability compound into huge increases in the length of task it can complete end-to-end.
The one-line idea
If a model's per-step accuracy is p, the longest task it can finish with success rate s scales like H = ln(s) / ln(p): a chain of H steps succeeds with probability p^H, and requiring p^H ≥ s gives H ≤ ln(s)/ln(p).
Nudging per-step accuracy from “pretty good” to “slightly better” produces nonlinear growth in the solvable horizon, and faster-than-exponential growth once accuracy passes roughly 70%. So even if single-turn scores look flat, multi-step capability can be racing ahead.
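To make that concrete, here's a quick Python sketch (mine, not the paper's code) that evaluates H = ln(s)/ln(p) at a fixed 50% target success rate; the swept accuracy values are just illustrative.

```python
import math

def max_horizon(p: float, s: float = 0.5) -> float:
    """Longest run of dependent steps completable with end-to-end
    success rate s, given per-step accuracy p: solve p**H >= s for H."""
    return math.log(s) / math.log(p)

for p in [0.90, 0.95, 0.99, 0.995, 0.999]:
    print(f"p = {p:.3f} -> horizon at 50% success = {max_horizon(p):6.0f} steps")

# p rises from 0.90 to 0.999 (about a 10% gain) while the horizon rises
# from ~7 steps to ~693 steps (about a 100x gain).
```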
It’s not (just) reasoning—execution breaks over time
The authors separate planning (“what to do”) from execution (“carry it out without slipping”). Give models the plan and the knowledge; accuracy still decays with length. Bigger models execute reliably for more turns even when smaller ones ace turn one. That’s an execution problem, not a knowledge problem.
They also surface a telling failure mode:
Self-conditioning. When the context contains a model's own earlier errors, the chance of new errors rises; scaling parameter count alone doesn't fix it.
Thinking fixes it. Sequential test-time compute (i.e., explicit “thinking” or RL-trained reasoning) breaks that loop and extends single-turn execution dramatically. In their benchmark, a thinking GPT-5 variant clears 1,000+ steps in one shot; Claude 4 Sonnet is ~400.
Why “slowdown” misses the plot
Track task length at constant success rate and you see small step-wise gains compounding—especially in agentic settings where one slip spoils the run. Independent releases point the same way:
GPT-5: new SOTA on AIME 2025 (no tools), SWE-bench Verified, MMMU, GPQA; sharper drops in hallucinations and better multi-step, tool-using instruction following.
Claude 4 (Opus/Sonnet): strong jumps on real software-engineering tasks (SWE-bench Verified, Terminal-bench); explicitly tuned for long-running agent workflows.
Gemini 2.5 Pro: material gains on GPQA Diamond and AIME; “thinking” variants improve more.
METR: finds that the horizon length SOTA models can complete has been doubling roughly every seven months across domains.
None of that looks like a stall when you optimize for finishing work, not just first-turn answers.
Where “a little better” becomes “a lot more useful”
Long-context problem solving. In a chain of dependent steps, boosting per-step accuracy from p to p′ multiplies the maximum horizon at the same success rate by ln(p)/ln(p′); moving from 99% to 99.9%, say, yields a roughly 10× longer horizon (see the sketch after this list). That's why small upgrades suddenly let an agent finish instead of stalling mid-project.
Error recovery. Thinking mitigates self-conditioning; modest gains in recovering from a local slip unlock much longer runs. Non-thinking frontier models can fail at two sequential steps; RL-trained thinking models stretch to hundreds in one turn.
Agent workflows. Spending a few more tokens to “think” beats parallel sampling for long-horizon reliability.
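As a sanity check on that multiplier claim, here's a minimal sketch (my own arithmetic, not from any of the cited reports) of how much the horizon stretches when per-step accuracy improves:

```python
import math

def horizon_multiplier(p_old: float, p_new: float) -> float:
    """Factor by which the solvable horizon grows when per-step accuracy
    improves from p_old to p_new. Since H = ln(s)/ln(p), the target
    success rate s cancels out of the ratio."""
    return math.log(p_old) / math.log(p_new)

print(horizon_multiplier(0.99, 0.999))  # ~10.0: 99% -> 99.9% buys a 10x horizon
print(horizon_multiplier(0.90, 0.909))  # ~1.1: the same bump lower down buys little
```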
Practical takeaways for builders
Measure the right thing. Track horizon length (e.g., steps until 50% success) and turn complexity, not just single-turn accuracy.
Bias toward thinking. Enable sequential test-time compute for multi-step tasks; it stabilizes execution and extends single-turn capacity.
Manage context. Window/summarize to reduce exposure to your model’s past mistakes and blunt self-conditioning drift.
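If you want a concrete way to track horizon length, here's a back-of-the-envelope sketch; the pooled log format and the independence assumption are mine, not from the paper:

```python
import math

def estimated_horizon(step_outcomes: list[bool], target: float = 0.5) -> int:
    """Estimate per-step accuracy from pooled step outcomes, then return
    the longest task length whose predicted end-to-end success rate still
    clears `target`. Assumes steps fail independently."""
    p = sum(step_outcomes) / len(step_outcomes)
    return math.floor(math.log(target) / math.log(p))

# Hypothetical eval log: 990 of 1,000 steps succeeded across all runs.
outcomes = [True] * 990 + [False] * 10
print(estimated_horizon(outcomes))  # 68: ~68-step tasks finish half the time
```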
Bottom line
If you stare at short-task benchmarks, you can convince yourself returns are diminishing. But the moment you optimize for finishing real work—repo-wide code edits, multi-tool research, complex data flows—the picture flips. Small improvements in step reliability and recovery produce outsized gains in end-to-end completion. The frontier is quietly shifting from answering to executing—and by that yardstick, progress is very much alive.
Sources: Sinha et al., Illusion of Diminishing Returns (preprint); OpenAI GPT-5 benchmarks and reliability notes; Anthropic Claude 4 results; Google DeepMind Gemini 2.5 updates; METR horizon-length analyses.
Dissecting ChatGPT's Agent Mode: A Look Under the Hood
We've all seen the demos of AI agents booking flights or ordering pizzas. They often feel like magic. But what's actually happening behind the curtain? I was able to analyze a detailed JSON export from a conversation using ChatGPT's agent mode, which gives us a fascinating, low-level look at the architecture in action.
The Agent's "Inner Monologue"
{"summary": "Clarifying Cedar Brook Fields location", "content": "The user is asking for a flight... I need to clarify which state Cedar Brook Fields is located in, as the user mentioned it might be in either Vermont or New Hampshire. Additionally, I may need to confirm the travel dates..."}
{"summary": "Verifying campground location", "content": "The search shows Cedar Brook Fields is in Vermont, but I need to confirm this by visiting an authoritative source, like the campground's official website."}
A Classic Action-Observation Loop
Think: It generates a plan in its thoughts log.
Act: It issues a command to a tool. This is a code message sent to a specific recipient like browser.search or computer.do. The command is a structured JSON object. For example, to perform a search, it sends: {"query": "Cedar Brook Fields campground location Vermont New Hampshire", "source": "computer"}
Observe: The tool executes the command and returns its output. Crucially, for the computer.do tool, this output includes a screenshot of the virtual machine's display. The agent sees what it's doing.
Re-think: It analyzes the new information (and the screenshot) and generates its next thought, adapting its plan based on the outcome of its last action.
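To make the loop concrete, here is a minimal sketch of that pattern in Python. The recipient names (browser.search, computer.do) come from the export; everything else, including the llm placeholder, the message schema, and run_agent itself, is hypothetical scaffolding, not OpenAI's actual implementation:

```python
import json

def run_agent(task: str, llm, tools: dict, max_turns: int = 20):
    """Minimal think-act-observe loop in the spirit of the exported trace.
    `llm` stands in for the model call; it returns a message dict with an
    optional "recipient" (e.g. "browser.search" or "computer.do") and a
    "content" payload. A sketch of the pattern, not OpenAI's implementation."""
    context = [{"role": "user", "content": task}]
    for _ in range(max_turns):
        msg = llm(context)                    # Think / Re-think
        context.append(msg)
        if msg.get("recipient") is None:      # no tool call -> final answer
            return msg["content"]
        command = json.loads(msg["content"])  # Act: structured JSON command
        observation = tools[msg["recipient"]](command)
        # Observe: for computer.do the observation would include a
        # screenshot of the VM display that the model sees next turn.
        context.append({"role": "tool", "content": observation})
    return None  # ran out of turns
```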
The Toolkit: More Than Just APIs
The Power of Self-Correction
{"summary": "Navigating to Google Flights", "content": "The search results page loaded, but instead of the Google Flights search, it defaulted to Bing. I'll directly navigate to flights.google.com to ensure accurate results."}
{"actions":[{"action":"keypress","keys":["CTRL","Z"]}]}
Grounding and Traceability
From the Code to the Community: How It's Being Used in the Wild
Perform deep research: Analyzing multiple websites to compare products, or even mining Reddit itself for sentiment analysis. One user tasked it with finding ceramic mugs that matched the visual style of a website they were building, a fuzzy, stylistic search that's difficult for traditional tools.
Automate grunt work: Filling out job applications, checking product stock, or finding a friend's lost wallet by navigating a German-language lost-and-found website.
Solve personal problems: One user even employed it to hunt down the best deal on a specific retired Lego set.
Conclusion: It's a Controller, Not Just a Chatbot
📊 Common Patterns Seen in Successful Software Projects
Dogfooding: Creators use their own software heavily, improving it organically.
“Thin to Thick” Clients: Many succes...
"A Name, an Address, a Route" Haiku — Found in RFC 791: DARPA’s 1981 Internet Protocol
A name indicates what we seek.
An address indicates where it is.
A route indicates how to get there.
The internet protocol deals primarily with addresses.
(Not technically a haiku).
What happened to tor?
On March 4, 2025, the New York Times shut down its .onion site, which it had launched in 2017. Is Tor dead? Tor's published metrics show a volume of activity that doesn't look like a big drop. It might just be that there is so much less publicity about it nowadays that it feels that way. It also seems to have lost its status as anything but a place for criminals. Maybe. One theory is that many of the onion proxy networks are run by the FBI. I don't know.
I did some research:
In short — Tor isn’t dead at all.
Its codebase, funding, and user base are still moving forward, but the hype cycle and many clearnet “onion directories” have stagnated, so a newcomer who googles for .onion links mostly finds blog posts frozen in 2014-2018. Behind that veneer, Tor Browser 14.5 shipped two weeks ago, the network still serves ~2.4 million daily users, and the project's 2024 fundraising goal was met. What changed is (a) the way journalists talk about Tor, (b) a big technical migration that broke most older onion addresses, and (c) law-enforcement pressure that discouraged public link lists.
"colloquially called the apply model"
You are a an AI coding assistant, powered by tensorzero::function_name::cursorzero. You operate in Cursor You are pair programming with a U...
-
A name indicates what we seek. An address indicates where it is. A route indicates how to get there. The internet protocol deals prima...
-
Pattern Description Dogfooding: Creators use their own software heavily, improving it organically. “Thin to Thick” Clients: Many succes...
-
On March 4, 2025, the New York Times shut down its .onion site which it had launched in 2017. Is tor dead? tor published metrics speak to a ...