Astro - Hacker News

111 comments

godelski 6 hours ago ago
```
  data + plural = number
  data - plural = research
  king - crown = (didn't work... crown gets circled in red)
  king - princess = emperor
  king - queen = kingdom
  queen - king = worker
  king + queen = queen + king = kingdom
  boy + age = (didn't work... boy gets circled in red)
  man - age = woman
  woman - age = newswoman
  woman + age = adult female body (tied with man)
  girl + age = female child
  girl + old = female child
```
The other suggestions are pretty similar to the results I got in most cases. But I think this helps illustrate the curse of dimensionality (i.e. distances are ill-defined in high dimensional spaces). This is still quite an unsolved problem and seems a pretty critical one to resolve that doesn't get enough attention.
[-]
- n2d4 5 hours ago ago
  For fun, I pasted these into ChatGPT o4-mini-high and asked it for an opinion:
```
   data + plural    = datasets
   data - plural    = datum
   king - crown     = ruler
   king - princess  = man
   king - queen     = prince
   queen - king     = woman
   king + queen     = royalty
   boy + age        = man
   man - age        = boy
   woman - age      = girl
   woman + age      = elderly woman
   girl + age       = woman
   girl + old       = grandmother
```
  The results are surprisingly good, I don't think I could've done better as a human. But keep in mind that this doesn't do embedding math like OP! Although it does show how generic LLMs can solve some tasks better than traditional NLP.
  The prompt I used:
  > Remember those "semantic calculators" with AI embeddings? Like "king - man + woman = queen"? Pretend you're a semantic calculator, and give me the results for the following:
  [-]
  - franga2000 4 hours ago ago
    
    This is an LLM approximating a semantic calculator, based solely on trained-in knowledge of what that is and probably a good amount of sample output, yet somehow beating the results of a "real" semantic calculator. That's crazy!
    The more I think about it the less surprised I am, but my initial thoughts were quite simply "now way" - surely an approximation of an NLP model made by another NLP model can't beat the original, but the LLM training process (and data volume) is just so much more powerful I guess...
    
    [-]
    
    CamperBob2 4 hours ago ago
    
    This is basically the whole idea behind the transformer. Attention is much more powerful than embedding alone.
    
    [-]
    
    godelski an hour ago ago
    
    The transformers are initialized by embedding models...
    Your embedding model is literally the translation layer converting the text to numbers. The transformers are the main processing unit of the embeddings. You can even see some self-reflection in the model as the transformer is composed of attention and a MLP sub-network. The attention mechanism generates the interrelational dependence of the data and the MLP projects up into a higher dimension before coming down so that this can untangle these relationships. But the idea is that you just repeat this process over and over. The attention mechanism has the benefit over CNN models because it has a larger receptive field, so can better process long range relationships (long range being across the input data) where CNNs bias for local relationships.
  - nbardy 5 hours ago ago
    
    I hate to be pedantic, but the llm is definitely doing embedding math. In fact that’s all it does.
    
    [-]
    
    n2d4 11 minutes ago ago
    
    Sure! Although I think we both agree that the way those embeddings are transformed is significantly different ;)
    (what I meant to say is that it doesn't do embedding math "LIKE" the OP — not that it doesn't do embedding math at all.)
  - godelski an hour ago ago
    
    > The results are surprisingly good, I don't think I could've done better as a human
    I'm actually surprised that the performance is so poor and would expect a human to do much better. The GPT model has embedding PLUS a whole transformer model that can untangle the embedded structure.
    To clarify some of the issues:
    data is both singular and plural, being a mass noun[0,1]. Datum is something you'll find in the dictionary, but not common in use[2]. The dictionary lags actual definitions. I mean words only mean what we collectively agree they mean (dictionary definitely helps with that but we also invent words all the time -- i.e. slang). I see how this one could trick up a human, feeling the need to change the output and would likely consult a dictionary but I don't think that's a fair comparison here as LLMs don't have these same biases. King - crown really seems like it should be something like "man" or "person". The crown is the manifestation of the ruling power. We still use phrases like "heavy is the head that wears the crown" in reference to general leaders, not just monarchs. king - princess I honestly don't know what to expect. Man is technically gender neutral so I'll take this one. king - queen I would expect similar outputs to the previous one. Don't quite agree here. queen - king I get why is removing royalty but given the previous (two) results I think is showing a weird gender bias. Remember that queen is something like (woman + crown) and king is akin to (man + crown). So subtracting should be woman - man. The others I agree with. These were actually done because I was quite surprised at the results and was thinking about the aforementioned gender bias. > But keep in mind that this doesn't do embedding math like OP!
    I think you are misunderstanding the architecture of these models. The embedding sub-network is the translation of text to numeric tokens. You'll find mention of the embedding sub-networks in both the GPT3[3] and GPT4 papers. Though they are given lower importance than other works. While much smaller than the main network, don't forget that embedding networks are still quite large. For the smaller models they constitute a significant part of the total parameter count[4]
    After the embedding sub-network is your main transformer network. The purpose of this network is to perform embedding math! It is just that the goal is to do significantly more complicated math. Remember, these are learnable mappings (see Optimal Transport). We're just breaking it down into their two main intermediate mappings. But the embeddings still end up being a bottleneck. It is your literal gateway from words to numbers.
    [0] https://en.wikipedia.org/wiki/Mass_noun
    [1] https://www.merriam-webster.com/dictionary/data
    [2] https://www.sciotoanalysis.com/news/2023/1/18/this-data-or-t...
    [3] https://arxiv.org/abs/2005.14165
    [4] https://arxiv.org/abs/2303.08774
    [4] https://www.lesswrong.com/posts/3duR8CrvcHywrnhLo/how-does-g...
    
    [-]
    
    n2d4 15 minutes ago ago
    
    You are being unnecessarily cynical. These are all subjective. I thought "datum" and "datasets" was quite clever, and while I would've chosen "man" for "king - crown" myself, I actually find "ruler" a better solution after seeing it. But each to their own.
    The rant about network architecture misses my point, which is that an LLM does not just do a linear transformation and a similarity search. Sure, in the most abstract sense it still just computes an output embedding from two input embeddings, but only in a very distant, pedantic way. (Actually, to be VERY pedantic, that would not even be true, because ChatGPT's tokenizer embeds tokens, not words. The in- and output of the model is more than just the semantic embedding of words; using two different but semantically equivalent words may result in different outputs with a transformer LLM, but not in a word semantics model.)
    I just thought it was cool that ChatGPT is so good at it.
  - refulgentis 3 hours ago ago
    
    ...welcome to ChatGPT, everyone! If you've been asleep since...2022?
    (some might say all an LLM does is embeddings :)
- Affric 6 hours ago ago
  
  Yeah I did similar tests and got similar results.
  Curious tool but not what I would call accurate.
- mathgradthrow 4 hours ago ago
  
  Distance is extremely well defined in high dimensional spaces. That isn't the problem.
- gweinberg 5 hours ago ago
  
  I got a bunch of red stuff also. I imagine the author cached embeddings for some words but not really all that many to save on credits. I gave it mermaid - woman and got merman, but when I tried to give it boar + woman - man or ram + woman - man, it turns out it has never heard of rams or boars.
- thatguysaguy 5 hours ago ago
  
  Can you elaborate on what the unsolved problem you're referring to is?
montebicyclelo 6 hours ago ago

> king-man+woman=queen
Is the famous example everyone uses when talking about word vectors, but is it actually just very cherry picked?
I.e. are there a great number of other "meaningful" examples like this, or actually the majority of the time you end up with some kind of vaguely tangentially related word when adding and subtracting word vectors.
(Which seems to be what this tool is helping to illustrate, having briefly played with it, and looked at the other comments here.)
(Btw, not saying wordvecs / embeddings aren't extremely useful, just talking about this simplistic arithmetic)
[-]
- jbjbjbjb 4 hours ago ago
  
  Well when it works out it is quite satisfying
  India - Asia + Europe = Italy
  Japan - Asia + Europe = Netherlands
  China - Asia + Europe = Soviet-Union
  Russia - Asia + Europe = European Russia
  calculation + machine = computer
  [-]
  - trhway an hour ago ago
    
    democracy - vote = progressivism
    I'll have to mediate on that.
- gregschlom 5 hours ago ago
  
  Also, as I just learned the other day, the result was never equal, just close to "queen" in the vector space.
  [-]
  - charcircuit 2 hours ago ago
    
    And queen isn't even the closest.
    
    [-]
    
    mcswell 2 hours ago ago
    
    What is the closest?
    
    [-]
    
    charcircuit 2 minutes ago ago
    
    Usually king is.
- Retr0id 5 hours ago ago
  
  I think it's slightly uncommon for the vectors to "line up" just right, but here are a few I tried:
  actor - man + woman = actress
  garden + person = gardener
  rat - sewer + tree = squirrel
  toe - leg + arm = digit
- raddan 6 hours ago ago
  
  > is it actually just very cherry picked?
  100%
- bee_rider 3 hours ago ago
  Hmm, well I got
```
    cherry - picker = blackwood
```
  if that helps.
- groby_b 5 hours ago ago
  
  I think it's worth keeping in mind that word2vec was specifically trained on semantic similarity. Most embedding APIs don't really give a lick about the semantic space
  And, worse, most latent spaces are decidedly non-linear. And so arithmetic loses a lot of its meaning. (IIRC word2vec mostly avoided nonlinearity except for the loss function). Yes, the distance metric sort-of survives, but addition/multiplication are meaningless.
  (This is also the reason choosing your embedding model is a hard-to-reverse technical decision - you can't just transform existing embeddings into a different latent space. A change means "reembed all")
antidnan 7 hours ago ago

Neat! Reminds me of infinite craft
https://neal.fun/infinite-craft/
[-]
- thaumasiotes 5 hours ago ago
  
  I went to look at infinite craft.
  It provides a panel filled with slowly moving dots. Right of the panel, there are objects labeled "water", "fire", "wind", and "earth" that you can instantiate on the panel and drag around. As you drag them, the background dots, if nearby, will grow lines connecting to them. These lines are not persistent.
  And that's it. Nothing ever happens, there are no interactions except for the lines that appear while you're holding the mouse down, and while there is notionally a help window listing the controls, the only controls are "select item", "delete item", and "duplicate item". There is also an "about" panel, which contains no information.
  [-]
  - n2d4 5 hours ago ago
    
    In the panel, you can drag one of the items (eg. Water) onto another one (eg. Earth), and it will create a new word (eg. Plant). It uses AI, so it goes very deep
    
    [-]
    
    thaumasiotes 5 hours ago ago
    
    No, that was the first thing I tried. The only thing that happens is that the two objects will now share their location. There are no interactions.
    
    [-]
    
    n2d4 5 hours ago ago
    
    Probably a bug then, you can check YouTube to find videos of people playing it (eg. [0])
    [0] https://youtu.be/8-ytx84lUK8
spindump8930 7 hours ago ago

First off, this interface is very nice and a pleasure to use, congrats!
Are you using word2vec for these, or embeddings from another model?
I also wanted to add some flavor since it looks like many folks in this thread haven't seen something like this - it's been known since 2013 that we can do this (but it's great to remind folks especially with all the "modern" interest in NLP).
It's also known (in some circles!) that a lot of these vector arithmetic things need some tricks to really shine. For example, excluding the words already present in the query[1]. Others in this thread seem surprised at some of the biases present - there's also a long history of work on that [2,3].
[1] https://blog.esciencecenter.nl/king-man-woman-king-9a7fd2935...
[2] https://arxiv.org/abs/1905.09866
[3] https://arxiv.org/abs/1903.03862
[-]
- nxa 6 hours ago ago
  
  Thank you! I actually had a hard time finding prior work on this, so I appreciate the references.
  The dictionary is based on https://wordnet.princeton.edu/, no word2vec. It's just a plain lookup among precomputed embeddings (with mxbai-embed-large). And yes, I'm excluding words that are present in the query because.
  It would be interesting to see how other models perform. I tried one (forgot the name) that was focused on coding, and it didn't perform nearly as well (in terms of human joy from the results).
  [-]
  - kaycebasques 6 hours ago ago
    
    (Question for anyone) how could I go about replicating this with Gemini Embedding? Generate and store an embedding for every word in the dictionary?
    
    [-]
    
    nxa 6 hours ago ago
    
    Yes, that's pretty much what it is. Watch out for homographs.

lcnPylGDnU4H9OF 6 hours ago ago

Some of these make more sense than others (and bookshop is hilarious even if it's only the best answer by a small margin; no shade to bookshop owners).

  map - legend = Mercator projection
  noodle - wheat = egg noodle
  noodle - gluten = tagliatelle
  architecture - calculus = architectural style
  answer - question = comment
  shop - income = bookshop
  curry - curry powder = cuisine
  rice - grain = chicken and rice
  rice + chicken = poultry
  milk + cereal = grain
  blue - yellow = Fiji
  blue - Fiji = orange
  blue - Arkansas + Bahamas + Florida - Pluto = Grenada

maxcomperatore an hour ago ago

Just use a LLM api to generate results, it will be far better and more accurate than a weird home cooked algorithm
lightyrs 7 hours ago ago
I don't get it but I'm not sure I'm supposed to.
```
    life + death = mortality
    life - death = lifestyle

    drug + time = occasion
    drug - time = narcotic

    art + artist + money = creativity
    art + artist - money = muse

    happiness + politics = contentment
    happiness + art      = gladness
    happiness + money    = joy
    happiness + love     = joy
```
[-]
- bee_rider 6 hours ago ago
```
    Life + death = mortality  
```
  is pretty good IMO, it is a nice blend of the concepts in an intuitive manner. I don’t really get
```
   drug + time = occasion
```
  But
```
   drug - time = narcotic
```
  Is kind of interesting; one definition of narcotic is
  > a drug (such as opium or morphine) that in moderate doses dulls the senses, relieves pain, and induces profound sleep but in excessive doses causes stupor, coma, or convulsions
  https://www.merriam-webster.com/dictionary/narcotic
  So we can see some element of losing time in that type of drug. I guess? Maybe I’m anthropomorphizing a bit.
- grey-area 7 hours ago ago
  
  Does the system you’re querying ‘get it’? From the answers it doesn’t seem to understand these words or their relations. Once in a while it’ll hit on something that seems to make sense.
__MatrixMan__ 5 hours ago ago

Here's a challenge: find something to subtract from "hammer" which does not result in a word that has "gun" as a substring. I've been unsuccessful so far.
[-]
- mrastro 5 hours ago ago
  
  The word "gun" itself seems to work. Package this as a game and you've got a pretty fun game on your hands :)
- aniviacat 4 hours ago ago
  
  Gun related stuff works: bullet, holster, barrel
  Other stuff that works: key, door, lock, smooth
  Some words that result in "flintlock": violence, anger, swing, hit, impact
- neom 5 hours ago ago
  
  if I'm allowed only 1 something, I can't find anything either, if I'm allowed a few somethings, "hammer - wine - beer - red - child" will get you there. Guessing given that a gun has a hammer and is also a tool, it's too heavily linked in the small dataset.
- Retr0id 5 hours ago ago
  
  Well that's easy, subtract "gun" :P
- downboots 5 hours ago ago
  
  Bullet
- soxfox42 3 hours ago ago
  
  hammer - red = lock
- tough 5 hours ago ago
  
  hammer + man = adult male body (75%)
  [-]
  - rdlw 4 hours ago ago
    
    Close, that's addition
jumploops 6 hours ago ago

This is super neat.
I built a game[0] along similar lines, inspired by infinite craft[1].
The idea is that you combine (or subtract) “elements” until you find the goal element.
I’ve had a lot of fun with it, but it often hits the same generated element. Maybe I should update it to use the second (third, etc.) choice, similar to your tool.
[0] https://alchemy.magicloops.app/
[1] https://neal.fun/infinite-craft/
grey-area 7 hours ago ago

As you might expect from a system with knowledge of word relations but without understanding or a model of the world, this generates gibberish which occasionally sounds interesting.
rdlw 4 hours ago ago

I've always wondered if there's s way to find which vectors are most important in a model like this. The gender vector man-woman or woman-man is the one always used in examples, since English has many gendered terms, but I wonder if it's possible to generate these pairs given the data. Maybe to list all differences of pairs of vectors, and see if there are any clusters. I imagine some grammatical features would show up, like the plurality vector people-person, or the past tense vector walked-walk, but maybe there would be some that are surprisingly common but don't seem to map cleanly to an obvious concept.
Or maybe they would all be completely inscrutable and man-woman would be like the 50th strongest result.
nxa 6 hours ago ago

This might be helpful: I haven't implemented it in the UI, but from the API response you can see what the word definitions are, both for the input and the output. If the output has homographs, likeliness is split per definition, but the UI only shows the best one.
Also, if it gets buried in comments, proper nouns need to be capitalized (Paris-France+Germany).
I am planning on patching up the UI based on your feedback.
GrantMoyer 3 hours ago ago

These are pretty good results. I messed around with a dumber and more naive version of this a few years ago[1], and it wasn't easy to get sensinble output most of the time.
[1]: https://github.com/GrantMoyer/word_alignment
galaxyLogic 3 hours ago ago

What about starting with the result and finding set of words that when summed together give that result?
That could be seen as trying to find the true "meaning" of a word.
skeptrune 7 hours ago ago

This is super fun. Offering the ranked matches makes it significantly more engaging than just showing the final result.
ericdiao 6 hours ago ago

Interesting: parent + male = female (83%)
Can not personally find the connection here, was expecting father or something.
[-]
- ericdiao 6 hours ago ago
  
  Though dad is in the list with lower confidence (77%).
  High dimension vector is always hard to explain. This is an example.
afandian 6 hours ago ago

There was a site like this a few years ago (before all the LLM stuff kicked off) that had this and other NLP functionality. Styling was grey and basic. That’s all I remember.
I’ve been unable to find it since. Does anyone know which site I’m thinking of?
[-]
- halter73 5 hours ago ago
  
  I'm not sure this is old enough, but could you be referencing https://neal.fun/infinite-craft/ from https://news.ycombinator.com/item?id=39205020?
e____g 2 hours ago ago

man - intelligence = woman (36%)
woman + intelligence = man (77%)
Oof.
Jimmc414 4 hours ago ago

dog - cat = paleolith
paleolith + cat = Paleolithic Age
paleolith + dog = Paleolithic Age
paleolith - cat = neolith
paleolith - dog = hand ax
cat - dog = meow
Wonder if some of the math is off or I am not using this properly
yigitkonur35 4 hours ago ago

shows how bad embeddings are in a practical way
neom 5 hours ago ago

cool but not enough data to be useful yet I guess. Most of mine either didn't have the words or were a few % off the answer, vehicle - road + ocean gave me hydrosphere, but the other options below were boat, ship, etc. Klimt almost made it from Mozart - music + painting. doctor - hospital + school = teacher, nailed it.
Getting to cornbread elegantly has been challenging.
downboots 4 hours ago ago

mathematics - Santa Claus = applied mathematics
hacker - code = professional golf
ericdiao 6 hours ago ago

wine - alcohol = grape juice (32%)
Accurate.
cabalamat 7 hours ago ago

What does it mean when it surrounds a word in red? Is this signalling an error?
[-]
- iambateman 7 hours ago ago
  
  Try Lower casing, my phone tried to capitalize and it was a problem.
- fallinghawks 7 hours ago ago
  
  Seems to be a word not in its dictionary. Seems to not have any country or language names.
  Edit: these must be capitalized to be recognized.
- nxa 7 hours ago ago
  
  Yes, word in red = word not found mostly the case when you try plurals or non-nouns (for now)
  [-]
  - rpastuszak 7 hours ago ago
    
    This is neat!
    I think you need to disable auto-capitalisation because on mobile the first word becomes uppercase and triggers a validation error.
matallo 6 hours ago ago

uncle + aunt = great-uncle (91%)
great idea, but I find the results unamusing
[-]
- HWR_14 5 hours ago ago
  
  Your aunt's uncle is your great-uncle. It's more correct than your intuition.
  [-]
  - matallo 5 hours ago ago
    
    I asked ChatGPT (after posting my comment) and this is the response. "Uncle + Aunt = Great-Uncle is incorrect. A great-uncle is the brother of your grandparent."
nikolay 7 hours ago ago
Really?!
```
  man - brain = woman
  woman - brain = businesswoman
```
[-]
- nxa 7 hours ago ago
  
  I probably should have prefaced this with "try at your own risk, results don't reflect the author's opinions"
  [-]
  - dmonitor 5 hours ago ago
    
    I'm sure it would be trivial to get it to say something incredibly racist, so that's probably a worthwhile disclaimer to put on the website
- dalmo3 7 hours ago ago
  
  I think subtraction is broken. None of what I tried made any sense. Water - oxygen = gin and tonic.
- sapphicsnail 7 hours ago ago
  
  Telling that Jewess, feminist, and spinster were near matches as well.
- karel-3d 7 hours ago ago
  
  woman+penis=newswoman (businesswoman is second)
  man+vagina=woman (ok that is boring)
- 2muchcoffeeman 7 hours ago ago
  
  Man - brain = Irish sea
  [-]
  - nikolay 7 hours ago ago
    
    Case matters, obviously! Try "man" with a lower-case "M"!
    
    [-]
    
    Alifatisk 7 hours ago ago
    
    Why does case matter? How does it affect the meaning?
    
    [-]
    
    bfLives 7 hours ago ago
    
    “Man” is probably being interpreted as the Isle of Man.
    https://en.m.wikipedia.org/wiki/Isle_of_Man
    
    G1N 7 hours ago ago
    
    Man (capital M) is probably being interpreted as some proper noun, maybe Isle of Man in this case?
nxa 3 hours ago ago

artificial intelligence - bullsh*t = computer science (34%)
[-]
- behnamoh 3 hours ago ago
  
  This. I'm tired of so many "it's over, shocking, game changer, it's so over, we're so back" announcements that turn out to be just gpt-wrappers or resume-builder projects.
  Very few papers that actually say something meaningful are left unnoticed, but as soon as you say something generic like "language models can do this", it gets featured in "AI influencer" posts.
fallinghawks 7 hours ago ago

goshawk-cocaine = gyrfalcon , which is funny if you know anything about goshawks and gyrfalcons
(Goshawks are very intense, gyrs tend to be leisurely in flight.)
MYEUHD 6 hours ago ago

king - man + woman = queen
queen - woman + man = drone
[-]
- bee_rider 6 hours ago ago
  
  The second makes sense, I think, if you are a bee.
kylecazar 5 hours ago ago

Woman + president = man
firejake308 7 hours ago ago

King-man+woman=Navratilova, who is apparently a Czech tennis player. Apparently, it's very case-sensitive. Cool idea!
[-]
- fph 7 hours ago ago
  
  "King" (capital) probably was interpreted as https://en.wikipedia.org/wiki/Billie_Jean_King , that's why a tennis player showed up.
  [-]
  - nxa 7 hours ago ago
    
    when I first tried it, king was referring to the instrument and I was getting a result king-man+woman=flute ... :-D
  - BeetleB 7 hours ago ago
    
    Heh. This is fun:
    Navratilova - woman + man = Lendl
zerof1l 7 hours ago ago

male + age = female
female + age = male
tlhunter 5 hours ago ago

man + woman = adult female body
blobbers 6 hours ago ago

rice + fish = fish meat
rice + fish + raw = meat
hahaha... I JUST WANT SUSHI!
downboots 5 hours ago ago

three + two = four (90%)
[-]
- LadyCailin 5 hours ago ago
  
  Haha, yes, this was my first thought too. It seems it’s quite bad at actual math!
7373737373 7 hours ago ago

it doesn't know the word human
quantum_state 4 hours ago ago

The app produces nonsense ... such as quantum - superposition = quantum theory !!!
TZubiri 6 hours ago ago

I'm getting Navralitova instead of queen. And can't get other words to work, I get red circles or no answer at all.
[-]
- gus_massa 6 hours ago ago
  
  From another comment, https://news.ycombinator.com/item?id=43988861 King (with capital K) was a top 1 male tenis player.
adzm 7 hours ago ago

noodle+tomato=pasta
this is pretty fun
[-]
- growlNark 7 hours ago ago
  
  Surely the correct answer would be `pasta-in-tomato-sauce`? Pasta exists outside of tomato sauce.
kataqatsi 6 hours ago ago

garden + sin = gardening
hmm...
G1N 7 hours ago ago

twelve-ten+five=
six (84%)
Close enough I suppose
woodruffw 7 hours ago ago

colorless+green+ideas doesn't produce anything of interest, which is disappointing.
[-]
- dmonitor 5 hours ago ago
  
  well green is not a creative color, so that's to be expected
ezbie 6 hours ago ago

Can someone explain me what the fuck this is supposed to be!?
[-]
- mhitza 5 hours ago ago
  
  Semantical subtraction within embeddings representation of text ("meaning")