Willow

0:00

Oh shit.

0:02

Oops.

0:03

Okay.

0:04

Wow.

0:05

Okay.

0:06

Let's get going.

0:07

Um, I'm going to go on my snack.

0:10

I got so excited.

0:13

I'm going to go on my snack.

0:15

I got so excited.

0:17

Okay.

0:18

Oh shit.

0:19

Oops.

0:20

Okay.

0:21

Wow.

0:22

Okay.

0:23

Let's get going.

0:24

Um, I'm going to go on my snack.

0:26

I got so excited.

0:27

Okay.

0:28

And a little bit.

0:29

Yeah.

0:30

You're going to ask me to introduce myself and my project.

0:32

I think that I'm going to be awkward, right?

0:34

But it'd be so fun.

0:35

I didn't mean to be thinking.

0:36

It's been recording for like, two years.

0:38

I'm going to be like,

0:40

I didn't mean to be thinking.

0:41

I didn't mean to be thinking.

0:42

It's been recording for like,

0:43

fifty minutes though.

0:44

Yeah.

0:45

I know.

0:46

I know what you've been doing.

0:47

Okay.

0:48

Are you concerned?

0:49

Mmm.

0:50

Wait.

0:51

I know you've been preparing me and teasing me into it.

0:52

I've done a marvelous job.

0:53

But I have been aware of it.

1:01

Okay.

1:02

Yeah.

1:03

Yeah.

1:04

Yeah.

1:05

Yeah.

1:06

Yeah.

1:07

Yeah.

1:08

Yeah.

1:09

Yeah.

1:10

Yeah.

1:11

Yeah.

1:12

Yeah.

1:13

Yeah.

1:14

Yeah.

1:15

Yeah.

1:16

Yeah.

1:17

Yeah.

1:18

Yeah.

1:19

Yeah.

1:20

Yeah.

1:21

Yeah.

1:22

And the music you just heard was a creation by our guest today, which is Aliyosha from the Willow Project.

1:31

Throughout the episode, you might hear a few more tunes gathered here and there.

1:38

And if you want to hear more, just check out their website, worm-blossom.org, which they

1:44

host together with Sammy.

1:47

Today, we will be hearing a bit of a more, I guess, Aliyosha called it theoretical computer

1:52

science.

1:53

Take on these kind of protocols and specifically a walkthrough of Willow and what makes Willow

2:02

particularly secure compared to most other protocols in the scene.

2:07

So with no further ado, let's dive in.

2:23

Thank you so much for joining me here today, Aliyosha.

2:45

We have been chatting away for a solid 20 minutes already.

2:52

And now it's time for the official podcast.

3:00

So you and I have known each other for quite a few years now.

3:05

True.

3:06

And it started in the, I want to say infamous, but it's not very infamous.

3:11

It's more like famous, but it's also not famous.

3:13

It should be infamous.

3:14

And what we're talking about is of course, it's got a lot.

3:21

For those who have no clue what's got a lot is, do you want to give a quick debrief on

3:27

the sailor's network?

3:29

Oh, no.

3:30

You get an unusual debrief on that if I'm the one to do it.

3:36

I'm sure I can give it a try.

3:38

So in a year that might have been 2015 or something, a person in New Zealand named Dominic

3:46

Tar decided that the world needs better technology for certain things, including social networking.

3:53

I mean, actually you wanted to build a package manager, which is something I can deeply relate

3:56

to, but it got derailed a bit.

4:00

So secure scatterbout is this protocol by which people or rather their computers can

4:07

exchange information in a way that is not dependent on their always being a direct

4:11

internet connection available.

4:13

So Dominic basically lived on a sailboat and didn't have internet for very long periods

4:20

of time, but that shouldn't stop him from just writing down stuff like in a diary and

4:28

maybe later just synchronizing it with other peers.

4:30

That's what scatterbout is essentially.

4:32

You write a kind of diary of entries and you tell your computer who your friends are and

4:39

then the data is automatically exchanged between you whenever there's connectivity and crucially

4:45

that connectivity doesn't need to be direct.

4:48

So if I wanted to see Dominic's post, I wouldn't need to connect to Dominic directly, but rather

4:53

I would just need to find anyone else who already had Dominic's post and then I would

4:56

be able to receive them and also verify that they had not been altered, that they really

5:02

were what Dominic had intended.

5:04

But why not just use internet though?

5:08

Why it works over the internet potentially.

5:12

What's the difference?

5:13

Why don't you just like Facebook message someone or something?

5:18

Right.

5:19

Right.

5:21

So it might come as a bit of surprise, but I'm not terribly fond of Facebook and neither

5:31

was Dominic Tarr.

5:32

So hi, come on.

5:35

Because you actually gone quite far with that.

5:43

Do you have a smartphone?

5:47

I do.

5:48

You do.

5:49

Yes.

5:50

I've never seen you use one.

5:52

That's correct because you've never seen me being controlled whether I have a ticket

5:57

in public transport in Berlin.

5:59

That's why I have a smartphone.

6:02

Oh, so it's kind of forced into the system aspect.

6:06

Yep.

6:07

Yeah.

6:08

Makes sense.

6:09

But yeah, it touches on the whole bigger picture of things though.

6:16

And while we met on Scolobot, you are currently working on a project called Willow, which

6:26

is the next generation of these kind of networks that would work even if you're out

6:31

sailing on a sailboat and you don't have access to internet but want to communicate with your

6:35

friends.

6:39

So first of all, if we skip the whole Y of like, okay, Facebook is horrible.

6:48

Surveyor of the state, society, you know, of these things.

6:51

I knew this wasn't Blizzard and I wouldn't have to say it.

7:01

Yeah, they are.

7:03

But if we then dive instead straight into Y Willow, because that's not as implicit.

7:12

Fair.

7:13

Why did you start Willow when Scolobot was around?

7:19

So first of all, it's difficult for me to answer that because I never sat down and said,

7:26

these are the three reasons that make me want to do this thing, right?

7:31

Instead, it's a whole set of feelings and drives that lead into putting the time into

7:40

this.

7:41

But as far as I have managed to identify, like career and concepts in what is driving me,

7:47

I would say the two primary ones are the fragility of the default way of going about it and a

7:57

sort of carelessness toward users, where tools can be turned against the users.

8:03

And the fragility aspect of it, that's what we just, you know, move to the sidelines as

8:09

being implicit in Facebook's bed for multiple reasons.

8:15

But I mean-

8:16

So when you say fragility, if I'm translating it into my own terms, just to understand, does

8:21

that mean like security risks?

8:24

And like, or what do you mean with fragility?

8:27

Not necessarily security, but just we are dependent on certain information technology,

8:35

right?

8:36

We treat it as infrastructure and we treat it as being there and working and stuff starts

8:41

breaking in the real world once computers start breaking, which is very wild if you consider

8:47

the role that computer plays 40 years ago.

8:50

And this fragility, like one of the most basic aspects of course is just when Dominic

8:55

on his boat doesn't have an internet connection, he cannot send stuff to the Facebook server.

9:00

So that's not going to work.

9:02

So in that sense, it's very fragile, it depends on always available internet connection, right?

9:07

But then also, it's fragile at really every single part and every single link of the chain

9:16

that leads from a sending message to be receiving message, right?

9:20

It goes through a single server that server might be hacked, there might be government

9:25

regulation that forces it to shut it down.

9:31

And the idea of these local first peer-to-peer projects that kind of unifies the whole space,

9:40

I think, is to combat that fragility and to replace it with a project that actually

9:47

more inspired by nature, I would say, where there's a lot of like optimistically trying

9:52

things out in many directions at the same time, right?

9:55

Which can be as simple as optimistically replicating the data across many machines instead of just

10:00

a single direct path, right?

10:03

It's very natural for computer scientists to say, let's find the one shortest path and

10:07

send data that way because that's the most efficient.

10:09

It's also the most fragile thing you could possibly do.

10:12

All the peer-to-peer projects or most of them are going away that tries to avoid this

10:20

fragility, right?

10:21

And in the infamous, hopefully, Scuttlebutt, especially in the early days, there was the

10:27

saying of no global singletons and rejecting every entity, every concept of which there

10:35

would be only one and on which you would be dependent, which is not...

10:40

What's like...

10:41

Okay, so I know singletons is like...

10:44

Because we talked about this in previous podcasts, in example, a singleton, like in one way one

10:50

could say a centralized server, but in another way one could also say DHT, right?

10:55

Like a distributed hash table.

10:56

Exactly, or a blockchain, right?

10:59

So there is kind of shades where the peer-to-peer projects position themselves along this axis

11:07

of how thoroughly do you avoid global singletons.

11:10

You can...

11:11

I mean, you can go pretty far.

11:13

You can say there's only one internet.

11:15

You shouldn't use the internet.

11:17

You should use any kind of available network.

11:21

That's a rather extreme way of taking things, but it's also kind of sensible, I would say,

11:27

it makes sense to not hard code dependence on the internet and to your protocol.

11:31

Yeah.

11:32

Right, and like...

11:34

And this is why we're those, right?

11:37

Well, that's one side of it, but really if all I cared about was not having global singletons,

11:45

then I was to be unscattered but, because scattered but is probably the most pure experiment

11:54

in this regard that actually gained traction.

11:59

But then the other aspect that I mentioned beyond just fragility would be, well, essentially,

12:05

the ability to webinise the network, the system, the tools against its users.

12:11

And this is what you mentioned with Care for People.

12:15

Yes.

12:16

Yeah.

12:18

So rather than being enabling webinisation of the network against its users, you're trying

12:27

to take an approach of Care for People.

12:29

But what does that mean in practice when you're building a protocol?

12:32

Because it's not like, is it more heart emojis?

12:36

Maybe you should about that as a strategy.

12:41

You know, there's at least five differently coloured heart emojis, right?

12:45

And so you've just encode everything using different colours of the heart emerging.

12:49

We might be onto something.

12:51

It's not exactly human readable, but as a human, whatever you look at, you get a good vibe.

12:56

But we are actually slightly onto something, because if one looks at your website, like

13:01

Willow, and you're also a personal website, is it worm.blossom?

13:08

Worm-blossom.org.

13:10

Worm-blossom.org.

13:13

And where you have this expression, which is also very much thanks to your co-creator,

13:25

Sammy.

13:26

And she is a fantastic cartoon artist.

13:30

And so it's a little protocol.

13:33

Comic artist.

13:35

Oh, comic artist.

13:37

Sammy would yell at me if I didn't jump in here right now.

13:39

What's the name?

13:42

You'd have to ask Sammy that.

13:44

But let's just call her an illustrator.

13:47

Okay, illustrator.

13:48

Well, she's a fantastic illustrator, and like the whole Willow protocol is very well documented,

13:54

one of the best of its kind.

13:56

Which is something we could talk about, and we will talk about pretty soon.

14:02

But is this also part of the care you're talking about?

14:08

No, and yes.

14:10

Okay.

14:13

I don't know.

14:14

Feelings are mushy things.

14:16

Okay, okay, this was actually before.

14:19

I slightly interrupted you by raising my hand.

14:23

And that was because you keep coming back to your feelings.

14:28

And me as knowing you, you have a lot of feelings, of course, as anyone does.

14:35

But your feelings are often seemingly, intuitively connected to deeper thought.

14:43

Because something that the listener might not know,

14:46

you are not pure developer TM.

14:49

You're actually like a mathematician, right?

14:52

Well, technically no, but...

14:54

And a musician.

14:56

Well, technically no, but...

14:58

Yeah.

15:03

Actually, no, I shouldn't do that.

15:04

Yes, I said identify as a musician.

15:07

Yeah.

15:07

And screw labels.

15:09

So yes, I'm a musician, damn it.

15:11

Yes, you are.

15:13

But no, I like the proper training.

15:15

Like, I don't know enough analysis, and not enough topology to call myself a mathematician.

15:21

I'm sorry.

15:23

Yes, the maths computer scientist I've taken a fairly theory heavy

15:27

and maths laden route.

15:28

Let's phrase it like that.

15:29

Let's phrase it like that then.

15:31

But when you talk about feelings,

15:34

are those feelings maybe also connected to,

15:39

okay, I'm trying to remember the terminology we just used,

15:41

but like these theory-laden perspectives that you also hold?

15:47

That's a good question.

15:49

Like, I've...

15:50

I've always been very guided, both in which things I'm exploring

15:59

and in which shape that exploration takes,

16:03

by a sense of aesthetics almost.

16:05

Like, seeing solutions that are not beautiful

16:11

makes me want to replace them.

16:14

Which is both a strength and a curse in some sense.

16:19

It's a strength because it leaves me dissatisfied with things that

16:27

many other people who don't then develop their own protocols from scratch

16:33

will just accept.

16:36

And I do think the problems are real that we can identify there.

16:42

So it's good to have a sort of innate motivation to try to improve on things.

16:50

Right, so that's how I would say it's a strength.

16:54

And I guess it's a weakness in two senses.

16:56

One, purely in the conventional academic slash mathematical sense.

17:03

That's not a valid argument.

17:07

Right, so communicating stuff needs a different kind of work.

17:14

And two also, sometimes reality is just messy,

17:17

and I need to stop myself from considering messiness as unappealing or unaesthetic.

17:25

But I think this is also...

17:26

Okay, this is a weird maybe relation to Mink,

17:29

but there's this anime called Orb.

17:33

You've never heard of it.

17:34

No, it's a quite nice anime I would say.

17:38

But the topic of the anime is about mathematicians who started

17:47

looking at, is it called the heuristic bottle?

17:51

The basically that the sun is in the center and the earth is rotating around the sun,

17:57

which was like a thing only have herrhetics would say back in the day.

18:02

Oh, is it a place of Poland?

18:06

Good question.

18:07

I don't actually know where geographically it is, but one of the perspectives that I

18:14

keeps coming back to is that these mathematicians that were looking at

18:21

what convinced them was the truth, even though they had enjoyed getting killed for it,

18:26

because they were saying something that was going against the word of God, so to speak.

18:33

They kept coming back to the sense of beauty and cleanness,

18:38

and that's something that me, I've never been able to relate to,

18:41

because I'm not a mathematician, and I haven't dove deep into that space.

18:49

But I'm also understanding that as an artist, I'm not like someone who speaks the language of math,

18:56

so to speak.

18:59

There is a sense of an a beauty that isn't necessarily a rejection of messiness,

19:04

but it can also be the beauty of nature.

19:11

In many ways.

19:14

I mean, I do know that being guided by a sense of beauty is fairly common amongst mathematicians.

19:20

Like, I'm not the weird one out there, at least not amongst mathematicians.

19:26

There is a beautiful piece of writing called Lockhart's Lament, which does a good job of conveying

19:34

the beauty in mathematics to non-mathematicians.

19:38

So I'll ask you to put a link to that into the description.

19:42

Also, how did we get here?

19:44

It feels like a vast change of it.

19:47

The starting point of this was how you're weaving in illustrations and weaving in your own music is

19:57

available on word.org, or word slash blossom.org.

20:02

And you're taking this approach to your work that is rooted in care for people.

20:10

And care for people is multifaceted.

20:15

Yeah, and accordingly, I can take my answer to very many different directions.

20:22

Let's start with the answer that doesn't lead us immediately back to where we actually started

20:26

from, but let's go somewhere completely different.

20:28

Not completely different.

20:30

So the comics and the music and the playful updates are a form of self-defense.

20:40

Because working on this stuff is difficult and it takes discipline and it takes a lot of time

20:47

that you cannot put into other things.

20:50

And especially if you are on a deadline due to grant work, for example, and you need to just

20:55

deliver this feature because food costs money in our society.

21:01

I had that that takes us tall and like we've reached multiple times points or at the very least

21:13

I have reached points where it became very difficult for me to actually do the work

21:16

despite having obligations that we should do the work.

21:21

So at some point, Sam and I sat together virtually in a call and talked about what we might do

21:31

to help us with that and to bring more joy into the process.

21:35

And this started with me suggesting let's do a dev diary, which is actually a tradition

21:42

we took from Scuttlebutt, where people would just post regular updates on which work they

21:48

have been doing.

21:49

And so we never developed this concept of let's just do weekly updates about everything we've done.

21:59

And it's funny. So I proposed that and Sammy was not amused. She didn't like the idea at all

22:08

because it was adding more work, which was not the work we were supposed to be doing.

22:14

There were other factors involved.

22:15

But let's leave it at that.

22:17

And so she took one day or two days of mulling it over and then she came back to me and said,

22:23

okay, we can do this, but then we have to do it right, which means making it exuberant and

22:31

actually expressing stuff beyond just the updates.

22:37

And then it was clear from the very start that she would be drawing effectively a webcomic.

22:43

And then that I would be doing background music for it.

22:47

So that's where that started. And then we ended this period of just being highly

23:15

motivated. And we used our own build framework for generating the website and we built that

23:21

we kept building on it. And that led into a very virtuous cycle where I at least suddenly wanted

23:31

to get work done so that I could post about it because it felt good. And also just having

23:38

the weekly update to work to work across the week.

23:42

It gave you a bunch of mini deadlines, which are great for motivation.

23:46

So that's how that came about. And then people seemed to like it,

23:52

which was also like when we realized, oh, there's people reading this. This is weird.

23:59

And there's people interacting with us because of this. Like we have a community, it's on Discord.

24:05

For now, it's obviously going to migrate to a Willow alternative. But no, people are showing

24:12

up there. And then we did this experiment of posting a little help wanted snippets where we'll say,

24:20

hey, here's an easy task that we could do. But also someone else could do it. And people started

24:27

doing these. So yeah, the whole site just like this website. It's been a good thing.

24:40

And it's still fun to do. And we're still dreaming up stuff we want to do with it. And then need to

24:47

balance that with the reality of putting too much time to the website is also maybe a bit

24:52

of an unwise decision. But yeah, so that's a huge tangent on why there's drawings and music

25:00

on our weekly devlog website. But now to come back to the other aspect of care for users. I mean,

25:12

it's parents. Basically, we have a responsibility. We design protocols that we want to be used by

25:20

non-technical people. We don't want people to use an application because it's built on top of

25:24

Willow. We want people to use an application because it's a good application. And ideally,

25:32

that application is built on Willow. And thus, the users inherit certain benefits. They're not

25:36

necessarily aware of. And hopefully they will never need to be aware of them even. It should be

25:43

invisible and getting off point. Well, actually, that's a great timing of you getting off point.

25:51

And because I'm a little bit curious here, because you mentioned that the user would inherit some

25:59

specific benefits without necessarily having to be aware of what specific benefits they're getting

26:06

from using an application that's running over Willow. But I'm curious here, like,

26:14

when we started this conversation, they were comparing with Scuttlebutt. But not everyone

26:18

was Scuttlebutt. So just to start off, what are the benefits of having an application over Willow?

26:29

What are the different qualities that come with that?

26:37

How does it differ from making a regular web application? You know what I mean?

26:43

I mean, regular web applications are fine except they're fragile. I think the more interesting

26:49

part is once you leave all that fragile stuff behind and move into the peer-to-peer world,

26:54

and then the community over the past decade or so, decades, technically has done a very good job

27:02

of building anti-fredgile stuff or, you know, some communities. Are you referring to work by

27:08

Taylorview, like anti-fredgility stuff? Not consciously, so. Oh, cool. Okay. I am roughly aware of that work.

27:18

But no, I just meant it's not fragile. It's an intuitive term. Yeah. Yeah. Which

27:26

holds high to tangent, loading very specific meaning, like technical meaning in a specific context,

27:33

one to very intuitive terms. It's a great cause for miscommunication. Yeah, that's very true.

27:40

So. We're staying focused. We can do this. I'm going to stay focused now. You don't have to stay focused.

27:47

So no. What? No. Why didn't you say so from the very start? So this morning, breakfast.

27:58

What did you have in a breakfast? Serious. What are we doing? Yes. What time?

28:06

How did they differ from other cereals? So why you should use Willow? Why Willow is better than

28:15

everyone else? No. Here we go. We actually had a conversation about this just last night because

28:21

we were out at a bar. Indeed. Our listeners don't need to know that. I mean, just organically replay

28:30

that full conversation. But I don't know. Okay. Since I was a slash sarcasm,

28:38

friend one who didn't pick that up. I'm not Gosh's voice.

28:43

Well, I thought we were just going to edit that out. But thank you very much. No, no, no. This is

28:49

staying. 100%. Yeah. Maybe we can just edit that into two parts like comedy episode.

29:01

Okay. Insert Jingle from Aliyasha. Oh, no, you meant editing. Darn it.

29:12

So welcome back. Now we're staying focused. We sidetracked. Now we're going to sidetrack into

29:19

two other topics. But before we sidetrack into two other topics, we're going to continue on the

29:25

thread of the qualities that are inherent of a protocol such as willow and maybe specifically

29:31

willow. Let's start with like what's protocols like willow, which are usually like routing agnostic,

29:38

which is a term, but basically that you could use it over sneaker net, like put stuff on USB or

29:46

that you could put it on a route, your data, the image network or Bluetooth and bounce it between

29:52

phones or the regular internet. And now we have the peer-to-peer quality. And then we also have

29:59

the local first quality. And like willow is from my perspective, a very classic example of like

30:07

a peer-for-peer application or a protocol. Yeah. But here's what we're trying to provide beyond that.

30:15

We are trying very hard to enable features such as strong deletion, deletion of metadata,

30:24

user agency over which stuff propagates where and how to get rid of it later.

30:30

So number one, on a regular application, I can just delete my data. Why does that matter?

30:35

You can delete your data. Wow. Why have we thought of that yet?

30:47

No. So here's the thing. There's several facets to the question of deletion in distributed systems.

30:55

So once you put something on the internet, anyone might have it, right? Like it might be downloaded

31:03

by somebody else. And if you then delete it on your machine locally, that doesn't really help

31:09

because it's still on different machines. And you can ask other people nicely to also delete

31:15

their copies of the data. And if all of them comply, then yay, you deleted the thing truly.

31:20

However, if somebody took a screenshot and printed it out, then no computer protocol is going to be

31:29

able to break into their home and destroy that printout. So there are certain limits to which

31:36

forms of deletion you can achieve. But in this peer-to-peer space where one of the core tenants

31:42

is essentially that data stored redundantly on multiple machines. And you don't really know

31:47

in advance which machines those might be. There is a very tempting way of going about deletion,

31:53

which is just to give up on it. You might say, well, in no chance, you can't force people to

31:58

delete it. It's open source. They might just modify that client and not delete stuff.

32:02

So why even bother? That's one approach. And a surprisingly large number of protocols takes

32:09

that approach and makes choices in their data models. So kind of that choices, that's it at the very

32:17

heart of how you insert data into these systems that rely on deletion, not really being a priority.

32:26

Right? Scatterbut is... Well, it should be infamous for many reasons. One of them is that

32:34

its core data structure, essentially whenever you post new data, the new data contains the

32:42

reference to prior data. And Scatterbut says you have to kind of check the validity of that

32:49

reference. And that also includes changing the validity of, well, the thing you're referencing,

32:53

that references thing before it as well. So you have to check that step as well. And you kind of

32:58

have to step through the full chain and verify everything, which means you can't delete anything.

33:04

Everything has to be always available in order to perform this sort of verification.

33:09

So Scatterbut is inherently unable to delete stuff up to certain caveats.

33:15

So basically, on regular internet, it's difficult to get something deleted because

33:21

whomever could download it. On Scatterbut it was essentially an inbuilt function of the protocol

33:29

to make sure that things did not get deleted. Yes. And in Scatterbut, like the data structure

33:34

is called an append-only log. And more recently, there's been a lot of protocols exploring stuff

33:40

like Merkeljags, which is roughly put a generalization of this, where instead of having only one preceding

33:48

thing, you need to validate, you kind of multiple things you need to validate.

33:51

That's very rightly speaking, what a Merkadag is. And yeah, all of these are very

33:58

anti-deletion in some sense. And I'm not buying the argument that says, well,

34:06

don't expect things to go away if you've put them on the internet. Like, just don't

34:10

put things on the internet then. But I do want to put things on the internet. And I get it can't be

34:16

that black and white. So at the very least, the protocol should be able to, like, I should be able

34:23

to signal intent, I would like this to be deleted. Because sure, there might be malicious peers

34:34

who will not respect this intent. But you know, most human beings are kind of okay. And also,

34:40

most human beings don't modify this software. And they run their computer. So

34:47

I actually think the prior is more important. But any of yeah. Anyway, so if most people

34:58

do honor deletion requests, so if I post something and then I later delete that,

35:04

then the malicious user has to obtain the data kind of in the window where it was not yet deleted,

35:12

right? And that window relative to the duration of the universe, that window is going to shrink

35:18

and shrink and shrink and shrink. Right? Like, even if it's been online for two weeks before I

35:23

deleted it for two years, like in 50 years time for the vast majority, it should have been deleted.

35:29

And this becomes especially important because like one of the use cases that we saw in Scuttlebutt

35:34

was a lot of the networks where Scuttlebutt suddenly saw uprights and downloads was like,

35:41

for example, Myanmar when the word broke out. And that kind of I remember was quite worrying at the

35:46

time because a lot of the qualities of Scuttlebutt is definitely not made for safe communication

35:52

because if someone eventually found out that, whoa, their communication has been compromised when

35:58

they were trying to have safe communication with their peers in Myanmar and they just could not

36:04

delete anything and became a huge security risk. Yeah, I know at least one person who,

36:10

when that happened, suddenly had to spend all their time making sure that people did not use

36:15

Scuttlebutt over there. Yeah, so being able to solve this is quite like, deeply critical.

36:23

But then Willow enables the vision?

36:29

It's by signaling. It enables the vision in the way that you just described.

36:36

Yes, but it also attempts to do that fairly thoroughly. So here is another problem with

36:42

distributed systems where, or with our particular brand of distributed systems, where essentially

36:48

data might take a long time to get somewhere and might take different paths and you don't know

36:53

when which updates will arrive somewhere. So if I first say, let's use the default example from

37:01

our own website, we write some data and we give it kind of the name, I hate my boss.

37:11

Or Trump. Sure, let's stay with our boss. So I don't actually hate my boss.

37:17

So it's easy for me to do use that example. You post a poop emoji kind of to the, and you name

37:26

that thing, I hate my boss. And later you, for mysterious reasons, regret posting that.

37:33

And you might want to get rid of that. Well, you can delete the poop emoji by saying,

37:39

please delete the data that I named, I hate my boss. There's a certain problem with propagating

37:46

a marker saying, please delete the data, I named, I hate my boss across the network.

37:51

Again, it doesn't quite suffice to convince your boss that you never hated them in the first place.

37:56

If on their machine, they get a request, please delete, I hate my boss. And the boss is like,

38:02

what? Yeah, like even if they never got the poop emoji, they still got the metadata that something

38:07

happens somewhere. Right. And that's, for example, something that I, I don't know many other projects

38:14

that try to solve that, but we found a quite elegant solution there.

38:18

But Willow has this hierarchical approach to effectively naming the data you put in there.

38:25

You can just think of it as like a path in a file system. Whenever I want to insert data into Willow,

38:31

I essentially select a folder inside Willow and that folder might live in a different folder or

38:37

directory. Right. Until there's actually the data file. And what we can do is essentially,

38:43

we have something that looks like deleting a folder high up in the hierarchy. So if I

38:51

post, I hate my boss inside something code, well, posts, for example, then I could just say,

38:59

please delete the posts on the directory. And then that is the request that gets circulated around

39:07

that kind of stays around. Right. So everyone knows I had a post directory launch,

39:13

once, which makes sense of kind of the education we're using to communicate as a post directory.

39:19

But there's no trace of me having posted. I hate my boss inside the directory.

39:27

So that was one of the first things that you said differentiated Willow from other somewhere,

39:34

like peer for peer protocols. I can just also give you the next thing that's on my mind as you

39:39

say that, because it fits well with what I just have been described, which is that we give these

39:46

hierarchical names to data. So essentially whenever you publish something, it's like placing it in a,

39:53

like, at a certain place in a file system. But that's surprisingly rather atypical.

40:01

It's atypical, because many systems are built on a different kind of addressing called content

40:08

addressing, where you don't assign a name that has meaning to certain data. But instead what you do

40:17

is you take the data, you put it into a magic concept called a cryptographic secure hash function.

40:24

And what this hash function spits out is a random looking string of garbage that uniquely

40:31

identifies the data and that random looking string has been, has been computed from the data you

40:38

put into that. So if, you know, if some monkey sits at a typewriter and writes the works of Shakespeare,

40:44

and at a different typewriter, a different monkey happens to also write the exact works of Shakespeare,

40:50

and both monkeys independently put their output through the same hash function. The shorter

40:56

random looking identifier, which is a lot shorter than the actual works of Shakespeare is going to

41:00

look, there's going to be identical for both of them. And then systems use that to request data,

41:08

for example. The same technology is also used in the append only lots of scuttlebutt or the

41:14

generalization to mercury's, because when I spoke earlier of new things, reference old things,

41:21

what that actually means is new things contain this, contain the digest of the case of scuttlebutt,

41:28

the previous, the previous message. Yeah. And well, that's a certain problem, because

41:37

suppose I was able to delete an old thing, but somebody has the new things that's

41:45

still pointing to the old thing by manner of just including the digest of the old thing.

41:52

Now I can confirm the guess as to what the old thing has been. Like, if I'm like, my boss might

42:00

think, well, they probably posted a poop emoji. So they take a poop emoji and they feed it into

42:07

the hash function, and they get a digest. And if that digest kind of matches the thing,

42:14

like the old thing, of which the new thing includes the insecure digest, then they can confirm that

42:21

guess. And it's got what you can never get rid of that, because you needed to keep this

42:25

this chain. So basically content addressing has immediate consequences for deletion. And

42:33

there's, there's multiple ways of looking at it. So one of them is to treat this as a feature and say,

42:40

it's censorship resistant, and you never get link rot. Right. So link rot is the concept that

42:45

when you have a hyperlink on the web, so one website points to different websites. Quite often,

42:50

the other website doesn't exist any longer and you just get, sorry, I couldn't find this. And

42:56

with these, these let's call them cipherlinks, because it's kind of a cute name,

43:02

with these cipherlinks that there isn't really a thing, because anyone who still has that old

43:07

website knows its hash and can just feed it back. So you can't really delete stuff, because it

43:14

doesn't even matter whether something has been created or are not. Like, it doesn't matter if I

43:20

posted the poop emoji, if anyone who might not have been me posted the poop emoji,

43:24

but would have the same hash. Right. So then that thing would be retrievable by that hash.

43:30

Like there's no direct connection to me as an entity with intent in this kind of system.

43:38

And I'm not quite sure why I'm going with this. You were going into the direction of like how

43:44

will it visit differently? And you know that that's where you want me to go, because I'm using

43:49

too much time, I wasn't going there at all.

43:53

Well, then I carried you right and I would not know.

43:57

Then I tried reading my own mind for another second.

44:04

Oh, yes. I know where I was going. Linkrod.

44:06

Yeah, you must go.

44:07

So yes. So this kind of content addressing, it can be said to solve linkrod, because it doesn't

44:14

matter whether the server where the data originally lived goes offline, because that never mattered

44:19

in the first place. All that mattered was that somebody figured out that a particular piece of

44:25

data maps to a particular digest. And from that moment on that, that connection has been made,

44:32

and anyone who ever had the data can now answer the request for the digest.

44:37

And it's not dependent on a single server, which can eliminate the main cause of linkrod.

44:41

But the problem here is, of course, you also can't deliberately induce linkrod.

44:47

Like sometimes you just want your website to not be on the web anymore.

44:53

So building the web on this technology, which many projects have enthusiastically tried to do

44:59

in the name of censorship, resistance, and well, anti-fragility actually.

45:05

That has deep flaws, because it robs humans of the agency to retract something.

45:14

And one of the core ideas behind WOODO is that we believe that there is a different way.

45:21

So if we have kind of the classic way the web does it, which is links just point to a location.

45:29

And by removing the data from that location, you can delete stuff, but also

45:34

locations just rot away because you need to pay for them on the web,

45:38

and servers run on electricity, and servers grow old.

45:43

So you kind of get linkrod back into the system, and then there's the opposite approach of

45:49

making links really, really unrotable to the point where you can't get rid of them, even if you

45:54

wanted to. And what WOODO does instead is something different. It allows people to assign meaningful

46:03

names to the data. So I as a user can say, I want this poop emoji to be addressed not as

46:12

the secure head, the secure digest of the poop emoji. And neither I want to say, well,

46:17

you need to retrieve the poop emoji from this particular server. But rather, I just say, well,

46:21

this poop emoji, it's going to be a reachable under the name I hate my boss, as posted by me,

46:27

where me in the sense is an entity participating in the network, which,

46:31

practically speaking, just means some public key of a group of graphically secure

46:37

signature scheme. The standard form of identity in distributed networks,

46:42

like the public and private key. Yeah, pretty much. Instead of a login with the password.

46:49

Right. Yeah, I think going to the details of that would actually take us off too much on the

46:53

attention. So we will. Yeah, let's just assume that we know what a public key and a secret key

47:01

in a signature scheme. I didn't. So this is kind of important. It's not that there's like on the

47:07

whole world, everybody now associates, I hate my boss with a poop emoji, rather that's tied to me.

47:13

But if somebody asks, well, which data did Ayasha assigned to, I hate my boss, then they would be

47:20

getting back a poop emoji. Yeah. And the way this word is really works, it's really just I

47:26

create a record saying the name I hate my boss now maps to a poop emoji, signed Ayasha.

47:32

Yeah. Where instead of signing Ayasha, I actually use my secret key to generate a secure signature

47:37

that anyone who knows my public key can verify as actually belonging to that, like having been

47:42

produced by the person having the secret key for that public key, poor, this is.

47:46

But no one else would be able to reproduce that like signature.

47:50

Exactly. Only I am able to produce the signature. And therefore, by trying to guess if you had

47:55

posted poop emoji, even by trying to guess that you posted poop emoji or thinking that they have

48:01

the answer, they wouldn't be able to replicate the exact same post because they don't have your

48:07

private key. Yeah. So nobody can pretend that I posted a poop emoji, except for me myself.

48:15

And then it's not really pretending. And so the cool thing about this is this does not rot

48:21

on its own. Right. The signature is going to stay valid until the end of time. At least that's

48:26

what cryptographers pretend. And that's great. And we're going to make that a sound too. Right. So

48:32

if as long as the cryptographic primitive stays secure, the thing is just as durable as a content

48:40

addressing thing, you're right. It's just like, if I die, then in 40 years, there's still going to be

48:48

kind of my signature on I hate my boss being mapped to a poopy. How did we end?

48:56

I don't think I've said poop emoji this often my entire life. In total. Good thing it's just a

49:04

private conversation between my good friends and myself. Yeah, it's not a dependable log.

49:10

So suppose I'm deeply unhappy with that. And I want to change that poop emoji to a unicorn emoji.

49:21

Then I can. I just have to issue the new record saying, I am holder of the secret

49:28

fee for the following public key. Short-hand for that. I am Yasha. Now associate the

49:35

friend about a boss with a unicorn emoji with an irony emoji. Let's just pretend there's an

49:41

irony mode just probably an iron ingot and then something. Yeah. So I can just do that. And

49:49

all I really need then is to attach to these two signed records one saying, I have my boss poopy

49:55

emoji and one saying, I have my boss irony emoji. All I need is a way for telling which of them

50:00

is newer than the other. And then kind of everyone who gets the newer one can throw away the old one.

50:08

And this way we still get the ability to override stuff to mutate data and also effectively to

50:16

deletion willows just a special case of overwriting some stuff. And so you get the

50:23

intentional parts where you deliberately want to remove data from the web. But you get rid of the

50:30

passive link rod that is inserted into the web depending on your point of view,

50:34

either by virtue of being dependent on locations or by virtue of being dependent on capital flowing

50:40

into a system. Yeah. And that's one of the big hypotheses that we're exploring that this actually

50:47

works and that it creates value. So this whole concept, like where did it start? Where does it come

50:55

from? We all designed it on our own. No, it is for Asian whatsoever. We're really young.

51:05

Yeah. Willow, and in particular the ideas that I just talked about, is based on a prior project

51:18

called Earth Star. And Earth Star in turn grew out of Scuttlebutt actually in a sense,

51:25

because there was a user on Scuttlebutt called Cinnamon who is no longer with us sadly. But Cinnamon

51:31

at the time saw problems with Scuttlebutt that not many other people were talking about at the time,

51:39

many of which I've talked about by now, like the the paragraphs of being unable to delete things

51:45

and kind of how structurally they anchored in the usage of the abandoned only log. And

51:51

Cinnamon eventually realized that Scuttlebutt cannot or should not be the future and started

51:58

their own protocol, which was called Earth Star. And Earth Star was based on this idea of these

52:06

signed bindings of mapping human legible names deliberately to data rather than just using content

52:13

addressing. And then Sammy joined that effort actually way before me. And the way I then later

52:22

joined, I caught COVID and took some aspirin and then felt very sassy. And for some reason

52:32

decided to write a little document how I would do Earth Star from Scuttlebutt if I could,

52:39

but I would never do that obviously. I gave it the worst possible name so that nobody could

52:46

ever take it seriously, which was soil sun as opposed to Earth Star. So soil sun, which nobody

52:53

should name their protocol soil sun. I had a good, all right? And especially when it's okay.

53:06

No. It's very fertile. Indeed. And Sammy for some reason took it seriously. And Sammy for some

53:15

reason wanted to implement it and acquired funding and gave it a good name. So Willow actually comes

53:24

from the aspirin that I took. Oh, I didn't know this part of the story. That's beautiful. Now you

53:31

know. Oh my god. Yeah. But there is also a certain part that you brought in and I can assume

53:39

what did you call it? Soil. Soil sun. Yeah. Soil sun. The tagline was a minimalistic reimagining

53:47

of Earth Star. Did you bring in something that you're quite well known in our circles for is like

53:54

rich base sub reconciliation? I did not. Well, depending on how you would like to treat causality,

54:04

I did not bring in orange base set reconciliation when with the soil sun draft, the soil sun draft

54:12

just started out as we me removing a bunch of stuff. And then as it turned into Willow,

54:18

later adding back other kinds of stuff, no way to set reconciliation. Yeah. There's there's a good

54:26

story there actually. So back in the day, it's it's 2019, I think. I have been active on scutter

54:38

but at one point, the strange thing happened where like a real the real world professor joined the

54:44

network. Yeah, this person joined the network showed off some work of this and was really cool.

54:51

And you know, you you stalk these people on social media and turns out they chaired the

54:58

computer networking group at the University of Basel that professor is called Christian

55:04

Tuden or CFT was a scutter part and stuff escalated. We ended up writing a paper. So a CFT and Eric

55:14

Lavera, who later also did a post after a position in Basel and me and Dominic Tar himself wrote this

55:25

indeed. We wrote those paper on Sophia's cut about and actually got it published and that was

55:30

quite fun. And you know, at one point, Eric and I got invited to Basel University and we spent

55:38

two days, I think, there just working on this paper and I running out our thoughts. And through

55:44

that, I got to know Christian Julian, who in 2019 then invited me to do a semester in his group as an

55:53

intern, essentially. And during that semester, I set out thinking a lot about the pendant only logs.

56:01

But what actually happened very much inspired by postings of Cinnamon on Scutterbutt about kind

56:07

of the dangers with a pendant only logs. And not only the dangers, but also this inherent limitation

56:14

of one thing has to come after the other, which is not great if you want to use multiple devices

56:19

because you know, you might just post stuff on one device and on the other concurrently before they

56:25

have talked to each other. You might want to pick, you probably want to be able to do that, but then

56:29

the append only log does weird things and is unhappy and your data becomes corrupted and everyone has

56:34

to reject it and to sadness. So Cinnamon and I were both exploring on Scutterbutt, notions of how to

56:43

avoid that. And at the time, I was very fascinated with category theory of all things, which is this

56:50

arcane branch of mathematics. And very central to that branch of mathematics is the notion of

56:57

everything of every concept having a kind of dual concept that you get for free, kind of you can look

57:03

at everything from two different viewpoints. But they're essentially the same, but kind of different.

57:10

And I was wondering what is the dual to the log or rather sequence, because there's certain

57:17

context and computer science where kind of the operation of saying first this, then that exists,

57:23

and then the dual notion for that is either that or the other. The important part being that either

57:29

that or the other, the order doesn't matter. Right, you always choose one of them, it doesn't matter

57:35

whether you placed one first or the other first. You can still choose the same thing and

57:41

irrespective of how they were placed, I'm rambling, as opposed to sequencing stuff,

57:48

we're saying first this, then that there's clearly a notion of one comes before the other.

57:54

So I was wondering a lot about how does the the logliness of secure Scutterbutt, which is all about

58:02

sequencing stuff in order. Well, if the dual to that is not caring about the order and just

58:08

having a collection of stuff, which we call a set in mathematics, how does that relate to each other?

58:14

And then is there a nice way for efficiently reconciling the difference between two sets?

58:21

Because we never set the C explicitly, but the main reason why Scutterbutt uses this highly

58:26

restrictive and kind of yucky append-only log is because it makes for extremely efficient and

58:34

simple data synchronization. I just like, if you and I meet and both of us are interested in the

58:41

data that Dominic has published, then I just say, well, I have the first 200 things Dominic has

58:46

published. And you say, oh, cool, but I already have 212 things. So what you then do is you just

58:52

send the newest 12 things to me and that's it. So all we really needed to exchange in order to

58:57

get communication going were two numbers, right? Who has how much stuff? That's extremely cool and

59:03

simple and doesn't work at all for unordered collections, so for sets. So I ended up spending

59:11

most of my time in Basa trying to figure out how to do that, like how to reconcile sets.

59:18

And so I read a bunch of papers. I found this one paper that had a solution, but which was ugly.

59:27

We talked about it now. I'm driven by things not being pretty enough. And that paper,

59:34

what it did essentially was it forced a certain structure on the things in the set. So rather

59:42

than saying I inserted things in this order in the sets in the set, and now that's the structure

59:48

that wouldn't work, because if you insert things in different order, you still end up with the same

59:52

set. If you insert the same things, right, if I first insert Apple and then banana or if I first

59:59

insert banana and then Apple, I still get the set containing banana and apple.

1:00:03

And so you can't really do it based on the ordering, but what you can do is you can just say, well,

1:00:09

for any possible combination of fruits, I'm just going to define kind of a rigid structure of them,

1:00:18

which could be added simple. It's kind of just sort them alphabetically, for example. For various

1:00:22

reasons, that's not efficient to just sort them in the list, but you need to arrange them in a tree,

1:00:27

which is a particular kind of data structure we don't really need to get into. But what they did is

1:00:31

they built a tree whose shape depended uniquely on the contents. Then they put

1:00:39

Merkle labels on that, which we probably don't need to go into. But if you know what a Merkle tree is,

1:00:45

that's the core idea of theirs. They just do a special unique Merkle tree for every set, and then

1:00:51

you just kind of compare the first part of each tree, the root node, do they have the same Merkle

1:00:58

label? If so, then you know the trees are identical. If not, then you kind of compare the

1:01:02

left and the right child. Rocky speaking, that's the ideal here. The paper being

1:01:07

called, I don't actually know the name. I will put the link somewhere.

1:01:10

And you've written a paper on this? No, that's the thing that I found ugly.

1:01:15

Oh, but you've written a paper on rangeface, so very conservation.

1:01:19

Yes. So which was my take on how to do things instead, where the important part is effectively,

1:01:27

you do a very similar technique of kind of saying I compare kind of the highest point on the tree,

1:01:32

and if they are equal, then I know that everything is equal. And if they are not,

1:01:35

kind of look at the next lower layer in the tree and start comparing points in that layer and so on.

1:01:44

But the core difference in my work was that I didn't mandate a particular shape of tree,

1:01:52

but rather I would say you can use any kind of tree that stores the same sort of data.

1:01:59

And you can see the main work somehow, and I probably shouldn't explain the details

1:02:05

right now that would take a while. But if you are curious about it, then go read the paper.

1:02:11

No, don't read the papers. That paper, that paper was written so that it would make it past

1:02:20

reviewers. I mean, yeah, actually you'd read the paper.

1:02:25

So think about it. What do you, where would you send people?

1:02:30

I shouldn't be stumped by this.

1:02:34

I'm deeply saddened that I am stumped by this.

1:02:39

So before I wrote the paper, I actually just put a little write up on GitHub that does a lot of

1:02:45

hand waving, but probably is a way better entry point than the paper itself.

1:02:50

So I can link to that.

1:02:53

And I can look for some other resources.

1:02:55

And right now we are at time because I'm trying to keep these compact enough that they're

1:03:01

in listenable. And so is there any final note you'd like to pitch in before we close this

1:03:10

beautiful introduction chapter?

1:03:16

Thank you.

1:03:24

So

Willow

Episode description

Persons

Aljoscha Meyer

Zenna Elfen