A few months ago, I did a hike to a lake called “Lac Vert” (Green Lake) in France. It’s a mountain lake located close to the Italian border. I found it remarkable how vividly green the lake was. Although the name describes its appearance well, I was still surprised. This made me wonder: is it common for lakes to have appropriate names, reflecting their properties?

Lac Vert

On the other hand, sometimes geographical names are misleading. I used to live in a city near Moscow called Dolgoprudny (meaning “Long Pond”). The city’s name origin is not apparent upon visiting it and I think it’s mildly misleading for a couple of reasons. First of all, the pond is located outside of the city bounds. Secondly, the dimensions of the “Long Pond” are 1100m x 150m, which is long, but not remarkably long. In fact, even if the pond was 100 times longer, it would still be unremarkable: there are tons of long bodies of water in the world, called rivers. The fact that it’s a pond would’ve merely been a fun fact, like the fact that the Caspian Sea is actually a lake.

Anyways, I decided to find out how accurate lake names are regarding color. I determined the 10 most common lake names in different countries and analyzed their average colors.

Collecting data

If data grew on trees, the world would be a forest

The required data isn’t just lying out there in a thoughtfully prepared format. It is available, but you have to reach for it. I enjoy harvesting my data myself. I like my data freshly picked, unpackaged, unprocessed, in just the right quantity. Thankfully, as of this article, data is as ripe as ever, perfect for a nested JSON.

The recipe I am following today is simple:

  1. Collect all lakes in a country, store their names and coordinates.
  2. Determine and keep only the 10 most common lake names.
  3. Determine the color of each lake by fetching satellite image data for each lake center.
  4. Calculate the average color for each name.

For the first step, I used Overpass Turbo to collect all lakes in a country. The example query for France retrieves all nodes, ways and relations labeled as water bodies with “lac” (lake) in their name. I do not trust the “water: lake” tag on OSM, so instead filter by name, since niche labels are more likely to be missing. The output we need are the tags, center coordinates and let’s grab the count too.

[out:json];
{{geocodeArea:France}}->.searchArea;

(
  node["natural"="water"]["name"~"lac", i](area.searchArea);
  way["natural"="water"]["name"~"lac", i](area.searchArea);
  relation["natural"="water"]["name"~"lac", i](area.searchArea);
);
out tags center qt;

out count;

As a result, a total of 3067 French lakes are exported to a JSON, just the way I like it.

Lakes of France

For the next steps, I pull out my favourite and conviently appropriate tool, which is part of my everyday carry: TypeScript (I stopped using JS after a violent incident with a NaN). Determining the 10 most common lake names is straightforward, but what about the color? I decided to tackle it with the MapBox API, since it has a reasonable free tier. I fetched the satellite image tile for the zoomed-in region of each lake center and read a single pixel at the center of the tile, which should give us an idea of the lake’s color. I’m not going to describe the code here, since all software developers become tired of seeing array manipulations even before starting their career while grinding leetcode, and they become tired of figuring out APIs and doing clumsy data transformations after. And everyone else just doesn’t care. Personally, I’m here for the lakes.

Analysis

At night all lakes are black

I determined the colors of all the 10 most common lake names, not only color-related ones. Thus, the expected column is accurate only for color-named lakes, being based on a naive interpretation of the name. For the rest, it provides a reference symbol to ease understanding.

France

Name Count Expected Average Color
Noir (Black) 41
Bleu (Blue) 30
Blanc (White) 23
Vert (Green) 21
Grand (Large) 17 LAC
Petit (Small) 17 lac
Rond (Round) 12 O
Grande Tempête (Great Storm) 11 ~~~
Long (Long) 9 ———
Giaset 9 ?

Four of the 10 most common lake names are related to a color. Black lakes are a little bit darker than other lakes. Blue, white and green lakes are not distinguishable from each other at all. All the lakes have a light-blue-greyish tint.

Italy

Name Count Expected Average Color
Nero (Black) 42
Verde (Green) 27
Grande (Large) 20 LAGO
Lungo (Long) 13 ———
Azzurro (Light Blue) 12
Scuro (Dark) 10
Bianco (White) 9
Dei Cigni (Of Swans) 9 _^ ^_
Gelato (Frozen) 9 ***
Piccolo (Small) 8 lago

In Italy, we see five color-related names. Overall the lakes have a darker, greener and more contrasting colors. Black lakes are again the most common name by far. But this time, they are not darker than other lakes. The light blue and white lakes are noticeably lighter. A bit more common sense in Italy.

Russia

Name Count Expected Average Color
Чёрное (Black) 204
Круглое (Round) 187 O
Долгое (Long) 184 ———
Кривое (Curved) 162 ~
Белое (White) 138
Глубокое (Deep) 112 |…|
Щучье (Pike) 104 ><>
Глухое (Silent) 90
Светлое (Light) 88
Большое (Large) 88 ОЗЕРО

Again, the most common lake name is black. Average colors are remarkably consistent, all being dark green, noticeably darker than in France and Italy. This time, shape descriptions are more common, perhaps since every lake is the same color? Also, there are more than a hundred lakes swarming with fish: Pike! Food for thought.

Belarus

Name Count Expected Average Color
Белае (White) 48
Чорнае (Black) 43
Святое (Holy) 34
Доўгае (Long) 34 ———
Глыбокае (Deep) 18 |…|
Круглае (Round) 17 O
Глухое (Silent) 14
Вялікае (Large) 12 ВОЗЕРА
Крывое (Curved) 12 ~
Бяздоннае (Bottomless) 8

Unexpected: among the 10 most common lake names, only two refer to color, white and black. For the first time, white lakes are more common than black ones, but they are not even lighter. We can speculate that Belarusians have a tendency to call things “white”, including their country, which means “White Rus”. Although, in reality it’s the opposite: Belarusian lakes are actually on average even darker than in Russia. The third most common name is “Holy”, which came to my mind when I thought about the most common lake name in my country. Eight lakes are apparently bottomless. I’m not willing to verify this fact.

Conclusion

Let’s gather around the Round Lake and talk

Lake names are not very accurate regarding color.

Okay, I know you want to know it. Here is the color of the mentioned Lac Vert, extracted using my genius method of using the color of a single pixel from the satellite image.

Maybe single pixels are not very accurate too.

Pointless, really? Are you disappointed?