scraping atproto for the most followed accounts
## Am I Famous Yet?

Some simple atproto scripts to check whether you're famous yet or, better yet, whether your PDS is.

### Running the script

There are two main scripts here:

#### The fetcher

The fetcher is the most important part here. It takes about 10 minutes to get
all the data from the non-bsky PDSes and then processes it into two massive
JSON files.

To run the fetcher, run the following commands:

1. First, fetch a fresh copy of the PDS list. This isn't needed
   if you already have all the PDSes you want to check.

```bash
curl "https://raw.githubusercontent.com/mary-ext/atproto-scraping/refs/heads/trunk/state.json" -o data/data.json
```

2. Then we can run the fetcher:

```bash
pnpm run gen
```

This will generate the `data/accounts.json` and `data/pdses.json` files.

The `data/accounts.json` file contains every account along with its follower
count, its PDS, and any other information collected about the account.
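
As a rough illustration of how the account data could be used to answer the "am I famous yet?" question, here is a minimal sketch that ranks accounts by follower count. The `Account` shape and its field names (`handle`, `pds`, `followers`) are assumptions made for this example; check the generated `data/accounts.json` for the real layout.

```typescript
// Hypothetical shape of one entry in data/accounts.json.
// The real field names may differ.
interface Account {
  handle: string;
  pds: string;
  followers: number;
}

// Returns the 1-based rank of `handle` when accounts are sorted by
// follower count (highest first), or null if the handle is not present.
function followerRank(accounts: Account[], handle: string): number | null {
  // Copy before sorting so the caller's array is left untouched.
  const sorted = accounts.slice().sort((a, b) => b.followers - a.followers);
  let rank = 0;
  for (const account of sorted) {
    rank += 1;
    if (account.handle === handle) {
      return rank;
    }
  }
  return null;
}
```

In practice the `accounts` array would come from parsing `data/accounts.json`; the function itself only needs the parsed data.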

The `data/pdses.json` file contains every PDS along with its accumulated
follower count, the number of accounts on it, and the ratio of followers to
accounts.
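
The per-PDS figures described above can be derived from the account data along these lines. This is only a sketch of the aggregation, not the fetcher's actual code: the `AccountEntry` and `PdsSummary` shapes, their field names, and the sort order are all assumptions made for illustration.

```typescript
// Hypothetical shapes; the real files may use different field names.
interface AccountEntry {
  pds: string;
  followers: number;
}

interface PdsSummary {
  pds: string;
  followers: number; // accumulated follower count across the PDS
  accounts: number;  // number of accounts hosted on the PDS
  ratio: number;     // followers divided by accounts
}

function summarizeByPds(entries: AccountEntry[]): PdsSummary[] {
  // Prototype-free object so PDS hostnames can never collide with built-ins.
  const totals: Record<string, { followers: number; accounts: number }> =
    Object.create(null);

  for (const entry of entries) {
    let total = totals[entry.pds];
    if (total === undefined) {
      total = { followers: 0, accounts: 0 };
      totals[entry.pds] = total;
    }
    total.followers += entry.followers;
    total.accounts += 1;
  }

  // Every PDS in `totals` has at least one account, so the division is safe.
  return Object.keys(totals)
    .map((pds) => ({
      pds,
      followers: totals[pds].followers,
      accounts: totals[pds].accounts,
      ratio: totals[pds].followers / totals[pds].accounts,
    }))
    .sort((a, b) => b.followers - a.followers); // most-followed PDS first
}
```

A PDS with a single very popular account will show a high ratio, while a large PDS full of small accounts will show a low one.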

#### The site

The site is a simple static site that displays all the data from the fetcher.

To run the site, run the following command:

```bash
pnpm run site
```

This is really a build command: it generates the site in the `dist` folder.

There is already an instance of the site running at <https://isabelroses.github.io/amifamousyet>.