diff options
Diffstat (limited to 'docs/operations.md')
-rw-r--r-- | docs/operations.md | 16 |
1 files changed, 10 insertions, 6 deletions
diff --git a/docs/operations.md b/docs/operations.md index 1e437f6..a7785f1 100644 --- a/docs/operations.md +++ b/docs/operations.md @@ -5,7 +5,7 @@ about the local systems and a timeline for our operations leading up to the results for [Tranco top-1m][] and [SANs in CT logs][] during April, 2023. [Tranco top-1m]: https://tranco-list.eu/ -[SANs in CT logs]: https://git.cs.kau.se/rasmoste/ct-sans/-/blob/main/docs/operations.md +[SANs in CT logs]: https://git.rgdd.se/ct-sans/about/docs/operations.md ## Summary @@ -14,13 +14,17 @@ unique two-label `.onion` domains were found from 285 Onion-Location sites. The time to conduct the full measurement for [SANs in CT logs][] was ~10 days. 3330 unique two-label `.onion` domains were configured from 26937 unique sites. -13956 of those unique sites have the same Onion-Location configuration as -Twitter, which likely means that they copied some of their HTML attributes. +13956 of those "unique sites" had the same Onion-Location configuration as +Twitter. At first this was surprising, but it was eventually explained by +onion-grab following redirects without attributing the configured Onion-Location +with the redirected destination, see [20] towards the bottom of the timeline. The collected data sets are available here: - - https://dart.cse.kau.se/onion-grab/2023-04-03-tranco.zip - - https://dart.cse.kau.se/onion-grab/2023-04-03-ct-sans.zip + - <https://dart.cse.kau.se/ol-measurements-and-fp/onion-grab/2023-04-03-tranco.zip>, + `sha256sum` 1f4a0b4009486bce83262f8e3a58ec50757c3f49305cfa427dadbb10dc4b8c1b + - <https://dart.cse.kau.se/ol-measurements-and-fp/onion-grab/2023-04-03-ct-sans.zip>, + `sha256sum` 8d476da6077c7bff2c0afbe444344c9549ad0d1b64cacfd525a7c65dec68529c For further information about system configurations and operations, read on. @@ -270,7 +274,7 @@ a good balance between found Onion-Locations, errors, and timeliness of results. The [ct-sans dataset][] that we will `onion-grab` in the full measurement was collected and assembled at 2023-04-03. It contains 0.91B unique SANs. -[ct-sans dataset]: https://git.cs.kau.se/rasmoste/ct-sans/-/blob/main/docs/operations.md +[ct-sans dataset]: https://git.rgdd.se/ct-sans/about/docs/operations.md To avoid biases like encountering the same errors at all VMs due to the order in which the sites were visited, the dataset is shuffled separately before use. |