đŁTHREAD: Itâs surprising to me that so many people were surprised to learn that Signal runs partly on AWS (something we can do because we use encryption to make sure no one but youânot AWS, not Signal, not anyoneâcan access your comms).
Itâs also concerning. 1/
Concerning, bc it indicates that the extent of the concentration of power in the hands of a few hyperscalers is way less widely understood than Iâd assumed. Which bodes poorly for our ability to craft reality-based strategies capable of contesting this concentration & solving the real problem. 2/
The question isnât "why does Signal use AWS?" Itâs to look at the infrastructural requirements of any global, real-time, mass comms platform and ask how it is that we got to a place where thereâs no realistic alternative to AWS and the other hyperscalers. 3/
Running a low-latency platform for instant comms capable of carrying millions of concurrent audio/video calls requires a pre-built, planet-spanning network of compute, storage and edge presence that requires constant maintenance, significant electricity and persistent attention and monitoring. 4/
Instant messaging demands near-zero latency. Voice and video in particular require complex global signaling & regional relays to manage jitter and packet loss. These are things that AWS, Azure, and GCP provide at global scale that, practically speaking, others (in the western context) donât. 5/
This isn't â'renting a server.' It's leasing access to a whole sprawling, capital-intensive, technically-capable system that must be just as available in Cairo as in Capetown, just as functional in Bangkok as Berlin. Particularly given the high stakes use cases of many who rely on Signal. 6/
Such infrastructure costs billions and billions of dollars to provision and maintain, and itâs highly depreciable. In the case of the hyperscalers, the staggering cost is cross-subsidized by other businessesâthemselves also massive platforms with significant lockin. 7/
Meaning that infrastructure like AWS is not something that Signal, or almost anyone else, could afford to just âspin up.â Which is why nearly everyone that manages a real-time serviceâfrom Signal, to X, to Palantir, to Mastodonârely at least in part on services provisioned by these companies. 8/
But even if Signal had the billions needed to recreate AWS, itâs not just about money. The talent to run these systems is rare & concentrated. The expertise, the tooling, the playbooks, the very language of modern SRE came out of these hyperscalers, and is now synonymous with 'the cloud.' 9/
o, yes, Signal runs on AWS. It also runs on your phone, which runs on iOS (Apple) or Android (Google). And on Dekstop, via Windows (Microsoft). Each of these presents similar dependencies on large entrenched tech companies, and concomitant barriers and risks. 10/
In short, the problem here is not that Signal âchoseâ to run on AWS. The problem is the concentration of power in the infrastructure space that means there isnât really another choice: the entire stack, practically speaking, is owned by 3-4 players. 11/
So, Signal does what we can to provide a service w integrity in the concentrated ecosystem we're working in. We protect your comms w end-to-end encryption, so that we can use AWS and others as a highway across which to send Signal data in ways that donât let AWS, or anyone else, gain access. 12/
To conclude: my silver lining hope is that AWS going down can be a learning moment, in which the risks of concentrating the nervous system of our world in the hands of a few players become very clear. And that this can help us craft ways of undoing this concentration and creating real choice â¤ď¸ 13/
@yawnbox I don't think you have a clear understanding of what you're talking about, and it might be fun for you to look a bit more deeply into how TOR works and its dependencies.
@Mer__edith Agree - if you want to run your service centralized. Neither my Mastodon nor my Matrix-server need anything but my own self-hosting. Of course they won't handle billions of concurrent customers - but a few tens of thousands similar to mine will. Together.
I simply don't think Signal being centralized is a good thing. It's your choice, but alternatives do exist and those do not need hyperscalers.
@troed I don't think you have a clear understanding of this space, but I hope you have a good time digging in and learning more.
@Mer__edith interesting engagement levels across different sites.
@Mer__edith great explanation and thank you for that.
Is it possible to build a future promoting other competitors to AWS gathering partners (other executive leadership with different companies) who rely heavily on these services to help level the playing field as well as give applications like Signal multiple options to run their infra? It would be complex, but it already is anyway. I've learned over the years to be as redundant as possible in every aspect of your infrastructure which includes things like cloud providers.
I have ran redundant workloads in AWS, GCP, Azure and tried to tie that in with smaller orgs like Linode, OVH and have had success. I understand that Signal is far more complex and your service needs to be real time whereas in oil and gas(the industry I was in mainly during my career) not everything needed to be as real time as your application.
We need someone to kickstart us into balancing out the playing field to give other, smaller competitors a chance at the same time helping to make these applications more redundant in the event of a disaster like we just had. Costs for such a design would be high but overtime that could balance out as we get more competition flowing.
@Mer__edith Is there any chance of Signal adding its own keyboard in the future? The input system seems like a pretty important privacy concern at the moment.
@Mer__edith It's possible to manage a real-time service without these companies, but it's challenging. I understand that it would hard for Signal to build and maintain its own infrastructure, but the questions is, if Signal team is considering what to do in the future to make impact of future outages (which may occur again) to have lesser impact on people who use it.
@Mer__edith Thanks for your condescending reply. I used to manage global SaaS within fintech with nodes in GCP, AWS and Azure and on multiple different continents.
@Mer__edith Hello Meredith, all fair points of course, and I believe that many understood what happened. Isn't there a way to distribute the load in a fashion that it would not fail when US-EAST-1 at AWS is down ? (either with AWS itself or with other providers to balance the risks)
@ericfreyss @Mer__edith
presumably that's on AWS to fix. the AWS deployment of Signal uses DynamoDB, which in theory is multi-region. however that service has a chokepoint in US-East-1 that is part of the AWS design and not something anyone other than them can fix - you can't choose to have the DNS for it be deployed somewhere else when you deploy to DynamoDB. it's AWS's task to make DynamoDB actually as multi-region as they advertise it being.
@Yuvalne @ericfreyss @Mer__edith
This was the comment I was hoping to find!
The issue seems to be how AWS itself is internally architcted for resilience, or in the case of the last outage contains points of failure.
This isn't surprising but it does present challenges. I doubt that AWS will fail the same way again but next time it will be some different unanticipated failure mode.
Hopefully AWS will now be looking at this and hopefully coming up with a solution đđ¤ˇââď¸
@troed @Mer__edith I might be missing your point but there does need to be A choice that doesn't require self-hosting; because I promise you my parents are not about to learn to run a Signal-like service for themselves, and I don't want to have to do it for them.
Otherwise services like WhatsApp exist, and they just work, so they'll just use that. Which means I have to use it too. The value of a messaging app is (IMO) ENTIRELY derived from who you know on it so it has to be easy to adopt.
There's something inbetween "everyone selfhosts" and "everyone uses a single instance". Since you're posting on Mastodon, I do think you know what it is ;)
@troed yes but I'm posting from a "centralised" Mastodon instance (by which I mean it's outside of my control and there is a network effect beyond just me when this instance experiences issues).
I'm on Mastodon because the people I'm interested in following are here. But I have less than zero interest in self-hosting my own server.
This is only a partial solution IMO, and only works as well as it does because Mastodon is not required to be "real time" like a voice call.
@_calmdowndear So, your parents don't need to self-host yet they can be on a network that doesn't require everyone to be on the _same_ centralized instance.
My other example was Matrix. It's like Signal, but decentralized like Mastodon. It fully supports voice and video calls.
@troed so "Mastodon is down" is a thing that could still happen if the right piece of infrastructure dies. Because that's what happens from the user perspective - yes, the network may be up but if "your" instance is dead and you can't interact with it, then who cares?
I can't comment on Matrix too much as I hardly ever use it (unless required by some software using it for support). But what I have seen of it is a very frustrating user experience.
@_calmdowndear Absolutely - just like your local ISP can go down and those affected say "the Internet is down".
The rest of the world still works fine though.
nearly everyone that manages a real-time serviceâfrom Signal, to X, to Palantir, to Mastodonârely at least in part on services provisioned by these companies
Mastodon doesn't, though?
There certainly will be servers hosted on AWS but when AWS went down, most Mastodon instances stayed up, and people were cracking jokes at more centralized platforms.
@datum Mastodon is distributed at the level of the protocol, not infrastructure. Sure, some people use a server in their closet, but most license hyperscaler infra to host their mastodon instance.
Meta note, we seem to be dealing with a confusion in what the term "distributed" means in this context.
@Mer__edith Thanks, your explanation is helpful. Does it suggest that text-messaging should perhaps be the focus? I'm amazed that video/audio is possible, but it incurs much more massive requirements - it seems that the video/audio service is constraining the possible infra choices for the text-message service.
@danstowell No, because no modern and useful comms platform provides only text messaging. And if your platform doesn't provide normative features and functionality, people won't use it. And if they don't use it--even if you do--you can't use it either. The network effect rules communications effectiveness.
@Mer__edith The reason AWS is affordable is because they are *still* subsidizing your invoices by 80%. When doing price comparisons assume that one day AWS will start charging 5x more once they have succeeded in killing the rest of the competition. All of them will. Im certain if AWS was charging what it costs to run that service that colo or any other option would be just as viable from a price perspective. They absolutely cannot compete on anything other than price. They are really bad at it
@buherator not only does Signal already do that as @Mer__edith mentioned, but routing to GCP is exactly what Signal did during the latest outage, hence why for many users it recovered before AWS did.
It's easy to gloat about AWS but if Hetzner goes down it takes half the Fediverse with it. Same problem, different vendor.
@marta Decentralization at the level of protocol--in Mastodon's case via ActivityPub--does not mean decentralization at the level of infrastructural dependence.
@Mer__edith maybe Signal should reevaluate its standpoint against federation then? This would enable different entities to run different parts of the network in different regions and will remove the need for a globe spanning compute platform...
@Mer__edith @troed This is a really shitty reply, especially when multiple experienced people are expressing the same opinion.
@JadedBlueEyes @troed I'm sorry if it landed harsh.
First, I don't think not knowing things is inherently bad or shameful--it's where learning starts, etc. Second, there's a misunderstanding here, whoever is expressing it: decentralization at the level of a protocol--ActivityPub or w/e else--is NOT the same thing as decentralization of infra. People running mastodon instances, or Matrix servers, or other fedi systems, are also in most cases leasing infrastructure from hyperscalers to do so.
You're claiming something that isn't true (no, most people are not using hyperscalers - and yes - I know many people who run Matrix-instances) - and then respond to people that we don't have a clear understanding of this space.
It didn't "land harsh", btw. It just completely changed my impression of you since you actually seem to believe it.
@Mer__edith @buherator thank you. I'm just a Signal nerd who spent like half of the outage being on the forum and I'm in essence just repeating stuff said there 
đ #39C3
With respect Meredith, iâm talking about decentralized protocols and their capability to not depend so heavily on the service providers youâre arguing for. Tor Project has shown how possible it is (i used to work there, and itâs spelled Tor not TOR).
I listened to Moxieâs aversions to decentralization for years. Thatâs what I keep seeing now, with posts like these. I also understand the value of huge cloud providers, Iâve worked for many companies who use them, and have worked for them, and I understand why you depend on them and how important that is to a high quality service. Thank you for all that you all do.
But what conversations does Signal Foundation actually have on the topics of resiliency through decentralization? How much money could you save by allowing the community to take on aspects of the network? How much resiliency and trust could be gained, without losing performance?
Hetzner isn't a hyperscaler though. Meredith seems to confuse people choosing a VPS instead of hosting at home with using advanced AWS (et.al) services for scaling.
There seems to be a real disconnect in understanding here.
@buherator @Mer__edith i know users in different regions got service at different times. i remember someone mentioning on the forum that Singapore was one of the hardest hits, taking many hours to come back online. and of course there were the users who weren't affected at all, which to my understanding were mostly in North America.
i got service back in Central Europe relatively early on, when AWS was still coming back online.
What's the difference if everybody uses the same few VPS providers? The technical details of a massive outage don't really matter - the problem is that we have very large single points of failure, but the immense technical demands of modern tools mean that there are few alternatives.
@buherator
this is the thread i'm taking my info from.
https://community.signalusers.org/t/signal-service-outage-2025-10-20/72317?u=rassilon1963
If you use AWS (et.al) services it's difficult to move your SaaS to another provider. If you're using a VPS it's trivial.
(And we aren't all using Hetzner, or OVH, etc)
@yawnbox @Mer__edith Tor is basically a glorified network protocol (albeit very smart) so having it distributed by design is less of an issue.
I agree that making Signal more robust through decentralisation would be great, but this sort of thing gets more difficult the higher up the stack you go, especially when it wasn't part of the core design principles.
@davep @Mer__edith like count here also may not be accurate unless you view it from mastodon.world
@yawnbox @Mer__edith I think moxie made a pretty clear point on his view of decentralized vs centralized with his ccc-talk a few years ago.
That aside, Tor may be able to provide in terms of uptime, but latency? If I'd be in signals boots I would not build on a network run by people on all kinds of hosting with no sufficient ability to have some kind of guarantee on the QoS aspect.
@Mer__edith
I remember SETI distributed a program that could, if installed on PC's, help them to run calculations to detect alien intelligence signs from radiotelescope observation files.
I wonder why there wouldn't be an option on signal desktop to allow PC's to be able to carry part of the traffic as a node (like Session messenger does).
This could help alleviate the effects of the next AWS outage, wouldn't it?
@Teratogenese @Mer__edith History has an example of Skype (pre microsoft) "super nodes". It was very problematic.
@Mer__edith What was surprising to me was that a problem in one AWS availability zone caused a persistent outage for my Signal connection that lasted at least 3 hours. I expected there would be some sort of automated failover to another zone or cloud provider happening in the background, limiting the downtime to, say, maximum 15 minutes. I hope your internal post-mortem is taking a look at how to be better prepared next time.
@jwildeboer @Mer__edith Thereâs a thread elsewhere which I have since lost where an SRE pointed out that not all regions are equal: not only is EC1 the default region when setting up a service but it has capabilities that the other regions do not. As a result, even if oneâs AWS deployment is distributed over several regions, depending on the complexity and needs of the app, it may require EC1 for a tiny bit of functionality which, if absent, breaks the app.
@Mer__edith (For me this unexpected outage was quite dramatic, as it happened at exactly the time my son was going into surgery and we communicate a lot via Signal, so my "best wishes!" didn't go through and his "All went well" arrived with a significant delay. Murphy, I know. Things go wrong when you least expect it. He is fine and all is good :)
@toxy And that's what SRE (Site Reliability Engineering) should solve â to have a (temporary, even manual for these very rare cases) failover solution to, if needed, a completely different provider, in case that one provider goes down. That is my hope. That even if this rare case never happens again, there should be a plan available. @Mer__edith
@jwildeboer @Mer__edith I suspect this outage has been a bucket of cold water for many organisations. The other obvious factor is cold-hard cash. Iâm no expert but I suspect an iron-clad cloud-failover redundancy service at scale would be pretty damn expensive to the point of economically unviable for many outfits.
@Mer__edith @systemadminihater
There are two issues with that disaster scenario.
First - AWS has numerous healthy competitors including google's GCP and Microsoft's Azure and Oracle's OCI, and they are gaining rather than losing market share. Their immanent demise is (sadly) unlikely. So robust competition seems pretty assured. And even if those three disappeared, there are other wildly strong competitors (Cloudflare!) waiting in the wings.
Second - self hosting never went away, and at a certain scale it's not just viable but still beats AWS in many ways. Point being - a 5x price increase would mean a sudden mass exodus from AWS. But that's unlikely as well - AWS pricing is both consistent and downward trending, with certain weird exceptions.
The reality is there is no magical superior never-failing alternative to AWS. Shit happens. You do not succeed by never failing - you succeed by reducing the impact of the fails. Hosting on high-resiliency proven platforms like AWS is part of that strategy.
If Signal usage is a single-point-of-failure issue for someone - that's their problem, not Signal's.
@Mer__edith What if, instead of running a global comms platform for millions of people that requires AWS level infrastructure, we run a bunch of small, local ones that all federate and interop with each other? đ
@Mer__edith Maybe we should focus more on cloud interop, to reduce migration costs and enable better market dynamics? Too bad, EU is mostly just wasting money with their sovereignty initiativesâŚ
@promovicz what would you suggest the EU should do differently? They cannot force US companies to do anything unless it affects EU citizens, and Amazon's services aren't targeted at consumers. On the other hand, the DMCA makes reverse engineering a felony
States have no obligation to be fair with monopolies.
* regulate their prices (consumer or otherwise)
* withdraw their advantages (tax, market, regulatory)
* use cartel law
* use state money against monopolies
maybe:
* reintroduce our DMCA waivers
(Germany had a law for it)
â
@davep @yawnbox @Mer__edith Pretty sure that wouldn't even be as big of an issue as long as you don't try to exit the network.
You could even potentially improve the throughput ability by making every client that wants to use the network a node that relays traffic when it doesn't have active calls, however that's not suited to be automatically activated on mobile devices with limited power or even data caps. (But I would imagine people would be willingly donate resources to such a network if a simple separate application was offered the same way as it's done with TOR already)
@toxy My fear is that many organisations will shrug this off as "too rare to prepare for" event :( I hope Signal can help here by sharing a possible solution (should they find one, of course) @Mer__edith
@jwildeboer @Mer__edith Iâm not sure. The big cloud providers have been laying off costly (experienced) staff left, right and centre so I suspect these outages will become more common in short order. Itâs not that the current staff lack the knowledge per se but that lack of muscle memory and hands to the pump will greatly increase downtime in what are in many cases, legacy systems.
@jwildeboer @toxy @Mer__edith You need to run the failover solution all the time, because if it isn't life tested, it will very likely not work.
So you better design for at least a federated system. That's a much better design decision, anyways.
@yawnbox @Mer__edith Not once the number of active users approach even a fraction of number of active Signal users, without vastly expanding number of exit nodes etc. That could happen but it requires large scale benevolent effort and resource expenditure.
@troed @Mer__edith Than you, obviously, should have no problem setting up an alternative with the same features and sizing as signal đ
@Mer__edith @JadedBlueEyes @troed in reality the hyperscalers barely register. See https://fedidb.com/stats and https://blog.benjojo.co.uk/post/who-hosts-the-fediverse-instances for more details. Hosted from people's homes likely outnumbers all of the hyperscalers combined.
@jonathan @Mer__edith @JadedBlueEyes @troed that doesnât change the fundamental point though; you can run a mastodon server on a raspberry pi but you cannot run something like signal without using at least 1 of about 3 or 4 american tech companies
I don't know if we're having the same discussion. No, you cannot run a centralized service like Signal without doing so - but since no one is claiming that either I'm not sure what your point is.
You can however run something like Signal that's decentralized. We know this, since it exists. It's called Matrix, and many people in the Fediverse also run Matrix instances.
@Mer__edith not so surprising, as it's very difficult to do anything at any scale online and avoid AWS entirely.
The surprise shouldn't be about Signal, it should be a rallying cry to build diverse infrastructure.
Mastodon? The comparison is flawed.
You can run Mastodon without AWS, etc. just fine.
You even can run an instance in your basement if you want.
@m_berberich
@Mer__edith
Could it be that by now other messaging services are now **more #privacyâfriendly than Signal**? Thatâs whatâs being claimed here: https://privacyspreadsheet.com/messaging-apps
And, based on Meredithâs comment, would a platform like **SimpleX**âif it ever matched Signalâs user baseâ also have to rely on **AWS**? đ¤¨đ¤
@troed @jonathan @Mer__edith @JadedBlueEyes I feel we all understand the landscape, what exists and why
this just feels like a really unproductive, unhelpful thread from foss decentralised self-hosting absolutionists, trying to say they understand how to run signal better than its president
use matrix, by all means; try getting your family and friends to use it too. good luck. âperfectâ really can be the enemy of good (or in this case, privacy).
So far it seems Meredith does not know how decentralized service like the Fediverse and Matrix work (the claims that most use hyperscalers). No one is however claiming that we know how to run _the centralized service Signal_ better. We're saying maybe don't be centralized.
"try getting your family and friends to use [Matrix] too. good luck."
Thanks, yeah, the whole family does - including my elderly parents. It seems you might not know the subject you're having opinions on?
@daniel @Mer__edith Even _IF_ it were possible to create a black box version of "distributed Signal mesh node in a box" that you could run in your basement to help make Signal more tolerant - I mean with enough $ and willpower Im sure it could be done - there's still the question of: if you don't control physical access to the node, there's still potential for attack regardless of how much encryption and protection. Would you ever be able to trust it completely?
@tezoatlipoca @daniel @Mer__edith with global hyperscalers like AWS you don't exactly control who has physical access to your node either. You simply trust in Amazon holding up to their promises đ¤ˇ
@Mer__edith Does not answer the most basic issue.
Why the hell signal fail if us FAIL.
IT's simply you that messed up.
Also magnificent lie on how "only aws exist", like you are "forced" to use everything from aws and not use multiple one in multiple country.
Took nearly a week for this answer. god resign and let somone else take over.
@Mer__edith @buherator so much that 1 dns issue and the whole network is done, when you totally stable president will say to bezoz kill signal i'm sur you will survive ....
@Mer__edith At least for messaging I found that 30s latency are acceptable to most. People no longer expect their tools to be fast, and they type slowly.
That does not solve video and audio; WebRTC may help.
One step forward could be to change education so that more than a few handful people understand how peer to peer networks of the past managed to connect millions of people with a handful of servers and ISDN bandwidth.
My humble contribution:
https://www.draketo.de/software/p2p-talk.pdf
https://www.draketo.de/software/p2p-talk
@Mer__edith â push functionality out of the cloud bit by bit, as people re-learn what works and what fails.
Experience with that is scarce, though. Interest seems quite big, but polluted by crypto-coin agenda.
Anyway: thank you for your writeup!
@ArneBab @Mer__edith they are american, tech bro, they expect to have their cat picture in 0.000003s, it's so dumb, even call can have an 1 or 2 s lattency.
@lexinova do they actually expect that?
Donât they have at least 200ms of transitions and animations before they even see the message?
Iâve been communicating with IRC over Hyphanet for a while now that has 30s round-trip-time, and itâs strangely barely noticeable.
The main issue may be the notification "server received your message" (â â ) and the info "other side is typing".
@ArneBab @Mer__edith For me her answer can be 2 thing :
1. Corpo bullshit to calm userbase,
2. US fried brain that lost contact with reality on how half of the world don't have a low lattency network and everything work anyway.
@lexinova Hosting infrastructure for multimedia communication of 100 million usersš with just 50 employeesš actually is hard.
š https://backlinko.com/signal-stats
So I do not think your points apply.
@ArneBab @Mer__edith Did you remember that actually, only people who enable "relaying for call" would need this low latency ?
because by default, signal try to connect directly to the other user and relay if it fail.
As such, in most normal case, this low lattency won't apply at all.
and for picture / video even if it take 20s instead of 10 to download on your device what will it change ?
@promovicz @Mer__edith You mean something like https://sovereigncloudstack.org/ ?
The intial phase was also financed by the EU and German government ;)
@ArneBab @davep @yawnbox Note that in the specific use case of Signal: given their threat model, "direct peer-to-peer connections by default" are not desirable. You'll need to bounce the audio&video traffic by default to make it more costly to infere who is talking with whom.
So the fact that working NAT and IPv6 help rely less on TURN servers won't help decentralize that much.
@dryak Maybe a start could be to switch to direct peer-to-peer connection if Signal sees that both sides are in the same subnet (i.e. on the same wifi).
In that case they connect to the signal server for connection with a voice-data profile *at the same time* which already gives away that they are talking, so staying in the subnet with a direct peer-to-peer connection would reduce the total privacy loss.
@dryak but firstoff, to stop discussing with too little information: the reason they use a forwarding server is that a single device canât send video to 40 people via direct connections.
Hereâs their description: https://signal.org/blog/how-to-build-encrypted-group-calls/
That also shows the level of complexity involved already.
@Mer__edith does that mean that it could be made possible to fallback to a system where messages still get through, just not in real time?
i'm more concerned that signal do best practices in terms of redundancy/failover and data encryption for cloud. it's quite possible to do cloud (or multi-cloud) robustly but it takes extra effort/engineering/expense over more minimal uses of a cloud service.
can you give any details of your multi regional, multi-cloud use, active/fast failover, etc.?
i love this goal but you've done a great job of laying out why it's really hard, really expensive, and daunting enough that even the EU is struggling with it.
protocol changes can help but at a certain point, you hit speed of light and i don't think we're quite at the stage of engineering around that yet. :D
thanks for this thread. good to discuss these things, not just lay blame on a provider failure.
@Mer__edith So you know don't forget how mozzila died. like you know they took decision uppon decision against their community and died.
The community is angry that an US DNS can down nearly all the network, and you simply tell us you will do nothing.
If signal continue on this path ... you could go with mozilla in the same building i guess.
@ArneBab @davep @yawnbox there's lots of alternatives that don't need the big players - but not for organisations that need to scale like Signal *and* are pretty small themselves.
Signal is and has always been a tradeoff between ease of use, onboarding and security/reliability.
And especially in an activist context both are important. Both are also important in order to provide secure communication to the masses.
But yes, it's also advisable for communities with any tech and org skills to create backup communication because centralisation always also creates power imbalances and threats.
But currently I'm happy if I can activists off Telegram and Discord đ
@Mer__edith To clarify, are you multi-region within AWS? A bit hard to find info on this.
(A little concerning that so many people are worried about the privacy implications of Signal running on AWS... maybe some missing education on E2EE.)
@varx @Mer__edith clearly not, they messed up their infra and say it's AWS.
The only fact that US downtime lead to a worldwide downtime of signal show their system is not well done at all.
@Mer__edith but its also concerning that a failure somewhere in the US brings down Signal users in Switzerland, Turkey , Iceland, Sierra Leone and Nigeria. Yes hyperscalers are nice as they are everywhere but the dependency becomes brutal and expensive. Im running myself a backbone across multiple continents and redundancy is an absolute must. Signal has failed here. Blaming only AWS is the wrong path. Signal is big enough to do it better. Thats why I am a long term subscriber of it.
@afink @Mer__edith i definitly lost trust in signal.
because in my bood the fact they had a downtime is not the issue.
The fact they blame AWS, are condescending over the one that ask why it happen, and do absolutly noting to fix the issue on the other hand, tell more about signal ... than any bug or PR com.
@forthy42 @jwildeboer @toxy @Mer__edith and nothing prevent them to use the federated one as failover, and allow the user to force the federation if they want.
With federation they would reduce their need, reduce the price of running signal, and having federation as faillback when AWS crash again.
@Mer__edith @danstowell funny how you lie so much by omiting many possibility that would allow both AWS and federation (running at the same time), Seem you are just a fake that lick the ass of big tech while "fighting" against them in public, resign and allow true people to find a solution that work.
@Mer__edith I'm sure millions of folks appreciate your service and the work you all do. but I think that 2025 Oct 20 outage revealed Signal has a de facto SPOF on AWS us-east-1. its an architectural flaw not unique to Signal, obvs, since it bit lots of other systems around the world. I do hope your team can devise a way to change your architecture to remove that SPOF
@synlogic4242 @Mer__edith it also show the hypocrit nature of signal, saying they fight big tech, while licking their ass to maintain their server.
@Mer__edith @buherator wha you use multiple, so much to say google and amazon.
Google was useless to mitigate, and the world failed due to an US server failure, that show your design is failed.
I would fire the guy that allowed that if it happened in my company.
You, you do nothing.
@Yuvalne @buherator @Mer__edith it's an appologic message.
GCP was used, maybe, Signal failed ? definitively.
Result, it was useless the DRP failed fire the guy you made it.
Sorry but a it helped is not good enough
Use a privacy respecting keyboard then. But fair point for sure, especially for newbies
@Rhababerbarbar @geofurb @Mer__edith on many device it do not exist (aka iphone) where apple one is the most private.
@Mer__edith The even worst part imo is that usually when a smaller player comes into play and they have LOTS of brains and rare talent and passion and love for what their doing, will either get lost because people's everyday life is soooo dependent on these evil corps. OR these corps. will buy them out... BUT that means that more effort needs to be put in decentralization, as way of life of sorts. As a stand. Of stop using one thing for everything and instead try to have more things that do ONE thing and do it well. Decentralization = more control from the person and less control from the big players. Centralization is the exact opposite... But it's easy... :/
@nakdim @Mer__edith and you have tech bro like signal, where they know better and ignore the feedback when something like this happen and user are angry they whinne it's not their fault and do nothing to prevent it for the future and will repeat when the next issue will arrive.
@Mer__edith I understand what you're saying but I also wonder if maybe you've been sold a bill of goods. I run a real time communications company (admittedly very much smaller) and one of our core philosophies has always been data sovereignty. Of course as you rightly noted we can't build data centers but there's plenty of space for lease globally where you can scale you're own gear. We've made access to the physical layer a priority, maybe Signal can as well?
@lexinova Not necessarily. I recall something about AWS *itself* having some things still centralized in us-east-1. And Signal might rely on other vendors that aren't multi-region. You get the idea.
I'm honestly fine with some downtime here and there. I think people should be more chill about it in general.
@varx i'm not chill about it because they serve us fallacy and lie, and do nothing to prevent it to happen again, if they had say "we screwed up make so it don't happen again" i would have said nothing, but no here we had the corpo bullshit and a condescending answer.
@lexinova Realistically, they have to make tradeoffs about how they spend their time and energy.
They're not saying it here (and it would be nice if they did!) but they could choose between multi-cloud resilience or doing product development. I think most people would prefer the latter.
@varx here for me, it's not a tradeoff, they just try to ignore it and do nothing about it, saying hyperscaller is a lie, my company use hyperscaller, but unlike the liar of @Mer__edith you can combine them, by having US one in USA, EU one in EU, CN one in china etc.
they put all their eggs in the same basket and now they lie their teeth out to make us thing it's not their fault.
@varx Also @Mer__edith try of deflecting the responsability show she's unfit to be the ceo of @signalapp as #CISO of my company i had to appologize and take the blame for multiple time where partner screwed up.
because when you are the chief you are responsible
AWS failed, she's the signal chief it's HER responsability, so her fault.
if she cannot accept this, she's not a good CEO and must resign.
@jdormansteele2 @varx but for many of us again it's not the fact the issue happen that make us angry.
it's the fact signal do not take it's responsibility and blame their partner, as i said in another post the chief of a product is always responsible, and blaming other won't change this.
Their reaction to this failure disapoint me even more than the issue itself :
1 week to adresse de issue.
2 when adressing it blaming partner
3 be condescending on other and lie on hyperscaler necessity.
Theses 3 point are what piss me off ... not the fact Signal failed after AWS
@simonzerafa @ericfreyss @Mer__edith
i do want to mention something important, which is that Signal doesn't only use AWS, and actually the fact they don't was key to them recovering faster than AWS did by rerouting users. however when the issue on AWS is as fundamental as the DNS for DynamoDB being broken, there's gonna be downtime for some users, there's not really a way to go around that.
looking at your replies to replies here that seem to make sense to me (especially re decentralization), you're telling them they don't know what they are talking about. well I definitely don't.
like with debates re #ATproto and #ActivityPub, I have thoughts but I know that we really need to see the experts debate each other more somehow. I don't think it happens enough. so I'd say the same re #signal and #matrix etc.
@wjmaggos meredith gets a lot of replies that could be answered with a little bit of research or could be answered by anybody, which may explain the short replies; but i can try to answer you
@wjmaggos what meredith is explaining is the scope of the infrastructure that signal absolutely needs in order to provide the service it does at the quality it requires, and that there are extremely few options that satisfy those needs
to say building their own infra would be prohibitively expensive wouldn't begin to describe it
@wjmaggos signals threat model and security guarantees can't be met with decentralization like matrix, fedi, or atproto, either. when disparate servers communicate, they have to know how to relay messages between each other, which leads to a lot of metadata leakage (as is the case with matrix)
tor likewise has mitigations for time based correlation attacks, which are great for its use case; but would cripple signals quality
@wjmaggos i agree that it sucks to rely on amazon (or google or microsoft), but its the reality of internet infrastructure right now
it's relatively true that there are some attacks one could leverage in this scenario (particularly "harvest now decrypt later" attacks), but signal was very quick to adopt post-quantum cryptography which should provide a pretty good confidence that those attacks aren't feasible for the foreseeable future
@xyhhx @wjmaggos @Mer__edith ewwwww signal shill ewww hey fedi cancel this user
@soop @wjmaggos @Mer__edith @xyhhx just say you don't have friends and keep it pushing dog
@repeattofade @troed @Mer__edith @JadedBlueEyes those are great arguments, but the one Meredith is taking on this thread is "look, Mastodon can't/doesn't do distributed infra" when the data says that actually yes it does.
I actually don't have a strong view on the centralisation of Signal, but seeing Meredith talk down to people (see replies on this and other threads) and then be wrong on the facts is pretty galling.
@mrRobot
@Mer__edith
Not every service has to implement every bestâpracticeâusers have different threat models. For me, **Perfect Forward Secrecy (PFS)** is essential, and itâs missing from both Threema and iMessage. Likewise, **clientâside scanning** in iMessage is a hardâno. Without PFS and with onâdevice content analysis, the communication isnât private for my needs.
In the past, Signal was the messenger with bestâpractice safeguards
@jack @Mer__edith Iâve actually been using #SimpleX recently myself in addition to #signal for conversations where I want the strongest privacy posture. Itâs doing a lot right: strong FS properties, reduced metadata exposure, and no central identity system. So Iâm not arguing from inside Appleâs walled garden here.
Though I still get uneasy when #privacy gets reduced to a binary grid of checkboxes. Private enough always depends on who the adversary is, what they want, and what the people youâre talking to will actually use. Cryptography isnât the only variable in the real world.
On #Apple/iMessage: the original client-side photo-hash plan was a real problem, though that rollout was scrapped. The current âcommunication safetyâ feature for minors doesnât break adult E2EE. It might evolve into something dangerous later, but at the moment calling it disqualifying feels more like a principled fear of direction than a present-day compromise.
#Threemaâs history with limited PFS and audit transparency is worth criticizing. Though if theyâve now implemented PFS across the board, Iâd prefer to see more independent scrutiny before writing them off as perpetually ânot private enough.â Once again, context matters.
SimpleX gives stronger guarantees for high-value targets, sure. Yet adoption friction introduces its own kind of risk. If I push everyone I know into tools they wonât stick with, theyâll drift right back to far worse defaults. Usability failures can erase all the theoretical privacy wins.
So my stance looks something like this:
⢠Use the strongest tool that your social graph will realistically sustain
⢠Treat all vendors as temporary allies, not saviors
⢠Demand transparency and scrutiny from whoever holds the keys
⢠Always ask âprivate enough for whom?â
Security is a negotiation with reality, not a purity contest. You and I may have similar goals and even use the same tool for sensitive chats. We just assign slightly different weight to the tradeoffs.
Happy to keep comparing notes as these apps evolve. Yesterdayâs âmaximumâ becomes tomorrowâs minimum pretty quickly.
@mrRobot
@Mer__edith
> Demand transparency and scrutiny from whoever holds the keys
Is it really possible to achieve full infrastructure transparency for a Contact Discovery Service? Can we actually verify that it runs inside a protected AWS VM (with eg ORAM) and not just an untrusted execution environment?
Considering that social graphs are a **highâvalue target for intelligence agencies**, such verification would be crucial. Or am I wrong?
@troed @Mer__edith hopefully the pros and cons in this comparison could provide useful context:
https://hapyyr.com/@bogo/115401249466782443
But ultimately this is not a conversation about features. It is about priorities. And as already said on this thread, decentralization is not compatible with low latency.
I've indeed seen several people claim that low latency requires centralization, but I don't think I've seen anyone show why that would be true.
My (way back when) background is in telecom and that has always been low latency and decentralized.
I've been using #Signal from the very beginning (TextSecure times), and I've been advocating Signal a lot.
But the centralized architecture, instead of a federated decentralized approach is something I never liked. Also the focus on BigTech platforms (IOS, Android) is something I do not like. I'm using a #Librem5 #Linux phone, but there is no official primary client for Linux.
Still, I'm donating, but I would appreciate addressing centralism and #BigTech dependency.
I donât believe federation is possible without leaking a huge amount of metadata, but I believe decentralisation is. It would be great to see Signal actively investigating the options in the design space that enable it.
The client side is a bigger problem. The choice of license has two major impacts:
It is very rare for a protocol to become ubiquitous without a permissively licensed reference implementation.
@troed would it sound better to you if we substitute "requires" with "is easier with"? At least from my point of view this shifts the burden of proof to you.
@mapto It's not like I'm talking about some unknown magic here.
https://social.woefdram.nl/item/a6a9ef48-97a5-40d0-a953-4dafcd367253
@yawnbox @Mer__edith How about a North American Signal, and a European Signal?
Those two could federate and if a government on either continent goes bonkers, it can at least not take down all of Signal.
Fair point.
I would say creating a privacy respecting keyboard app would be better then?
Thats like using a #VPN only in the browser, or adding #FreeSoftware apps on a system full of #spyware... doesnt make much sense and you leak more data than you think.
@Rhababerbarbar i'm using graphene so my keyboard is safe, but i agree moving to another os is the priority in those case.