Building Microservices Oreilly - Summary and Notes - Chapters 1 - 4
Here's a copy of my loose notes from this great introduction to Microservices. It's a really novel way of looking at an old problem. It's also increasily relevant as I am becoming responsible for larger and larger systems. The world has come a loooooong way since I started getting into development. It's interesting (and a little scary) to see so many of the patterns we've grown accusomted to over the years, finally start to change again.
Preface:
Micro Services - Distributed systems with their own lifecycles which collaborate together, modeled around business domains
Chapter 1:
Domain-Driven Design: representing the real world in code
Continuous Delivery: easily get software into production, every check in
Hexagonal Architectures: less layered architectures to hide business logic
Virtualization: provision and resize machines at will
Agile: Small teams owning lifecycle
Netflix: how to build anti-fragile systems at scale (Microservices)
MS do one thing well, like Linux commands
We want modular code, but that breaks down in big systems
- Gather together those things that change for the same reason, and separate those things that change for different reasons
As you build smaller systems interdependence increase, but so does complexity from moving parts
You should be able to change independently of each other and be deployed by themselves without requiring consumers to change
The golden rule: can you make a change to a services and deploy it by itself without changing anything else?
Each MS and its team can pick their own tech stack
Ex: social network, stores user interactions in graph DB, posts in document store
Should be able to rewrite Microservice in two weeks, this mitigates risk of embracing new tech
Resilience Engineering: the bulkhead, failure shouldn't cascade, isolate the problem and the rest of the system can carry on working
Monolithic: Scale everything together, big and risky to change, one code change and redeploy everything
Microservices: Scale just the parts we need, deploy that service independent, align arch to organization, design for everything, not just web, easy to replace, can be complicated, it's not a golden hammer
Ms is NOT SOA
Ms isA form of SOA
Shared libs, this looses heterogeneity, same language, same platform, but they do have their place
OSGI is a thing, we may see it in java 9, more complex than benefits, lends to coupling
Chapter 2
This is the authors opinion of the architect role
Arch and tech lead are same role, vision for a program of work, coordinate teams
documents and diagrams to crate a perfect system in unknown future
let's focus on creating framework so the right systems can emerge, continue to grow as we learn more
arch is like a town planner for the environment, we optimize the layout of the city
we help influence direction, don't build a sewer here, do that here, (we are zoning)
teams can pick a diff tech stack, but remember they need to move between teams too
a system doesn't accommodate users, it also accommodates devs and ops
multiple messaging systems are a nightmare to consume, stay focused
be worried about what happens between the boxes and be liberal in what happens inside
join a team for a short period and pair program
rules are for the obedience of fools and the guidance of wise men
find out where the org is heading and align technology to it
principals (goals)
ex:
- Decrease time to market: delivery teams have full control over sdlc
- Grow in other countries: principle entire system to be portable and allow for local deployment
practices: coding guidelines, log data centrally, rest, only centos, dev teams own lifecycle, everything in aws,
* change more frequently
these guide how the system evolves
use sample code people can run the embody these p and p
Netflix: a cohesive system made of many small parts with autonomous lifecycles, but all coming together (don't loose sight of the bigger picture)
monitoring: system wide, not service wide, all services emit health and monitoring
metrics: Graphite
health: Nagios
(integrate means connect to each other, interface means what the do this on)
Interfaces: HTTP/REST
* pick a small number of defined techs to help integrate
be wary of failure of downstream calls
governance: setting controls in place for sticking to the vision
* make it easy to do the right thing
* use (live, domain specific) code examples (exemplars) to show best practices
programs to help generate this stuff, Dropwizard and Karyon
mandate circuit breaks like Hystrix
mandated frameworks can make for unhappy developers
deviating from tech vision might have short term gain, but long term cost
the arch looks at bigger pictures and balances
arch job ensures tech vision, governance ensures we are building matches this vision
don't require working practices that make devs miserable, keep up to date with tech, knows when to make the right tradeoffs
chair (and champion) the group that makes these decisions
be cool, let teams make mistakes and learn from them
ms has more opportunity to own something
Chapter 3 - Modeling Services
Let’s think about boundaries of ms through MusicCorp example
two keys: loose coupling and high cohesion
loose coupling
* doesn't bind tightly to other services so changes don't cause other services to change too
* limit number of calls between services, chattiness is tight coupling
high cohesion
* related behaviors sit together
* not having to make changes in lots of different places (slow and risky)
bounded context: domain has multi bctx with things in them that don't need to be communicated outside. they share some things with other ctxs, they all have interfaces that decided what it will share
* they have specific responsibility enforced by explicit boundaries
* cells are similar, they define what comes in, what stays in, and what goes out
ex: finance and warehouse dept, they have internal process and external data, finance doesn't care how warehouse works, but needs to know some things about inventory, warehouse can have an internal representation and a diff external representation we expose.
* same name could have diff meaning in contexts, a return; customer preints shipping label, sends package, waits for refund. the warehouse creates restock request, waits on package, restocks item
think about what should be shared and how it's diff from your internal rep, identify a boundaries in domain where like minded capabilities should live
bounded contexts are like compositional boundaries: we can keep modules hidden inside boundary to reduce coupling. lets find bounded contexts and create them as modUles with hidden models
* MSs should clearly align to bounded contexts
* service boundaries align to bounded contexts
* MS represent bounded contexts
don't split them to quickly, can lead to issues, example of doing this and merging back to monolithic to better understand, then splitting back a year later. its actually easier to split existing than a new one because of experience
don't think about data shared between ms think about capabilities contexts provide to the rest of the domain
* this can lead to designing CRUD based services
* "what does this context do", then "what data does it need to do that"
BCs can contain other BCs, you can even take a warehouse BC and split it out into parts others can use after a while
* warehouse split to.... order fulfillment, goods receiving, inventory
if a team ends up maintain multi MSs, consider nesting them, are they closly tied?
chunk of arch to simp testing, don't want to stub multi services in the warehouse context
think of data sent around MSs like hand filled forms
case study: incorrectly modeled system grows, front half stateless, back half rpc directly connected to DB, passing around db objects makes this brittle, change db, have to change everything (onion arch), but not if you're just passing what you need through the web services
Chapter 4 Integration (how these things talk to each other)
Getting integration right is most important for MS
if ms adds new fields when sending out, existing consumers shouldn't be impacted
avoid integration tech that dictates what tech stacks we use to implement our MS (SOAP forces Java)
a DB is basically a big shared API that when changed causes waves down your application, lets hide the db and focus on the concepts that pass around the app
split it up right and you can store some of your data in a type of db and other data in a document store or something, do this by hiding db implementation from consumers (loose coupling)
don't spread logic amongst consumers (bad for cohesion), only customer services should change customers
synchronous: waits for operation to complete
* request response: init request wait for response, but can be async too with callbacks (let me know when done)
asynchronous: doesn't wait, carries on
* event-based: says "this thing happened", other parties know what to do with that (highly decoupled)
orchestration: complex locic to deal with processes that stretch across the boundaries of services (create customer, send email notification, inform billing, all handled by one big brain), inherently request/response
choreography: say customer was created, decoupled, subscribe to this event, business process don't map as well to this, additional work to track, needs good monitoring
comm methods...
rpc: remote procedure call, soap, protocol buffers, thrift. separate interface def, tight coupling same underling tech, gen client and server stubs, rmi fast start but coupled, removes complexity, if lots of in process calls marshalling and unmarshalling can be significant bottleneck, the network is not reliable, brittle, if you change the interface, everything needs to be updated (lock step), rpc still better than database integration
protobuffs and thrift mitigate by working better with missing fields
rest: has "resources", internal rep could be diff than ext rep, has "verbs", get post put and these can all point to one endpoint, put post get from one endpoint, built on http with chacing and load balancers, security and auth certs. soap over http but ignores verbs
HATEOAS: hypermedia as engine of application sate, links, web page, user follows links, clients should perform interactions with the server via these links to other resources
* at amazon.com shopping cart button has moved over time, humans smart enough to see it though
* the gist: a response with product data, but also a "link" embedded in an xml response to the purchase endpoint
* this works well with xml, but not json
* big decouple between client and server
* it's a little chatty
* some rest implementations are starting to look like rpc, not good, watch out
* cant easily generate stubs for rest
* jersey has proper rest support (put and delete)
* rest over http good choice for service to service
websockets: stream data for browser, low latency, small, not well supported, can run on the fast UDP protocol (less error checking)
--- event based async communication
rabbitmq: publish event to broker allow consumer to be informed, good for loosely coupled, event-driver arch, careful not to push to much logic to the middleware (the enterprise service bus)
atom: rest compliant, publish event to feed, consumers poll, careful the message could get processed twice (link: synchronized messaging), we have to manage shared state
with event based, be careful, lots of things can go wrong when things go down
ex: bank pricing system, events picked off queue would bring down worker, failed backed to the queue and eventually brought down all the workers, implemented a message hospital (dead letter queue) to be manually addressed
* use good monitoring, and correlation ids to trace messages through the system
reactive ext (rx): observe outcome of operations, solves multi calls to single operation, (like how zookeeper notifies all listeners something happened, observer pattern?)
---
shared libraries dangerous in ms arch, avoid, creates tight coupling, changes to shared code causes ripple effect
ex: common domain object changes, all services now need to be updated, and messages in queus are invalid (things like logging libs should be find though), logging libs ok, you can hand copy in libs to avoid coupling, keep as much logic out of client libs as possible
Netflix libs do service discovery, failure modes, logging, and things unrelated to the service they perform, that's ok to share, but even still can be problematic, we need to be able to release independent of each other
if a service retrieves a customer record, then passes info of it to another service, that means it could be out of date in the future, make sure to pass the id for sanity checks
try to avoid versioning, rest helps with this because internal changes don't break external output (if done right), your parsing should be tolerant of changes in rest payloads, accept alot, send a little
semantic versioning: MAJOR, backward incompatible, MINOR, backwards compatible, PATCH, bug fix no breaks
this works well with dist systems
coexist interfaces for backwards compact, give time to move over, then deprecate (expand api, then contract)
/v1/customer
/v2/customer
but have to fix bugs in two places
think holistically, desktop, mobile device, wearable, brick and mortar store, your services will feed them all
weigh the benefits, binary proto fast, but bad for web browsers, maybe your service could take params and return only what is asked
or you can put services over services to allow web to interact easier, a backend for frontends, this is not an api gateway which connects to all services for ease of access but can end up holding too much behavior
------
buying services: COTS commercial off the shelf software, SaaS, "buy if your use of the tool isn't special to your business", this come with lack of control though, the vendor controls your destiny, expensive to customize, migrate off SaaS with a service front end that can eventually be switched out when ready or in steps, intercept calls and use "strangler application pattern"
avoid database integration, consider rest/rpc tradeoffs, but rest and request/response integration a good starting point, choreography over orchestration, avoid a system the requires versioning with tolerant response reading, use compositional layers