  • Streaming Systems


    1 Streaming

    Table of Contents

    • 1 Streaming
      • What is streaming
        • On the Greatly Exaggerated Limitations Of Streaming
          • Lambda Architecture
        • Event Time VS Processing Time
          • Event Time
          • Processing Time
      • Data Processing Patterns
        • Bounded Data
        • Unbounded Data Batch
        • Unbounded Data Streaming


    What is streaming

    Streaming System
    Two important dimensions define the shape of a given dataset: cardinality and constitution.

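    The cardinality of a dataset dictates its size.
    The most salient aspect of cardinality is whether a given dataset is finite or infinite.
    Two terms describe the coarse cardinality of a dataset:
    Bounded data: a type of dataset that is finite in size.
    Unbounded data: a type of dataset that is infinite in size (at least theoretically).
    Cardinality matters because unbounded datasets impose additional burdens on the frameworks that consume them.
    The constitution of a dataset dictates its physical manifestation.
    Table: a holistic view of a dataset at a specific point in time.
    Stream: an element-by-element view of the evolution of a dataset over time.
    Streams are the constitution that pipeline developers directly interact with in most data processing systems today (both batch and streaming).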
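The distinction between bounded and unbounded data, and between a table view and a stream view, can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names (`bounded_clicks`, `unbounded_clicks`, `running_totals`) and the sample values are hypothetical and do not come from the book.

```python
import itertools
from typing import Iterable, Iterator, List


def bounded_clicks() -> List[int]:
    """A bounded dataset: finite, so it can be fully materialized."""
    return [3, 1, 4, 1, 5]


def unbounded_clicks() -> Iterator[int]:
    """An unbounded dataset: an iterator that never ends on its own."""
    return itertools.cycle([3, 1, 4, 1, 5])


def running_totals(stream: Iterable[int]) -> Iterator[int]:
    """Stream view: emit an updated total element by element."""
    total = 0
    for value in stream:
        total += value
        yield total


# Table view: a single holistic answer, available once all input is seen.
table_total = sum(bounded_clicks())

# Stream view over unbounded input: the consumer has to decide when to
# stop reading, because the input itself never finishes.
first_seven = list(itertools.islice(running_totals(unbounded_clicks()), 7))

print(table_total)   # 14
print(first_seven)   # [3, 4, 8, 9, 14, 17, 18]
```

Calling `sum()` directly on `unbounded_clicks()` would never return, which is one concrete form of the extra burden that unbounded data places on its consumer.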

    constitution | how something is made up of different parts

    On the Greatly Exaggerated Limitations Of Streaming

    Lambda Architecture

    The basic idea is that you run a streaming system alongside a batch system, both performing essentially the same calculation.
    Unfortunately, maintaining a Lambda system is a hassle: you need to build, provision, and maintain two independent versions of your pipeline and then also somehow merge the results from the two pipelines at the end.
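A toy sketch can make the duplication concrete. It assumes a simple per-user event count as the shared calculation; the names `batch_counts`, `SpeedLayer`, and `merged_view` are hypothetical and are not taken from any real Lambda implementation.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple

Event = Tuple[str, int]  # (user, hour in which the event occurred)


def batch_counts(events: Iterable[Event]) -> Counter:
    """Batch pipeline: recompute exact counts from the complete log."""
    return Counter(user for user, _hour in events)


class SpeedLayer:
    """Streaming pipeline: incremental counts for events that the
    batch pipeline has not processed yet."""

    def __init__(self) -> None:
        self.counts: Counter = Counter()

    def on_event(self, event: Event) -> None:
        user, _hour = event
        self.counts[user] += 1


def merged_view(batch: Counter, speed: Counter) -> Dict[str, int]:
    """Merge step: combine the results of both pipelines at query time."""
    return dict(batch + speed)


historical = [("ann", 0), ("bob", 0), ("ann", 1)]  # covered by the batch run
recent = [("ann", 2), ("bob", 2), ("bob", 2)]      # arrived after the batch run

speed = SpeedLayer()
for event in recent:
    speed.on_event(event)

view = merged_view(batch_counts(historical), speed.counts)
print(view)  # {'ann': 3, 'bob': 3}
```

Even in this toy version the counting logic exists twice, once in `batch_counts` and once in `SpeedLayer.on_event`, and the two copies have to be kept in agreement by hand.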

    hassle > noun a situation that is annoying because it involves doing something difficult or complicated that needs a lot of effort (difficulty; trouble)

    As someone who spent years working on a strongly consistent streaming engine, I also found the entire principle of the Lambda Architecture a bit unsavory.

    unsavory > adj unpleasant, or morally offensive

    corollary > noun something that results from something else

    antiquity > the distant past (= a long time ago), especially before the sixth century

    Event Time VS Processing Time

    cogently > in a way that is clearly expressed and is likely to persuade people

    To speak cogently about unbounded data processing requires a clear understanding of the domains of time involved.

    Event Time

    This is the time at which events actually occurred.

    Processing Time

    This is the time at which events are observed in the system.

    skew > verb to cause something to be not straight or exact; to twist or distort
    adj not straight

    In an ideal world, event time and processing time would always be equal, with events being processed immediately as they occur. Reality is not so kind, however, and the skew between event time and processing time is not only nonzero, but often a highly variable function of the characteristics of the underlying input sources, execution engine, and hardware.

    • Shared resource limitations, like network congestion or network partitions
    • Software causes, such as distributed system logic or contention
    • Features of the data themselves, like key distribution, variance in throughput, or variance in disorder
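The skew described above is simply processing time minus event time, measured per event. The sketch below uses made-up integer timestamps in milliseconds; the `Observation` type and its field names are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Observation:
    event_time_ms: int       # when the event actually occurred
    processing_time_ms: int  # when the system observed the event


def skew_ms(obs: Observation) -> int:
    """Skew of one event: processing time minus event time."""
    return obs.processing_time_ms - obs.event_time_ms


observations: List[Observation] = [
    Observation(10_000, 10_500),
    Observation(11_000, 11_200),
    Observation(12_000, 19_000),  # delayed, for example by a network partition
    Observation(13_000, 13_100),
]

skews = [skew_ms(o) for o in observations]
print(skews)  # [500, 200, 7000, 100]

# Because the skew varies, the order of arrival differs from the order
# of occurrence: the event from 12_000 shows up after the one from 13_000.
arrival_order = [
    o.event_time_ms
    for o in sorted(observations, key=lambda o: o.processing_time_ms)
]
print(arrival_order)  # [10000, 11000, 13000, 12000]
```

The third observation shows why the skew cannot be treated as a constant offset: a single delayed event is enough to make the data arrive out of event-time order.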

    congestion | a situation in which a place is too blocked or crowded, causing difficulties

    plot > verb to mark or draw something on a piece of paper or a map
    noun the story of a book, film, play, etc.

    contention | the disagreement that results from opposing arguments
    underlying > real but not immediately obvious

    Data Processing Patterns

    Bounded Data

    Unbounded Data Batch

    Unbounded Data Streaming

  • Original article: https://blog.csdn.net/chixushuchu/article/details/127674714