澶辨晥閾炬帴澶勭悊 |
Data Algorithms with Spark PDF 涓嬭澆
鐩稿叧鎴浘錛?/strong>
![]() 涓昏鍐呭錛?/strong>
Spark Architecture
computer in a reasonable amount of time. When you have large volumes ofdata,using a single computer to analyze and process that data (and store it)might be prohibitively slow, or even impossible. This is why we want to useSpark.
Spark has a core library and a set of built-in libraries (SQL, GraphX錛孲tream ing, MLlib), as shown in Figure 1-3.As you can see, through itsDataSource API,Spark can interact with many data sources, such as |