Video Overview
This video takes an in-depth look at data pipeline design on the AWS platform and at the development, optimization, and deployment of Apache Spark programs, with particular emphasis on improving code quality through test-driven development (TDD). It covers key data-engineering concepts, including how to build data pipelines, optimization strategies for Spark jobs, and how to deploy large-scale data processing tasks efficiently.

The video details how AWS is applied in data engineering, for example using services such as Amazon S3, Lambda, and EMR to build pipelines and combining them with Spark for large-scale processing. The TDD approach is introduced to keep code stable and maintainable and to make the development process more disciplined. Through hands-on demonstrations, viewers learn how to apply unit testing and continuous integration during development to improve the reliability of data processing tasks.

The video is aimed at data engineers and developers who want to strengthen their data processing skills, especially professionals working in cloud computing and big data analytics. It covers both fundamental concepts and practical guidance, helping viewers master best practices for building efficient data pipelines on AWS and laying a solid foundation for further work in data engineering.
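As a rough illustration of the TDD workflow described above (not code from the video), the sketch below pairs a small Spark transformation with a pytest-style unit test that runs on a local SparkSession. The function name `clean_events`, the column names, and the sample rows are all illustrative assumptions.

```python
# Minimal sketch: a pure Spark transformation plus a unit test,
# the kind of test-first loop the video attributes to TDD.
from pyspark.sql import DataFrame, SparkSession
from pyspark.sql import functions as F


def clean_events(df: DataFrame) -> DataFrame:
    """Drop rows with a null user_id and normalise the event_type column."""
    return (
        df.filter(F.col("user_id").isNotNull())
          .withColumn("event_type", F.lower(F.trim(F.col("event_type"))))
    )


def test_clean_events_drops_nulls_and_normalises():
    # A local[1] session keeps the test self-contained; in CI this would
    # run on every commit, before the job is deployed to EMR.
    spark = SparkSession.builder.master("local[1]").appName("tdd-demo").getOrCreate()
    source = spark.createDataFrame(
        [("u1", " Click "), (None, "view"), ("u2", "VIEW")],
        ["user_id", "event_type"],
    )

    result = clean_events(source).collect()

    # The null user_id row is removed; event_type is trimmed and lower-cased.
    assert [(r.user_id, r.event_type) for r in result] == [("u1", "click"), ("u2", "view")]
    spark.stop()
```

Keeping the transformation as a plain function of DataFrames is what makes this testable locally; the same function can then be wired into an EMR job that reads from and writes to Amazon S3.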