Evaluation of data collection table design In task 1, a new customer designed a data collection form for entering information and Megavideo employees were asked to create a database for entering customer details. It is necessary to set phone mask input mask, gender confirmation rule, county default value, and finally report on all Megavideo members. The target audience of my data capture table is a new Megavideo customer, because they have to fill it. My database's target audience is an employee of Megavideo.
Designing the term list: The term list records the assessment of due diligence activity. This is a combination of supply and demand for risk management, negotiation, and navigation. The list of terms must be of entrepreneurs, and / or VC wants to educate entrepreneurs and need to help in the process. This will be a partnership between the parties, preferably with a solid and reliable foundation. Decision-Making Governance: According to the company's management model, there are companies that need GP sponsorship, companies that use the voting method, companies that must obtain consensus (in my opinion, the priority is low). This kind of thinking can bring high returns if it turns out to be the opposite fact. By definition, the opposite way of thinking is not consistent. Reward incentives affect decision making, so strict analysis is necessary to ensure consistency between them.
First, I will introduce the concept of data modeling, the design process that carefully defines the relationship between table schema and data to understand the business schema and dimensions. Learn more efficient queries and data partitioning that allows you to backfill data. After this section, readers will understand the basics of data warehousing and pipeline design. In the next section, I will analyze Airflow's work anatomy. Readers learn how to use sensors, operators, and transport to manipulate the concepts of extraction, transformation, and loading. We will focus on ETL best practices based on actual examples such as Airbnb, Stitch Fix, Zymergen.
Configuration: ETL is obviously complicated and you need to be able to describe the data flow of the data pipeline briefly. Therefore, it is important to evaluate how to create an ETL. Will it consist of UI, domain specific language, or code? Today, the notion of constructing code makes it more general, as it allows the user to program customizable pipelines programmatically. UI, monitoring, warning: Even if there is no error in the job itself, an error (such as cluster failure) inevitably occurs in a long-running batch process. Therefore, monitoring and alerting are important to keep track of the progress of long-running processes. How does the framework provide visual information on the progress of work? Do you display alerts and warnings in a timely and accurate manner?