riversongs Posted February 5, 2024 Report Share Posted February 5, 2024 Free Download Generative Ai For Data EngineeringPublished 2/2024MP4 | Video: h264, 1920x1080 | Audio: AAC, 44.1 KHzLanguage: English | Size: 2.80 GB | Duration: 6h 8mHands-On Beginner's Guide to GenAI and LLMs for Data Engineering with Python and SQLWhat you'll learnUse large language models to create Python code to implement data pipelinesUse LLMs to solve data loading, data transformation, and data quality assessment challengesCreate databases and analytic data models using generative AICreate Python, SQL, and Bash scripts to implement common data engineering tasksRequirementsFamiliarity with working with data, such as with spreadsheetsAbility to read and follow Python and SQL code is helpfulDescriptionGenerative AI tools such as ChatGPT, Claude, and Bard are making data engineering more accessible and more efficient.If you work with spreadsheets or business intelligence tools but aren't too familiar with Python or SQL, then generative AI can help you analyze data and build your own data pipelines and ETL/ELT processes.If you are a data engineer, then GenAI can help you focus your efforts on the problem domain and designing a data architecture while spending less time writing code that can be generated by a machine.Generative AI and LLMs will not replace data engineers or data analysts but those who know how to use these AI tools will be able to build more capable and reliable data pipelines faster. They will also have access to a tool that can help you develop your Python, SQL, and data modeling skills by providing a variety of examples of functional code and help with error messages and troubleshooting processes that do not work as expected.Learn Data Engineering Techniques as Well as Data Engineering ToolsIn this course, you will learn how to break down data engineering problems into a series of tasks that can be automated using Python, SQL, and command line scripts generated by a large language model (LLM). Prompting an AI to "generate a data pipeline to do X, Y, and Z" will probably not get you the results you expect. LLMs are powerful tools, but they are not oracles. As with any tool, we need to understand what the tool is capable of and how to use the capabilities to meet our needs. This course shows you how to think through a data engineering problem, incrementally build components of a solution, and combine those components into functional data pipelines.This course is organized into several topics that cover the fundamental skills needed to begin work in data engineering using GenAI, including:Introduction to large language models, foundation models, and other AI topics related to data engineering. This course uses Claude AI from Anthropic, a large language model that is both well suited to data engineering code generation and free to use.Working with CSV and JSON filesData quality and data cleaning, including statistics and visualizationsExtraction transformation and load (ETL)/ extraction, load, and transform (ELT) processesRelational and NoSQL databasesData modeling using dimensional data model patternsWorking with JSON data in relational databases such as PostgreSQLUnderstanding more advanced components of the modern data stack, including Apache Airflow, Apache Spark, Great Expectations and dbtThe course begins with the most basic of data engineering tasks: working with files. You will learn how to quickly filter, transform, and find problems in data sets made up of comma-separated value (CSV) and JSON files. You'll also see how we can create samples from large data sets to efficiently experiment with different solutions to our data engineer needs. You will learn how to generate code that uses command line utilities like awk, a text processing and data extraction tool, and jq, a tool for parsing, filtering, and transforming JSON data. If you are not familiar with tools like awk and jq, that is no problem. In this course, you will learn how to describe what you want in a solution so the LLM can choose an appropriate tool for the job.Data quality is a primary concern in any data engineering project. Fortunately, with GenAI and a basic understanding of data quality checks, you can quickly generate scripts to check for common data quality problems and apply transformations to the data to correct for those problems. Statistics and visualizations are important tools for ensuring data quality. In this course, you will learn how to use basic statistics and visualizations to help with data quality and data exploration. And because generative AI is used to generate code, you can spend more time learning about statistics, visualizations, and how to apply them to your problem domain and less time trying to find syntax errors or debug a logic error in your code.Databases are the foundation of many applications and data analysis platforms. You will learn about relational databases as well as NoSQL databases and when to use them. Databases are complicated systems that require that we describe how we want to structure our data. This process is known as data modeling. This course will introduce data modeling with a focus on dimensional modeling, a commonly used data model pattern in data analytics. You will also learn how to generate SQL code to implement dimensional models, load data into your database, and query and analyze data once it is loaded.The course concludes with an explanation of more advanced data engineering tools, including Apache Airflow for data pipeline orchestration, Apache Spark for large-scale analytics and machine learning, Great Expectations for data quality control, and dbt for transforming data. These tools are widely used in data engineering but have required some coding skills to use. With generative AI, they are now more accessible to those who understand how to use LLMs like Claude, Bard, and ChatGPT.Now is a great time to become a data engineer because the demand for data engineering skills is high and we now have tools in place that allow us to focus on the problems we are solving while accelerating how quickly we can create scalable, reliable data pipelines.OverviewSection 1: IntroductionLecture 1 IntroductionLecture 2 Course OverviewLecture 3 Data Engineering, Data Analysis, and Data ScienceLecture 4 Generative AI and Large Language Models (LLMs)Section 2: Working with CSV FilesLecture 5 CSV and JSON FilesLecture 6 Command Line Utilities for Working with FilesLecture 7 Filtering Rows of a CSV FileLecture 8 Combining Commands in a Shell ScriptLecture 9 Sampling with PythonLecture 10 Modifying a Shell ScriptLecture 11 Scheduling Jobs with CronLecture 12 Cron Job Scheduling ExampleSection 3: Working with JSON FIlesLecture 13 Working with JSON FilesLecture 14 Installing jqLecture 15 Filtering JSON files with jqLecture 16 Loading JSON into PythonSection 4: Data Quality in Data EngineeringLecture 17 Overiew of Data QualityLecture 18 Sales DataLecture 19 Missing Values in Data FilesLecture 20 Adding Documentation to ScriptsLecture 21 Range ChecksLecture 22 Working with Dates and TimesLecture 23 Checking Date and Time FormatLecture 24 Visualizations for Data QualitySection 5: Working with Pandas in PythonLecture 25 Working with PandasLecture 26 Statistics using DataframesLecture 27 Generating Synthetic DataSection 6: JSON SchemasLecture 28 JSON Schemas for Data Validation 1Lecture 29 JSON Schemas for Data Validation 2Section 7: Working with DatabasesLecture 30 Relational DatabasesLecture 31 NoSQL DatabasesLecture 32 PostgreSQLLecture 33 Installing PostgreSQLLecture 34 Creating PostgreSQL SchemasLecture 35 Creating Tables in PostgreSQLSection 8: Dimensional Modeling for Data AnalysisLecture 36 Dimensional ModelingLecture 37 Loading Sales Data into Staging TablesLecture 38 Loading Dimension Data into Staging TablesLecture 39 Creating Location DimensionLecture 40 Creating Products DimensionLecture 41 Create Date DimensionSection 9: Populating a Fact Table in a Dimensional ModelLecture 42 Creating a Sales Fact TableLecture 43 Preparing Data for AggregationLecture 44 Aggregating Staging DataLecture 45 Loading Sales Fact TableLecture 46 Generating SQL Queries for a Dimensional ModelSection 10: JSON in PostgreSQLLecture 47 JSON in PostgreSQLLecture 48 Creating a Table with a JSON Column in PostgreSQLLecture 49 Loading JSON Data into PostgreSQLLecture 50 Querying JSON Data in PostgreSQLSection 11: Next Steps Learning Data EngineeringLecture 51 What to Learn Next?Lecture 52 Apache Airflow for OrchestrationLecture 53 Apache Spark for ETL/ELT and AnalyticsLecture 54 Great Expectations for Data Quality ControlLecture 55 dbt for Data Transformation with SQLSection 12: Course Wrap UpLecture 56 ConclusionPeople who work with data and want to build data manipulation scripts faster and develop more complex data pipelinesHomepagehttps://www.udemy.com/course/generative-ai-for-data-engineering/Download ( Rapidgator )https://rg.to/file/3578df8c4c3d3dae2ff9488ce6a8a5be/aqxwh.Generative.Ai.For.Data.Engineering.part3.rar.htmlhttps://rg.to/file/6ea61f9fc1f1db500dc68eb7d6750224/aqxwh.Generative.Ai.For.Data.Engineering.part1.rar.htmlhttps://rg.to/file/85ff49470e6a2e65c1f06ac98197de2b/aqxwh.Generative.Ai.For.Data.Engineering.part2.rar.htmlUploadgighttps://uploadgig.com/file/download/88f1ea27ae2784A5/aqxwh.Generative.Ai.For.Data.Engineering.part2.rarhttps://uploadgig.com/file/download/987910b3c03472b9/aqxwh.Generative.Ai.For.Data.Engineering.part1.rarhttps://uploadgig.com/file/download/b48260927e19a7Ad/aqxwh.Generative.Ai.For.Data.Engineering.part3.rarDownload ( NitroFlare )https://nitroflare.com/view/54D281312D48167/aqxwh.Generative.Ai.For.Data.Engineering.part3.rarhttps://nitroflare.com/view/5DDA34CEBE5FC40/aqxwh.Generative.Ai.For.Data.Engineering.part1.rarhttps://nitroflare.com/view/9A2058D020F2C2A/aqxwh.Generative.Ai.For.Data.Engineering.part2.rarFikperhttps://fikper.com/ZPNO2zpllA/aqxwh.Generative.Ai.For.Data.Engineering.part3.rar.htmlhttps://fikper.com/a5Ngqz6mFt/aqxwh.Generative.Ai.For.Data.Engineering.part1.rar.htmlhttps://fikper.com/iKj6OZJjVu/aqxwh.Generative.Ai.For.Data.Engineering.part2.rar.htmlNo Password - Links are Interchangeable Link to comment Share on other sites More sharing options...
Recommended Posts
Please sign in to comment
You will be able to leave a comment after signing in
Sign In Now