{"id":8992,"date":"2021-06-23T10:32:13","date_gmt":"2021-06-23T09:32:13","guid":{"rendered":"https:\/\/ee.yelkdev.site\/?p=8992"},"modified":"2024-03-28T13:48:36","modified_gmt":"2024-03-28T13:48:36","slug":"common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them","status":"publish","type":"post","link":"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/","title":{"rendered":"Common pitfalls of data pipeline projects, and how to avoid them"},"content":{"rendered":"<p><span style=\"font-weight: 400;\">Knowing, understanding and managing your data throughout its lifecycle is more important than it has ever been. And more difficult.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Of course, the <\/span><a href=\"https:\/\/www.statista.com\/statistics\/871513\/worldwide-data-created\/\"><span style=\"font-weight: 400;\">never ending growth in data volume<\/span><\/a><span style=\"font-weight: 400;\"> is partly responsible for this, as are also countless processes that need to be applied to the data to ensure it is usable and effective. Which is why data analysts and data engineers turn to data pipelining.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Added complexity is involved when, In order to keep abreast of the latest requirements, organisations need to constantly deploy new data technologies alongside legacy infrastructure.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">All three of these elements mean that, inevitably, data pipelines are becoming more complicated as they grow. In the final article in our data pipeline series, we have highlighted some of the common pitfalls that we have learned from our experience over the years and how to avoid them. These are also part of our <\/span><a href=\"https:\/\/playbooks.equalexperts.com\/data-pipeline\"><span style=\"font-weight: 400;\">Data Pipeline Playbook.<\/span><\/a><\/p>\n<h2><b>About this series<\/b><\/h2>\n<p><span style=\"font-weight: 400;\">This is the final post in our six part series on the data pipeline, taken from our latest playbook. Now we look at the many pitfalls you can encounter in a data pipeline project. In the series before now, we looked at <a href=\"https:\/\/www.equalexperts.com\/blog\/our-thinking\/what-is-a-data-pipeline\/\">what a data pipeline is<\/a> and who it is used by. Next we looked at the <a href=\"https:\/\/www.equalexperts.com\/blog\/our-thinking\/what-are-the-benefits-of-data-pipelines\/\">six main benefits of a good data pipeline<\/a>, part three considered the \u2018must have\u2019 <a href=\"https:\/\/www.equalexperts.com\/blog\/our-thinking\/six-must-have-key-principles-of-data-pipeline-projects\/\">key principles of data pipeline projects<\/a>, and part <a href=\"https:\/\/www.equalexperts.com\/blog\/our-thinking\/six-essential-practices-of-data-pipelines\/\">four<\/a> and <a href=\"https:\/\/www.equalexperts.com\/blog\/our-thinking\/five-more-practices-that-will-ensure-a-successful-data-pipeline-project\/\">five<\/a> covered the essential practices of a data pipeline. So here\u2019s our list of some of the pitfalls we\u2019ve experienced when building data pipelines in partnership with various clients. We\u2019d encourage you to avoid the scenarios listed below.<\/span><\/p>\n<h2><b>Avoid tightly coupling your analytics pipelines with other business processes<\/b><\/h2>\n<p><span style=\"font-weight: 400;\">Analytics data pipelines provide data to produce insights about your customers, business operations, technology performance, and more. For example, the role of a data warehouse is to create an historical record of data that can be mined for insights.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">It is tempting to see these rich data sources as the best source of data for all data processing and plumb key business activities in these repositories. However, this can easily end up preventing the extraction of insights it was implemented for. Data warehouses can become so integrated into business operations &#8211; effectively acting as the Operational Data Store (ODS) &#8211; that they can no longer function as a data warehouse. Key business activities end up dependent on the fast processing of data drawn from the data warehouse, which prevents other users from running queries on the data they need for their analyses.<\/span><\/p>\n<img decoding=\"async\" class=\"aligncenter wp-image-8995 size-large\" src=\"https:\/\/www.equalexperts.com\/wp-content\/uploads\/2021\/04\/datapipeline_blog6_content-1200x735.png\" alt=\"\" width=\"1200\" height=\"735\" \/>\n<p><span style=\"font-weight: 400;\">Modern architectures utilise a micro-service architecture, and we advocate this digital platform approach to delivering IT functionality (see our <\/span><a href=\"https:\/\/playbooks.equalexperts.com\/digital-platform\"><span style=\"font-weight: 400;\">Digital Platform Playbook<\/span><\/a><span style=\"font-weight: 400;\">). Micro-services should <\/span><b><i>own their own data<\/i><\/b><span style=\"font-weight: 400;\"> &#8211; and as there is unlikely to be a one-size-fits-all solution to volumes, latencies, or use of master or reference data of the many critical business data flows implemented as micro-services. Great care should be taken as to which part of the analytics data pipelines they should be drawn from. The nearer the data they use is to the end users, the more constrained your data analytics pipeline will become over time, and the more restricted analytics users will become in what they can do.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">If a micro-service is using a whole pipeline as part of its critical functionality, it is probably time to reproduce the pipeline as a micro-service in its own right, as the needs of the analytics users and the micro-service will diverge over time.<\/span><\/p>\n<h2><b>Include data users early on<\/b><\/h2>\n<p><span style=\"font-weight: 400;\">We are sometimes asked if we can implement data pipelines without bothering data users. They are often very busy interfacing at senior levels, and as their work provides key inputs to critical business activities and decisions, it can be tempting to reduce the burden on them and think that you already understand their needs.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">In our experience this is nearly always a mistake. Like any software development, understanding user needs as early as you can, and validating that understanding through the development, is much more likely to lead to a valued product. Data users almost always welcome a chance to talk about what data they want, what form they want it in, and how they want to access it. When it becomes available, they may well need some coaching on how to access it.<\/span><\/p>\n<h2><b>Keep unstructured raw inputs separate from processed data<\/b><\/h2>\n<p><span style=\"font-weight: 400;\">In pipelines where the raw data is unstructured (e.g. documents or images), and the initial stages of the pipeline extract data from it, such as entities (names, dates, phone numbers, etc.), or categorisations, it can be tempting to keep the raw data together with the extracted information. This is usually a mistake. Unstructured data is always of a much higher volume, and keeping it together with extracted data will almost certainly lead to difficulties in processing or searching the useful, structured data later on. Keep the unstructured data in separate storage (e.g., different buckets), and store links to it instead.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">We hope that this article, along with all the others in the series, will help you create better pipelines and address the common challenges that can occur when building and using them. Data pipeline projects can be challenging and complicated, but done correctly they securely gather information and allow you to make valuable decisions quickly and effectively.\u00a0<\/span><\/p>\n<h2><b>Contact us!<\/b><\/h2>\n<p><span style=\"font-weight: 400;\">For more information on data pipelines in general, take a look at our <\/span><a href=\"https:\/\/playbooks.equalexperts.com\/data-pipeline\"><span style=\"font-weight: 400;\">Data Pipeline Playbook<\/span><\/a><span style=\"font-weight: 400;\">.\u00a0 And if you\u2019d like us to share our experience of data pipelines with you, get in touch using the form below.<\/span><\/p>\n<div class=\"window-main-col\">\n<div class=\"js-fill-card-detail-desc\">\n<div>\n<div class=\"window-module\">\n<div class=\"u-gutter\">\n<div class=\"editable\">\n<div class=\"description-content js-desc-content\">\n<div class=\"current markeddown hide-on-edit js-desc js-show-with-desc\" dir=\"auto\">\n<p dir=\"auto\">\n\t\t\t\t\t\t<script>\n\t\t\t\t\t\t\twindow.hsFormsOnReady = window.hsFormsOnReady || [];\n\t\t\t\t\t\t\twindow.hsFormsOnReady.push(()=>{\n\t\t\t\t\t\t\t\thbspt.forms.create({\n\t\t\t\t\t\t\t\t\tportalId: 7208712,\n\t\t\t\t\t\t\t\t\tformId: \"83acdf22-cf43-47ba-b91f-0428264b824a\",\n\t\t\t\t\t\t\t\t\ttarget: \"#hbspt-form-1758975709000-9643979107\",\n\t\t\t\t\t\t\t\t\tregion: \"eu1\",\n\t\t\t\t\t\t\t\t\t\n\t\t\t\t\t\t\t})});\n\t\t\t\t\t\t<\/script>\n\t\t\t\t\t\t<div class=\"hbspt-form\" id=\"hbspt-form-1758975709000-9643979107\"><\/div><\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<div class=\"js-plugin-sections\"><\/div>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Our experience has taught us that building data pipelines can be complex, especially if you are new to them. Here are some of the pitfalls we\u2019ve come across when building data pipelines in partnership with clients. It\u2019s our hope it will help you to navigate them in the future.<\/p>\n","protected":false},"author":133,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"content-type":"","inline_featured_image":false,"footnotes":""},"categories":[5],"tags":[185,188,187,192],"location":[397],"class_list":["post-8992","post","type-post","status-publish","format-standard","hentry","category-our-thinking","tag-data","tag-data-engineering","tag-data-management","tag-data-pipelines-playbook"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v25.9 (Yoast SEO v25.9) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Common pitfalls of data pipeline projects, and how to avoid them | Equal Experts<\/title>\n<meta name=\"description\" content=\"From raw inputs to when to include data users, here we look at some of the most common pitfalls when creating data pipelines, and how to avoid them.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/\" \/>\n<meta property=\"og:locale\" content=\"en_GB\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Common pitfalls of data pipeline projects, and how to avoid them\" \/>\n<meta property=\"og:description\" content=\"Our experience has taught us that building data pipelines can be complex, especially if you are new to them. Here are some of the pitfalls we\u2019ve come across when building data pipelines in partnership with clients. It\u2019s our hope it will help you to navigate them in the future.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/\" \/>\n<meta property=\"og:site_name\" content=\"Equal Experts\" \/>\n<meta property=\"article:published_time\" content=\"2021-06-23T09:32:13+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-03-28T13:48:36+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.equalexperts.com\/wp-content\/uploads\/2021\/04\/datapipeline_blog6_fb.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"630\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Simon Case\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:title\" content=\"Common pitfalls of data pipeline projects, and how to avoid them\" \/>\n<meta name=\"twitter:description\" content=\"Our experience has taught us that building data pipelines can be complex, especially if you are new to them. Here are some of the pitfalls we\u2019ve come across when building data pipelines in partnership with clients. It\u2019s our hope it will help you to navigate them in the future.\" \/>\n<meta name=\"twitter:creator\" content=\"@EqualExperts\" \/>\n<meta name=\"twitter:site\" content=\"@EqualExperts\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Simon Case\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/\"},\"author\":{\"name\":\"Simon Case\",\"@id\":\"https:\/\/www.equalexperts.com\/#\/schema\/person\/461daa125fcd274669eb26dfbef20369\"},\"headline\":\"Common pitfalls of data pipeline projects, and how to avoid them\",\"datePublished\":\"2021-06-23T09:32:13+00:00\",\"dateModified\":\"2024-03-28T13:48:36+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/\"},\"wordCount\":952,\"publisher\":{\"@id\":\"https:\/\/www.equalexperts.com\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.equalexperts.com\/wp-content\/uploads\/2021\/04\/datapipeline_blog6_content-1200x735.png\",\"keywords\":[\"data\",\"data engineering\",\"data management\",\"data pipelines playbook\"],\"articleSection\":[\"Our Thinking\"],\"inLanguage\":\"en-GB\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/\",\"url\":\"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/\",\"name\":\"Common pitfalls of data pipeline projects, and how to avoid them | Equal Experts\",\"isPartOf\":{\"@id\":\"https:\/\/www.equalexperts.com\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.equalexperts.com\/wp-content\/uploads\/2021\/04\/datapipeline_blog6_content-1200x735.png\",\"datePublished\":\"2021-06-23T09:32:13+00:00\",\"dateModified\":\"2024-03-28T13:48:36+00:00\",\"description\":\"From raw inputs to when to include data users, here we look at some of the most common pitfalls when creating data pipelines, and how to avoid them.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/#breadcrumb\"},\"inLanguage\":\"en-GB\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/#primaryimage\",\"url\":\"https:\/\/www.equalexperts.com\/wp-content\/uploads\/2021\/04\/datapipeline_blog6_content-1200x735.png\",\"contentUrl\":\"https:\/\/www.equalexperts.com\/wp-content\/uploads\/2021\/04\/datapipeline_blog6_content-1200x735.png\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.equalexperts.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Common pitfalls of data pipeline projects, and how to avoid them\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.equalexperts.com\/#website\",\"url\":\"https:\/\/www.equalexperts.com\/\",\"name\":\"Equal Experts\",\"description\":\"Making Software. Better.\",\"publisher\":{\"@id\":\"https:\/\/www.equalexperts.com\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.equalexperts.com\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-GB\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.equalexperts.com\/#organization\",\"name\":\"Equal Experts\",\"url\":\"https:\/\/www.equalexperts.com\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\/\/www.equalexperts.com\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.equalexperts.com\/wp-content\/uploads\/2018\/08\/Equal_Experts_Logo_CMYK_Colour.jpg\",\"contentUrl\":\"https:\/\/www.equalexperts.com\/wp-content\/uploads\/2018\/08\/Equal_Experts_Logo_CMYK_Colour.jpg\",\"width\":719,\"height\":340,\"caption\":\"Equal Experts\"},\"image\":{\"@id\":\"https:\/\/www.equalexperts.com\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/x.com\/EqualExperts\",\"https:\/\/www.linkedin.com\/company\/equal-experts\/?viewAsMember=true\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.equalexperts.com\/#\/schema\/person\/461daa125fcd274669eb26dfbef20369\",\"name\":\"Simon Case\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\/\/www.equalexperts.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/e197e1e0b9cc0e8781990d364393d74c819c76bdd8ca064ca08a79c4a897e0e3?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/e197e1e0b9cc0e8781990d364393d74c819c76bdd8ca064ca08a79c4a897e0e3?s=96&d=mm&r=g\",\"caption\":\"Simon Case\"}}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Common pitfalls of data pipeline projects, and how to avoid them | Equal Experts","description":"From raw inputs to when to include data users, here we look at some of the most common pitfalls when creating data pipelines, and how to avoid them.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/","og_locale":"en_GB","og_type":"article","og_title":"Common pitfalls of data pipeline projects, and how to avoid them","og_description":"Our experience has taught us that building data pipelines can be complex, especially if you are new to them. Here are some of the pitfalls we\u2019ve come across when building data pipelines in partnership with clients. It\u2019s our hope it will help you to navigate them in the future.","og_url":"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/","og_site_name":"Equal Experts","article_published_time":"2021-06-23T09:32:13+00:00","article_modified_time":"2024-03-28T13:48:36+00:00","og_image":[{"width":1200,"height":630,"url":"https:\/\/www.equalexperts.com\/wp-content\/uploads\/2021\/04\/datapipeline_blog6_fb.png","type":"image\/png"}],"author":"Simon Case","twitter_card":"summary_large_image","twitter_title":"Common pitfalls of data pipeline projects, and how to avoid them","twitter_description":"Our experience has taught us that building data pipelines can be complex, especially if you are new to them. Here are some of the pitfalls we\u2019ve come across when building data pipelines in partnership with clients. It\u2019s our hope it will help you to navigate them in the future.","twitter_creator":"@EqualExperts","twitter_site":"@EqualExperts","twitter_misc":{"Written by":"Simon Case","Estimated reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/#article","isPartOf":{"@id":"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/"},"author":{"name":"Simon Case","@id":"https:\/\/www.equalexperts.com\/#\/schema\/person\/461daa125fcd274669eb26dfbef20369"},"headline":"Common pitfalls of data pipeline projects, and how to avoid them","datePublished":"2021-06-23T09:32:13+00:00","dateModified":"2024-03-28T13:48:36+00:00","mainEntityOfPage":{"@id":"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/"},"wordCount":952,"publisher":{"@id":"https:\/\/www.equalexperts.com\/#organization"},"image":{"@id":"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/#primaryimage"},"thumbnailUrl":"https:\/\/www.equalexperts.com\/wp-content\/uploads\/2021\/04\/datapipeline_blog6_content-1200x735.png","keywords":["data","data engineering","data management","data pipelines playbook"],"articleSection":["Our Thinking"],"inLanguage":"en-GB"},{"@type":"WebPage","@id":"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/","url":"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/","name":"Common pitfalls of data pipeline projects, and how to avoid them | Equal Experts","isPartOf":{"@id":"https:\/\/www.equalexperts.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/#primaryimage"},"image":{"@id":"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/#primaryimage"},"thumbnailUrl":"https:\/\/www.equalexperts.com\/wp-content\/uploads\/2021\/04\/datapipeline_blog6_content-1200x735.png","datePublished":"2021-06-23T09:32:13+00:00","dateModified":"2024-03-28T13:48:36+00:00","description":"From raw inputs to when to include data users, here we look at some of the most common pitfalls when creating data pipelines, and how to avoid them.","breadcrumb":{"@id":"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/"]}]},{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/#primaryimage","url":"https:\/\/www.equalexperts.com\/wp-content\/uploads\/2021\/04\/datapipeline_blog6_content-1200x735.png","contentUrl":"https:\/\/www.equalexperts.com\/wp-content\/uploads\/2021\/04\/datapipeline_blog6_content-1200x735.png"},{"@type":"BreadcrumbList","@id":"https:\/\/www.equalexperts.com\/blog\/our-thinking\/common-pitfalls-of-data-pipeline-projects-and-how-to-avoid-them\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.equalexperts.com\/"},{"@type":"ListItem","position":2,"name":"Common pitfalls of data pipeline projects, and how to avoid them"}]},{"@type":"WebSite","@id":"https:\/\/www.equalexperts.com\/#website","url":"https:\/\/www.equalexperts.com\/","name":"Equal Experts","description":"Making Software. Better.","publisher":{"@id":"https:\/\/www.equalexperts.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.equalexperts.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-GB"},{"@type":"Organization","@id":"https:\/\/www.equalexperts.com\/#organization","name":"Equal Experts","url":"https:\/\/www.equalexperts.com\/","logo":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/www.equalexperts.com\/#\/schema\/logo\/image\/","url":"https:\/\/www.equalexperts.com\/wp-content\/uploads\/2018\/08\/Equal_Experts_Logo_CMYK_Colour.jpg","contentUrl":"https:\/\/www.equalexperts.com\/wp-content\/uploads\/2018\/08\/Equal_Experts_Logo_CMYK_Colour.jpg","width":719,"height":340,"caption":"Equal Experts"},"image":{"@id":"https:\/\/www.equalexperts.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/EqualExperts","https:\/\/www.linkedin.com\/company\/equal-experts\/?viewAsMember=true"]},{"@type":"Person","@id":"https:\/\/www.equalexperts.com\/#\/schema\/person\/461daa125fcd274669eb26dfbef20369","name":"Simon Case","image":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/www.equalexperts.com\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/e197e1e0b9cc0e8781990d364393d74c819c76bdd8ca064ca08a79c4a897e0e3?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/e197e1e0b9cc0e8781990d364393d74c819c76bdd8ca064ca08a79c4a897e0e3?s=96&d=mm&r=g","caption":"Simon Case"}}]}},"_links":{"self":[{"href":"https:\/\/www.equalexperts.com\/wp-json\/wp\/v2\/posts\/8992","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.equalexperts.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.equalexperts.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.equalexperts.com\/wp-json\/wp\/v2\/users\/133"}],"replies":[{"embeddable":true,"href":"https:\/\/www.equalexperts.com\/wp-json\/wp\/v2\/comments?post=8992"}],"version-history":[{"count":0,"href":"https:\/\/www.equalexperts.com\/wp-json\/wp\/v2\/posts\/8992\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.equalexperts.com\/wp-json\/wp\/v2\/media?parent=8992"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.equalexperts.com\/wp-json\/wp\/v2\/categories?post=8992"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.equalexperts.com\/wp-json\/wp\/v2\/tags?post=8992"},{"taxonomy":"location","embeddable":true,"href":"https:\/\/www.equalexperts.com\/wp-json\/wp\/v2\/location?post=8992"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}