{"id":28956,"date":"2024-04-09T21:25:07","date_gmt":"2024-04-09T21:25:07","guid":{"rendered":"https:\/\/building.nubank.com\/mastering-databricks-performance-with-spark-ui\/"},"modified":"2024-04-09T21:41:27","modified_gmt":"2024-04-09T21:41:27","slug":"entendendo-performance-no-databricks-usando-o-spark-ui","status":"publish","type":"post","link":"https:\/\/building.nubank.com\/pt-br\/entendendo-performance-no-databricks-usando-o-spark-ui\/","title":{"rendered":"Understanding Performance in Databricks Using the Spark UI"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\"><em>Reviewed by Felipe Yukio<\/em><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Databricks and the Spark UI are powerful tools for working with large volumes of data. As with any robust system, tuning performance is essential to get the most out of the available resources. This guide walks through performance metrics in Databricks using the Spark UI.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">From preparation to identifying bottlenecks and refining solutions, it offers a broad view of how to get the most out of data operations. Mixing technical insight with practical advice, it shows readers how to use the Spark UI's diagnostic capabilities so that data operations are efficient, effective, and well understood. Read on!<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">The value of the Spark UI<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">The Spark UI is a key diagnostic tool for anyone working with Databricks and Apache Spark. It offers a view into the inner workings of data operations. When dealing with large datasets, it is often hard to tell whether a change actually improved anything. 
The Spark UI brings clarity by presenting operational data in an understandable way.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Preparing for analysis and sample troubleshooting<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Before starting the analysis, it is essential to set up a suitable environment. This means avoiding data caching, which can skew real-time data processing metrics. In Databricks, the cache can be disabled by setting &#8220;spark.databricks.io.cache.enabled&#8221; to false.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Clearing the cache through the Spark catalog (spark.catalog.clearCache()) ensures a cache-free environment. For SQL users on Databricks, the same effect can be achieved with the SQL CLEAR CACHE command. To be safe, consider restarting the cluster to guarantee a cache-free setup.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Exploring the metrics<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">When queries run in Spark, the Spark UI metrics take center stage. The first stop is the &#8220;job&#8221; view, which offers the broadest perspective. Every data operation in Spark triggers &#8220;jobs&#8221;, each with multiple &#8220;stages&#8221; made up of many &#8220;tasks&#8221;. Metrics such as the number of tasks per stage and the time taken by each stage are shown here.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Drilling down leads to the &#8220;stage&#8221; view, which reveals finer details such as partition distributions. 
Valuable metrics such as garbage collection time and input size distributions are shown, helping diagnose problems such as data skew.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Databricks runs on a cluster, and the metrics aggregated by executor highlight how each executor performs in that setting. The &#8220;task&#8221; view is more granular still, exposing the metrics of each individual task and rounding out the performance analysis.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Insights into Skewed Operations<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">An operation is considered skewed when one partition processes far more data than the others, which prevents Spark from running the transformations fully in parallel. This happens because Spark sends all rows with the same key to a single partition, so if one key value is much more common than the others, that partition becomes skewed.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In the Spark UI, this shows up when one &#8220;task&#8221; takes much longer than the average of all the other tasks in a job. A skewed transformation can occur in a join or a window function. 
To avoid this, check whether the skewed data can be filtered out before performing either of these transformations.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Challenges with Shuffle and Spill<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Shuffle is an expensive operation that Spark uses to redistribute data across partitions, and it is triggered by common transformations such as join and groupBy.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The number of partitions used to shuffle data can be increased or reduced through <em>spark.sql.shuffle.partitions<\/em>. If you are dealing with small amounts of data, you should reduce the number of shuffle partitions to avoid running many simultaneous tasks that each handle only a small volume of data. Conversely, a large amount of data processed in too few partitions makes tasks take too long and can cause out-of-memory errors.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Getting the number of shuffle partitions right is tricky, since it usually takes testing with different values to find the best one. It is usually worth the effort, though, as this is the most common source of performance problems in Spark jobs.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">As a last resort to avoid out-of-memory errors, Spark can spill data from memory to disk. That data later has to be read back, which increases disk reads and writes as well as task execution time. Spill can also be identified in the Spark UI. 
Increasing the number of shuffle partitions is one way to reduce spill.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Conclusion<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">This deep dive into Databricks and the Spark UI highlighted the importance of preparation, the intricacies of the metrics, and the nuances of the tuning process. By drawing on the Spark UI's diagnostic capabilities, it is possible to navigate the vast landscape of data operations with clarity and confidence.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Whether the goal is to solve specific problems, test different approaches, or refine solutions, an informed approach can greatly improve performance. As the world of data keeps growing, tools like the Spark UI become indispensable, ensuring that every data operation is not just a process but an opportunity for optimization.<\/p>\n\n\n\n<figure class=\"wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<span class=\"embed-youtube\" style=\"text-align:center; display: block;\"><iframe loading=\"lazy\" class=\"youtube-player\" width=\"640\" height=\"360\" src=\"https:\/\/www.youtube.com\/embed\/MCpKIUm-qjY?version=3&#038;rel=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;fs=1&#038;hl=pt-BR&#038;autohide=2&#038;wmode=transparent&#038;listType=playlist&#038;list=PLfqo9_UMdHhYKJ6LnyzNNNOYKshjvJFUA\" allowfullscreen=\"true\" style=\"border:0;\" sandbox=\"allow-scripts allow-same-origin allow-popups allow-presentation allow-popups-to-escape-sandbox\"><\/iframe><\/span>\n<\/div><\/figure>\n","protected":false},"excerpt":{"rendered":"<p>Ao lidar com grandes conjuntos de dados, o Spark UI oferece clareza apresentando dados de opera\u00e7\u00e3o de uma maneira 
compreens\u00edvel.<\/p>\n","protected":false},"author":178110103,"featured_media":25154,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_feature_clip_id":0,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_post_was_ever_published":false},"categories":[2604,2503],"tags":[2606,3177,3176],"ppma_author":[2321],"class_list":["post-28956","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-analytics-engineering-pt-br","category-data-analytics","tag-analytics-engineering-pt-br","tag-databricks-pt-br","tag-spark-pt-br"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.8 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Entendendo Performance no Databricks usando o Spark UI - Building Nubank<\/title>\n<meta name=\"description\" content=\"Ao lidar com grandes conjuntos de dados, o Spark UI oferece clareza apresentando dados de opera\u00e7\u00e3o de uma maneira compreens\u00edvel.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/building.nubank.com\/pt-br\/entendendo-performance-no-databricks-usando-o-spark-ui\/\" \/>\n<meta property=\"og:locale\" content=\"pt_BR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Entendendo Performance no Databricks usando o Spark UI - Building Nubank\" \/>\n<meta property=\"og:description\" content=\"Ao lidar com grandes conjuntos de dados, o Spark UI oferece clareza apresentando dados de opera\u00e7\u00e3o de uma maneira compreens\u00edvel.\" \/>\n<meta property=\"og:url\" 
content=\"https:\/\/building.nubank.com\/pt-br\/entendendo-performance-no-databricks-usando-o-spark-ui\/\" \/>\n<meta property=\"og:site_name\" content=\"Building Nubank\" \/>\n<meta property=\"article:published_time\" content=\"2024-04-09T21:25:07+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-04-09T21:41:27+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/building.nubank.com\/wp-content\/uploads\/2023\/06\/1120-www.victoriaholguin.com-Victoria-Holguin-_DSF0634-Mejorado-NR-1024x683.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1024\" \/>\n\t<meta property=\"og:image:height\" content=\"683\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Nubank Editorial\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Escrito por\" \/>\n\t<meta name=\"twitter:data1\" content=\"Nubank Editorial\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
tempo de leitura\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutos\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/entendendo-performance-no-databricks-usando-o-spark-ui\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/entendendo-performance-no-databricks-usando-o-spark-ui\\\/\"},\"author\":{\"name\":\"Nubank Editorial\",\"@id\":\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/#\\\/schema\\\/person\\\/462f4f5a8d4ec3ccbc3d661dde00f0a4\"},\"headline\":\"Entendendo Performance no Databricks usando o Spark UI\",\"datePublished\":\"2024-04-09T21:25:07+00:00\",\"dateModified\":\"2024-04-09T21:41:27+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/entendendo-performance-no-databricks-usando-o-spark-ui\\\/\"},\"wordCount\":980,\"commentCount\":0,\"image\":{\"@id\":\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/entendendo-performance-no-databricks-usando-o-spark-ui\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/i0.wp.com\\\/building.nubank.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/1120-www.victoriaholguin.com-Victoria-Holguin-_DSF0634-Mejorado-NR.jpg?fit=6240%2C4160&ssl=1\",\"keywords\":[\"Analytics Engineering\",\"Databricks\",\"spark\"],\"articleSection\":[\"Analytics Engineering\",\"Data &amp; Analytics\"],\"inLanguage\":\"pt-BR\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/entendendo-performance-no-databricks-usando-o-spark-ui\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/entendendo-performance-no-databricks-usando-o-spark-ui\\\/\",\"url\":\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/entendendo-performance-no-databricks-usando-o-spark-ui\\\/\",\"name\":\"Entendendo Performance no 
Databricks usando o Spark UI - Building Nubank\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/entendendo-performance-no-databricks-usando-o-spark-ui\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/entendendo-performance-no-databricks-usando-o-spark-ui\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/i0.wp.com\\\/building.nubank.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/1120-www.victoriaholguin.com-Victoria-Holguin-_DSF0634-Mejorado-NR.jpg?fit=6240%2C4160&ssl=1\",\"datePublished\":\"2024-04-09T21:25:07+00:00\",\"dateModified\":\"2024-04-09T21:41:27+00:00\",\"author\":{\"@id\":\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/#\\\/schema\\\/person\\\/462f4f5a8d4ec3ccbc3d661dde00f0a4\"},\"description\":\"Ao lidar com grandes conjuntos de dados, o Spark UI oferece clareza apresentando dados de opera\u00e7\u00e3o de uma maneira compreens\u00edvel.\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/entendendo-performance-no-databricks-usando-o-spark-ui\\\/#breadcrumb\"},\"inLanguage\":\"pt-BR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/entendendo-performance-no-databricks-usando-o-spark-ui\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-BR\",\"@id\":\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/entendendo-performance-no-databricks-usando-o-spark-ui\\\/#primaryimage\",\"url\":\"https:\\\/\\\/i0.wp.com\\\/building.nubank.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/1120-www.victoriaholguin.com-Victoria-Holguin-_DSF0634-Mejorado-NR.jpg?fit=6240%2C4160&ssl=1\",\"contentUrl\":\"https:\\\/\\\/i0.wp.com\\\/building.nubank.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/1120-www.victoriaholguin.com-Victoria-Holguin-_DSF0634-Mejorado-NR.jpg?fit=6240%2C4160&ssl=1\",\"width\":6240,\"height\":4160},{\"@type\":\"BreadcrumbList\",
\"@id\":\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/entendendo-performance-no-databricks-usando-o-spark-ui\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Entendendo Performance no Databricks usando o Spark UI\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/#website\",\"url\":\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/\",\"name\":\"Building Nubank\",\"description\":\"We make the extraordinary happen\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"pt-BR\"},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/#\\\/schema\\\/person\\\/462f4f5a8d4ec3ccbc3d661dde00f0a4\",\"name\":\"Nubank Editorial\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-BR\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/8c056170dc75ffd365b306a0ac7bea4e51d1cdab52a0c84e6ba0a42f7e2f4633?s=96&d=identicon&r=g0a78bc815f2126d9ba65b2af185671f1\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/8c056170dc75ffd365b306a0ac7bea4e51d1cdab52a0c84e6ba0a42f7e2f4633?s=96&d=identicon&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/8c056170dc75ffd365b306a0ac7bea4e51d1cdab52a0c84e6ba0a42f7e2f4633?s=96&d=identicon&r=g\",\"caption\":\"Nubank Editorial\"},\"url\":\"https:\\\/\\\/building.nubank.com\\\/pt-br\\\/author\\\/editorial\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Entendendo Performance no Databricks usando o Spark UI - Building Nubank","description":"Ao lidar com grandes conjuntos de dados, o Spark UI oferece clareza apresentando dados de opera\u00e7\u00e3o de uma maneira compreens\u00edvel.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/building.nubank.com\/pt-br\/entendendo-performance-no-databricks-usando-o-spark-ui\/","og_locale":"pt_BR","og_type":"article","og_title":"Entendendo Performance no Databricks usando o Spark UI - Building Nubank","og_description":"Ao lidar com grandes conjuntos de dados, o Spark UI oferece clareza apresentando dados de opera\u00e7\u00e3o de uma maneira compreens\u00edvel.","og_url":"https:\/\/building.nubank.com\/pt-br\/entendendo-performance-no-databricks-usando-o-spark-ui\/","og_site_name":"Building Nubank","article_published_time":"2024-04-09T21:25:07+00:00","article_modified_time":"2024-04-09T21:41:27+00:00","og_image":[{"width":1024,"height":683,"url":"https:\/\/building.nubank.com\/wp-content\/uploads\/2023\/06\/1120-www.victoriaholguin.com-Victoria-Holguin-_DSF0634-Mejorado-NR-1024x683.jpg","type":"image\/jpeg"}],"author":"Nubank Editorial","twitter_card":"summary_large_image","twitter_misc":{"Escrito por":"Nubank Editorial","Est. 
tempo de leitura":"4 minutos"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/building.nubank.com\/pt-br\/entendendo-performance-no-databricks-usando-o-spark-ui\/#article","isPartOf":{"@id":"https:\/\/building.nubank.com\/pt-br\/entendendo-performance-no-databricks-usando-o-spark-ui\/"},"author":{"name":"Nubank Editorial","@id":"https:\/\/building.nubank.com\/pt-br\/#\/schema\/person\/462f4f5a8d4ec3ccbc3d661dde00f0a4"},"headline":"Entendendo Performance no Databricks usando o Spark UI","datePublished":"2024-04-09T21:25:07+00:00","dateModified":"2024-04-09T21:41:27+00:00","mainEntityOfPage":{"@id":"https:\/\/building.nubank.com\/pt-br\/entendendo-performance-no-databricks-usando-o-spark-ui\/"},"wordCount":980,"commentCount":0,"image":{"@id":"https:\/\/building.nubank.com\/pt-br\/entendendo-performance-no-databricks-usando-o-spark-ui\/#primaryimage"},"thumbnailUrl":"https:\/\/i0.wp.com\/building.nubank.com\/wp-content\/uploads\/2023\/06\/1120-www.victoriaholguin.com-Victoria-Holguin-_DSF0634-Mejorado-NR.jpg?fit=6240%2C4160&ssl=1","keywords":["Analytics Engineering","Databricks","spark"],"articleSection":["Analytics Engineering","Data &amp; Analytics"],"inLanguage":"pt-BR","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/building.nubank.com\/pt-br\/entendendo-performance-no-databricks-usando-o-spark-ui\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/building.nubank.com\/pt-br\/entendendo-performance-no-databricks-usando-o-spark-ui\/","url":"https:\/\/building.nubank.com\/pt-br\/entendendo-performance-no-databricks-usando-o-spark-ui\/","name":"Entendendo Performance no Databricks usando o Spark UI - Building 
Nubank","isPartOf":{"@id":"https:\/\/building.nubank.com\/pt-br\/#website"},"primaryImageOfPage":{"@id":"https:\/\/building.nubank.com\/pt-br\/entendendo-performance-no-databricks-usando-o-spark-ui\/#primaryimage"},"image":{"@id":"https:\/\/building.nubank.com\/pt-br\/entendendo-performance-no-databricks-usando-o-spark-ui\/#primaryimage"},"thumbnailUrl":"https:\/\/i0.wp.com\/building.nubank.com\/wp-content\/uploads\/2023\/06\/1120-www.victoriaholguin.com-Victoria-Holguin-_DSF0634-Mejorado-NR.jpg?fit=6240%2C4160&ssl=1","datePublished":"2024-04-09T21:25:07+00:00","dateModified":"2024-04-09T21:41:27+00:00","author":{"@id":"https:\/\/building.nubank.com\/pt-br\/#\/schema\/person\/462f4f5a8d4ec3ccbc3d661dde00f0a4"},"description":"Ao lidar com grandes conjuntos de dados, o Spark UI oferece clareza apresentando dados de opera\u00e7\u00e3o de uma maneira compreens\u00edvel.","breadcrumb":{"@id":"https:\/\/building.nubank.com\/pt-br\/entendendo-performance-no-databricks-usando-o-spark-ui\/#breadcrumb"},"inLanguage":"pt-BR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/building.nubank.com\/pt-br\/entendendo-performance-no-databricks-usando-o-spark-ui\/"]}]},{"@type":"ImageObject","inLanguage":"pt-BR","@id":"https:\/\/building.nubank.com\/pt-br\/entendendo-performance-no-databricks-usando-o-spark-ui\/#primaryimage","url":"https:\/\/i0.wp.com\/building.nubank.com\/wp-content\/uploads\/2023\/06\/1120-www.victoriaholguin.com-Victoria-Holguin-_DSF0634-Mejorado-NR.jpg?fit=6240%2C4160&ssl=1","contentUrl":"https:\/\/i0.wp.com\/building.nubank.com\/wp-content\/uploads\/2023\/06\/1120-www.victoriaholguin.com-Victoria-Holguin-_DSF0634-Mejorado-NR.jpg?fit=6240%2C4160&ssl=1","width":6240,"height":4160},{"@type":"BreadcrumbList","@id":"https:\/\/building.nubank.com\/pt-br\/entendendo-performance-no-databricks-usando-o-spark-ui\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/building.nubank.com\/pt-br\/"},{"@type":"ListItem",
"position":2,"name":"Entendendo Performance no Databricks usando o Spark UI"}]},{"@type":"WebSite","@id":"https:\/\/building.nubank.com\/pt-br\/#website","url":"https:\/\/building.nubank.com\/pt-br\/","name":"Building Nubank","description":"We make the extraordinary happen","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/building.nubank.com\/pt-br\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"pt-BR"},{"@type":"Person","@id":"https:\/\/building.nubank.com\/pt-br\/#\/schema\/person\/462f4f5a8d4ec3ccbc3d661dde00f0a4","name":"Nubank Editorial","image":{"@type":"ImageObject","inLanguage":"pt-BR","@id":"https:\/\/secure.gravatar.com\/avatar\/8c056170dc75ffd365b306a0ac7bea4e51d1cdab52a0c84e6ba0a42f7e2f4633?s=96&d=identicon&r=g0a78bc815f2126d9ba65b2af185671f1","url":"https:\/\/secure.gravatar.com\/avatar\/8c056170dc75ffd365b306a0ac7bea4e51d1cdab52a0c84e6ba0a42f7e2f4633?s=96&d=identicon&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/8c056170dc75ffd365b306a0ac7bea4e51d1cdab52a0c84e6ba0a42f7e2f4633?s=96&d=identicon&r=g","caption":"Nubank Editorial"},"url":"https:\/\/building.nubank.com\/pt-br\/author\/editorial\/"}]}},"jetpack_featured_media_url":"https:\/\/i0.wp.com\/building.nubank.com\/wp-content\/uploads\/2023\/06\/1120-www.victoriaholguin.com-Victoria-Holguin-_DSF0634-Mejorado-NR.jpg?fit=6240%2C4160&ssl=1","jetpack_shortlink":"https:\/\/wp.me\/pbKBB5-7x2","jetpack_sharing_enabled":true,"authors":[{"term_id":2321,"user_id":178110103,"is_guest":0,"slug":"editorial","display_name":"Nubank 
Editorial","avatar_url":"https:\/\/secure.gravatar.com\/avatar\/8c056170dc75ffd365b306a0ac7bea4e51d1cdab52a0c84e6ba0a42f7e2f4633?s=96&d=identicon&r=g","0":null,"1":"","2":"","3":"","4":"","5":"","6":"","7":"","8":""}],"_links":{"self":[{"href":"https:\/\/building.nubank.com\/pt-br\/wp-json\/wp\/v2\/posts\/28956","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/building.nubank.com\/pt-br\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/building.nubank.com\/pt-br\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/building.nubank.com\/pt-br\/wp-json\/wp\/v2\/users\/178110103"}],"replies":[{"embeddable":true,"href":"https:\/\/building.nubank.com\/pt-br\/wp-json\/wp\/v2\/comments?post=28956"}],"version-history":[{"count":4,"href":"https:\/\/building.nubank.com\/pt-br\/wp-json\/wp\/v2\/posts\/28956\/revisions"}],"predecessor-version":[{"id":28968,"href":"https:\/\/building.nubank.com\/pt-br\/wp-json\/wp\/v2\/posts\/28956\/revisions\/28968"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/building.nubank.com\/pt-br\/wp-json\/wp\/v2\/media\/25154"}],"wp:attachment":[{"href":"https:\/\/building.nubank.com\/pt-br\/wp-json\/wp\/v2\/media?parent=28956"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/building.nubank.com\/pt-br\/wp-json\/wp\/v2\/categories?post=28956"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/building.nubank.com\/pt-br\/wp-json\/wp\/v2\/tags?post=28956"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/building.nubank.com\/pt-br\/wp-json\/wp\/v2\/ppma_author?post=28956"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}