
The Wayback Machine - https://web.archive.org/web/20210115154532/https://thenewstack.io/software-engineers-use-spreadsheets-data-engineers-use-the-cloud/

Software Engineers Use Spreadsheets; Data Engineers Use the Cloud

14 Jan 2021 1:00pm, by Lawrence E Hecht

Building or running data infrastructure is an important part of the job for 55% of the 372 data engineers surveyed, according to the “2020 Kaggle Machine Learning & Data Science Survey.” These data engineers support data science applications as well as other use cases. They are actually slightly more likely (58%) to say that analyzing and understanding data in order to influence decisions is part of their job.

Data scientists focus on analysis, which is less important for machine learning (ML) engineers. Still, there are many similarities between the 2,421 data scientists and 937 ML engineers in the Kaggle survey: about the same percentage of each group improves ML models, and about the same percentage builds or runs an ML service to improve a product or service.

At 18%, data engineers are more than twice as likely as data scientists to use cloud-based software and APIs as their primary tool to analyze data. They were also more likely to analyze data in the cloud. Local development environments like Jupyter Notebooks are the most likely primary tool for every job role we reviewed. Basic statistical software, which is defined as spreadsheets, is very popular among software engineers. This is a reminder that just because developers know Python doesn’t mean they will use data tools for data science.

