AI VILLAGE - Generating Labeled Data From Adversary Simulations with MITRE ATT&CK - TIB AV-Portal

AI VILLAGE - Generating Labeled Data From Adversary Simulations with MITRE ATT&CK

00:00

5

Formal Metadata

Title

AI VILLAGE - Generating Labeled Data From Adversary Simulations with MITRE ATT&CK

Title of Series

Number of Parts

322

Author

License

CC Attribution 3.0 Unported:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.

Identifiers

10.5446/39911 (DOI)

Publisher

Release Date

Language

Content Metadata

Subject Area

Computer Science

Genre

Conference/Talk

Abstract

Attackers have a seemingly endless arsenal of tools and techniques at their disposal, while defenders must continuously strive to improve detection capabilities across the full spectrum of possible vectors. The MITRE ATT&CK Framework provides a useful collection of attacker tactics and techniques that enables a threat-focused approach to detection. This technical talk will highlight key lessons learned from an internal adversary simulation at a Fortune 100 company that evolved into a series of data science experiments designed to improve threat detection.

DEF CON 2670 / 322

1

17:56

AI VILLAGE - Machine Learning Model Hardening For Fun and Profit

2

28:58

WIRELESS VILLAGE - Learning to Listen: Machine Learning for Adaptive Wireless Adversary Detection

3

55:24

WIRELESS VILLAGE - SirenJack Cracking a Secure Emergency Siren System

4

56:18

WIRELESS VILLAGE - Capture & Analye Like a Bawss

5

45:30

VOTING VILLAGE - Defending Election Security: A National Security Priority

6

55:32

VOTING VILLAGE - State, Local Perspectives on Election Security

7

12:06

VOTING VILLAGE - A Crash Course on Election Security

8

09:58

VOTING VILLAGE - Mechanics and Pitfalls of Auditing with Scanners

9

27:26

VOTING VILLAGE - Trustworthy Elections

10

42:04

VOTING VILLAGE - Recap of Voting Village 2017 Lessons Learned

11

27:18

VOTING VILLAGE - Cybersecurity and U.S. Elections

12

25:17

VOTING VILLAGE - The Return of Insecure Brazilian Voting Machines

13

24:39

VOTING VILLAGE - A Comparative Forensic Analysis of WINVote Voting Machines

14

31:05

SE VILLAGE - Social Engineering From a CISO's Perspective

15

39:22

SE VILLAGE - From Introvert to SE

16

30:50

SE VILLAGE - Would You Like To Play a Game?

17

28:07

SE VILLAGE - My Stripper Name Is Bubbles Sunset

18

50:25

SE VILLAGE - Welcome to 2018

19

50:40

SE VILLAGE - Hunting Predators: SE Style

20

36:01

SE VILLAGE - In-N-Out – That’s What It’s All About

21

50:45

SE VILLAGE - Social Engineering Course Projects for Undergraduate Students

22

15:23

RECON VILLAGE - Introducing YOGA: Your OSINT Graphical Analyzer

23

1:37:17

RECON VILLAGE - Building visualisation platforms for OSINT Data Using open source solutions

24

27:24

RECON VILLAGE - skiptracer: Ghetto OSINT for Broke Hackers

25

38:44

RECON VILLAGE - Core OSINT: Keeping Track of and Reporting All the Things

26

37:51

RECON VILLAGE - PREBELLICO: 100% Passive Pre-Engagement and Posz Compromise Reconnaissance Tool

27

02:23

RECON VILLAGE - Closing Note

28

03:35

RECON VILLAGE - Opening Note

29

21:10

RECON VILLAGE- Winning a SANS 504

30

21:17

RECON VILLAGE -Targeted User Analytics and Human Honeypots

31

27:10

RECON VILLAGE - WhiteRabbit: Combining Threat Intelligence Public Blockchain Data and Machine Learning to go Down the “Dirty Money Rabbit Hole”

32

22:37

RECON VILLAGE -Hackathon Product Showcase

33

11:50

RECON VILLAGE - Hackathon and CTF Prizes

34

21:57

RECON VILLAGE - Supercharge Your Web Recon With Commonspeak and Evolutionary Wordlists

35

43:42

RECON VILLAGE - I fought the law and law lost

36

28:09

RECON VILLAGE - Adventures in the dark web of government data

37

46:20

RECON VILLAGE - Applied OSINT For Politics: Turning Open Data Into News

38

1:07:41

RECON VILLAGE - Emergent Recon - fresh methodology and tools for hackers in 2018

39

18:08

RECON VILLAGE - 1983: I'm born 2018: I'm Cathing the Bad Guys

40

07:57

RECON VILLAGE - Social Mapper

41

13:53

RECON VILLAGE - OpenPiMap: Hacking the hackers with OSINT, Raspberry Pis, and Data Analysis

42

16:32

RECON VILLAGE - How WHOIS Data Uncovered $32 Billion Connected to the Mormon Church

43

28:20

RECON VILLAGE - Hacking the international RFQ Process #killthebuzzwords

44

34:14

RECON VILLAGE - Stalker In A Haystack

45

21:18

RECON VILLAGE - Mapping wifi networks and triggering on interesting traffic patterns

46

37:45

RECON VILLAGE - Bug Bounty Hunting on Steroids

47

36:33

RECON VILLAGE - From Breach to Bust

48

50:13

PACKET HACKING VILLAGE - PacketWhisper: Stealthily Exfiltrating Data and Defeating Attribution via DNS & Text-Based Steganography

49

14:04

PACKET HACKING VILLAGE - Burning the Lookout

50

18:15

PACKET HACKING VILLAGE - Defense in Depth The Path to SGX at Akamai

51

51:58

PACKET HACKING VILLAGE - Mallet A Proxy for Arbitrary Traffic

52

20:56

PACKET HACKING VILLAGE - An Analysis of Cybersecurity Educational Standards

53

51:16

PACKET HACKING VILLAGE - Protecting Crypto Exchanges from a New Wave of Man-in-the-Browser Attacks

54

43:30

PACKET HACKING VILLAGE - Fishing for Phishers. The Enterprise Strikes Back!

55

27:24

IoT VILLAGE Keynote - Tales of a SOHOpeful Journey: Where our Research Started and Where it's Going

56

41:46

PACKET HACKING VILLAGE - Rethinking Role Based Security Education

57

30:55

PACKET HACKING VILLAGE - Car Infotainment Hacking Methodology and Attack Surface Scenarios

58

19:34

PACKET HACKING VILLAGE - Turning Deception Outside-In: Tricking Attackers with OSINT

59

17:46

PACKET HACKING VILLAGE - How to Tune Automation to Avoid False Positives

60

41:28

IoT VILLAGE - Internet of Things: The ultimate key to Rooting the human being

61

42:16

PACKET HACKING VILLAGE - Target-Based Security Model

62

16:34

PACKET HACKING VILLAGE - Bitsquatting: Passive DNS Hijacking

63

24:57

PACKET HACKING VILLAGE - Swiss Cheese Holes in the Foundation of Modern Security - CERT VU#919801

64

40:30

PACKET HACKING VILLAGE - Grand Theft Auto: Digital Key Hacking

65

35:29

PACKET HACKING VILLAGE - Mapping Wi-Fi Networks and Triggering on Interesting Traffic Patterns

66

18:09

PACKET HACKING VILLAGE - Collaborative / Teaching SOC

67

21:41

PACKET HACKING VILLAGE - Normalizing Empire's Traffic to Evade Anomaly-based IDS

68

37:28

IoT VILLAGE - Worms that fight back: Nematodes as an antidote for IoT malware

69

29:22

IoT VILLAGE - The Sound of a Targeted Attack

70

38:25

AI VILLAGE - Generating Labeled Data From Adversary Simulations with MITRE ATT&CK

71

22:20

PACKET HACKING VILLAGE - An OSINT Approach to Third Party Cloud Service Provider Evaluation

72

34:57

IoT VILLAGE - Exploiting the IoT hub: What happened to my home

73

29:13

IoT VILLAGE - FPGA’s: a new attack surface for embedded adversaries

74

30:36

IoT VILLAGE - Your Smart Scale is Leaking More than Your Weight

75

18:17

PACKET HACKING VILLAGE - WPA-SEC: The largest online WPA handshake database

76

39:24

IoT VILLAGE - Attacking Smart Irrigation Systems

77

36:43

IoT VILLAGE - Internet of Laws: Navigating the IoT Hacking Legal Landscape

78

13:11

ICS VILLAGE - A SOC in the Village

79

47:03

IoT VILLAGE - How to modify ARM Cortex M-based firmware: A step-by-step approach for Xiaomi IoT Devices

80

20:23

ICS VILLAGE - How can industrial IioT be protected from the great unwashed masses of IoT devices

81

35:35

ICS VILLAGE - Behavior-Based Defense in ICS: Leveraging Minor Incidents to Protect Against Major Attacks

82

28:39

ICS VILLAGE - Side-Channel Analysis for Protecting Critical Infrastructure

83

41:54

ICS VILLAGE - Analyzing VPNFilter

84

30:00

ICS VILLAGE - TOR for The IOT (TORT Reform)

85

29:48

ICS VILLAGE - A CTF That Teaches: Challenging the Next Generation of ICS Ethical Hackers

86

26:37

HARDWARE HACKING VILLAGE - NFC Payments: The Art of Relay & Replay Attacks

87

19:51

HARDWARE HACKING VILLAGE - Hacking your HackRF

88

27:12

HARDWARE HACKING VILLAGE - Beacons will give you up

89

35:11

HARDWARE HACKING VILLAGE - Building drones, the hard way

90

50:08

HARDWARE HACKING VILLAGE - The Cactus: 6502 Homebrew Computer

91

24:37

HARDWARE HACKING VILLAGE - Disabling Intel ME in Firmware

92

46:09

ETHICS VILLAGE - Responsible Disclosure Panel

93

1:28:25

ETHICS VILLAGE - Nations and Nationalism and Cyber Security

94

1:01:51

ETHICS VILLAGE - Ethics of Technology in Humanitarian and Disaster Response

95

52:48

ETHICS VILLAGE - Ethics for Security Practitioners

96

36:47

ETHICS VILLAGE - Ethical Disclosure and the Reduction of Harm

97

34:52

ETHICS VILLAGE - Accountability without accountability: A censorship measurement case study

98

26:27

DATA DUPLICATION VILLAGE - Owning GlusterFS with GEVAUDAN

99

24:21

DATA DUPLICATION VILLAGE - The Memory Remains: Cold Disk forensics 101

100

39:29

DATA DUPLICATION VILLAGE - A Beginner's Guide to Musical Scales of Cyberwar

101

57:42

DATA DUPLICATION VILLAGE - Facts, figures, fun from managing 100,000 hard drives

102

20:45

CRYPTO AND PRIVACY VILLAGE - CATs [CryptoAuthTokens]: A Tale of Scalable Authentication

103

20:15

CRYPTO AND PRIVACY VILLAGE - Green Locks for You and Me

104

44:28

CRYPTO AND PRIVACY VILLAGE - Implementing a Library for Pairing-based Transform Cryptography

105

35:36

CRYPTO AND PRIVACY VILLAGE - Hiding in plain sight: Disguising HTTPS traffic with domain-fronting

106

1:18:00

CRYPTO AND PRIVACY VILLAGE - Cryptography, Codes, and Secret Writing: An Introduction to Secret Communications

107

12:36

CRYPTO AND PRIVACY VILLAGE - Building a Cryptographic Backdoor in OpenSSL

108

29:11

CRYPTO AND PRIVACY VILLAGE - No Way JOSE! Designing Cryptography Features for Mere Mortals

109

23:56

CRYPTO AND PRIVACY VILLAGE - “Won’t Somebody Think of the Children?” Examining COPPA Compliance at Scale

110

18:19

CRYPTO AND PRIVACY VILLAGE - Jailed by a Google Search: the Surveillance State’s War on Self-induced Abortion

111

34:29

CRYPTO AND PRIVACY VILLAGE - Cicada: What we can learn from the puzzles

112

20:37

CRYPTO AND PRIVACY VILLAGE - Geolocation and Homomorphic Encryption

113

47:13

CRYPTO AND PRIVACY VILLAGE - Prototyping Cryptographic Protocols in Python With Charm

114

45:49

CRYPTO AND PRIVACY VILLAGE - Cloud Encryption: How to not suck at securing your encryption keys

115

19:26

CRYPTO AND PRIVACY VILLAGE - Opportunistic Onions

116

52:34

CRYPTO AND PRIVACY VILLAGE - Hamilton’s Private Key: American Exceptionalism and the Right to Anonymity

117

16:41

CRYPTO AND PRIVACY VILLAGE - Bullies, Sluts, and Best Selves: Fixing digital privacy education

118

47:47

CRYPTO AND PRIVACY VILLAGE - Two Steps to Owning MFA

119

46:45

CRYPTO AND PRIVACY VILLAGE - Integrating post-quantum crypto into real-life applications

120

11:53

CRYPTO AND PRIVACY VILLAGE - The Underhanded Crypto(graphy) Contest: 2018 Winners

121

27:45

CRYPTO AND PRIVACY VILLAGE - Catarineu and Modi - Anonymous rate limiting with Direct Anonymous Attestation

122

52:39

CRYPTO AND PRIVACY VILLAGE - “Probably”: an Irreverent Overview of the GDPR

123

45:47

CRYPTO AND PRIVACY VILLAGE - JARVIS never saw it coming: Hacking machine learning (ML) in speech, text and face recognition – and frankly, everywhere else

124

26:48

CRYPTO AND PRIVACY VILLAGE - Revolutionizing Authentication with Oblivious Cryptography

125

22:49

CAR HACKING VILLAGE - Automotive Evidence Collection – Automotive Driving Aids and Liability

126

34:18

CAR HACKING VILLAGE - Flash Bootloaders Exposing Automotive ECU updates

127

22:47

CAR HACKING VILLAGE - Automotive Exploitation Sandbox: A Hands-on Educational Introduction to Embedded Device Exploitation

128

45:41

CAR HACKING VILLAGE - Grand Theft Auto: Digital Key Hacking

129

12:02

CAR HACKING VILLAGE - So, You Want To Hack A Car?

130

14:16

CAR HACKING VILLAGE - Go Hack Cars

131

1:00:59

CAR HACKING VILLAGE - Meet Salinas, the first ever SMS-commanded Car Infotainment RAT

132

23:40

CAR HACKING VILLAGE - CAN Signal Extraction from OpenXC with Radare2

133

52:19

CAR HACKING VILLAGE - V2X Misbehavior Detection

134

48:39

CAR HACKING VILLAGE - When CAN CANT

135

24:27

CANNABIS VILLAGE - Compliance and Infosec Within the Cannabis Industry

136

35:55

CANNABIS VILLAGE - Cruising the Cannabis Hiiighway: A Series of Major Breaches in Cannabis

137

39:04

CANNABIS VILLAGE - The Federal Cannabis lawsuit and why the controlled Substances Act is going up in smoke

138

49:03

CANNABIS VILLAGE - How to deal with local government (if you must)...

139

47:12

CANNABIS VILLAGE - Hacking Plants to Hack Humans: Using informatics to Breed Cannabis Plants in Order to Hack the Human Metabolome

140

28:35

CANNABIS VILLAGE - Where do cannabinoids come from?

141

47:01

CANNABIS VILLAGE - The Invisible Hands Tending the Secret Greens

142

21:12

CANNABIS VILLAGE - Identifying sick Cannabis with AI

143

48:42

CANNABIS VILLAGE - Weed Hacking: a pragmatic primer for marijuana home grows

144

42:18

CANNABIS VILLAGE - Open Cannabis Project: Cannabis + Open Source

145

42:37

CANNABIS VILLAGE - The Real History of Marijuana Prohibition

146

42:32

CANNABIS VILLAGE - Introduction to Hydroponic systems in commercial Facilities

147

22:57

CAAD VILLAGE - GeekPwn - The Uprising Geekpwn AI/Robotics Cybersecurity Contest U.S. 2018 - Weapons for Dog Fight：Adapting Malware to Anti-Detection based on GAN

148

11:32

CAAD VILLAGE - GeekPwn - The Uprising Geekpwn AI/Robotics Cybersecurity Contest U.S. 2018 - High Frequenzy Targeted Attacks

149

17:28

CAAD VILLAGE - GeekPwn - The Uprising Geekpwn AI/Robotics Cybersecurity Contest U.S. 2018 - Boosting Adversarial Attacks with Momentum

150

27:30

CAAD VILLAGE - GeekPwn - The Uprising Geekpwn AI/Robotics Cybersecurity Contest U.S. 2018 - Magic Tricks for Self-driving Cars

151

21:43

CAAD VILLAGE - GeekPwn - The Uprising Geekpwn AI/Robotics Cybersecurity Contest U.S. 2018 - Targeted Adversarial Examples for Black Box Audio Systems

152

37:16

CAAD VILLAGE - GeekPwn - The Uprising Geekpwn AI/Robotics Cybersecurity Contest U.S. 2018 - How to leverage the open-source information to make an effective adversarial attack/defense against deep learning model

153

24:38

CAAD VILLAGE - GeekPwn - The Uprising Geekpwn AI/Robotics Cybersecurity Contest U.S. 2018 - Hardware Trojan Attacks on Neural Networks

154

09:52

CAAD VILLAGE - GeekPwn - The Uprising Geekpwn AI/Robotics Cybersecurity Contest U.S. 2018 - Transferable Adversarial Perturbations

155

18:38

CAAD VILLAGE - GeekPwn - The Uprising Geekpwn AI/Robotics Cybersecurity Contest U.S. 2018 - Explanation: Alternative Path to Secure Deep Learning System

156

32:39

CAAD VILLAGE - GeekPwn - The Uprising Geekpwn AI/Robotics Cybersecurity Contest U.S. 2018 - Practical adversarial attacks against challenging models environments

157

44:51

BLUE TEAM VILLAGE - Cloud Security Myths: Cutting through he BS-as-a-service

158

20:28

BLUE TEAM VILLAGE - Effective Log and Events Management

159

42:30

BLUE TEAM VILLAGE - Endpoint Monitoring: With Free, and Open Source tools!

160

28:21

BLUE TEAM VILLAGE - Automating DFIR: The Counter Future

161

41:29

BLUE TEAM VILLAGE - How Not to Suck at Vulnerability Management at Scale

162

49:19

BLUE TEAM VILLAGE - Subversion and Espionage Directed Against You (SAEDY)

163

36:01

BLUE TEAM VILLAGE - Hacking Your Dev Job to Save the World: Where Programming and Hacking Meet

164

41:34

BLUE TEAM VILLAGE - SOC of 2020

165

40:35

BLUE TEAM VILLAGE - Stop, Drop and Assess Your SOC: Sonducting and Using Att&ck Assessments

166

18:46

BIO HACKING VILLAGE - Jumping the Epidermal Barrier

167

45:34

BIO HACKING VILLAGE - PWN to Own My Heart

168

51:52

BIO HACKING VILLAGE - Four Thieves Vinegar Collective: Take back the knowledge, Take back the power

169

14:37

BIO HACKING VILLAGE - Biohacking Village Microfluidics Badge

170

22:03

BIO HACKING VILLAGE - Exploiting Immune Defenses - Can Malware Learn from Biological Viruses?

171

34:20

BIO HACKING VILLAGE - Workplace Accommodation for Autistics: Autistic Autobiography and Technology Enabled Prosthetic Environments

172

37:14

BIO HACKING VILLAGE - Hacking Human Fetuses

173

2:21:33

BIO HACKING VILLAGE - DAY ONE

174

33:56

BIO HACKING VILLAGE - Our Evolutionary Path... in 45 Minutes

175

19:20

BIO HACKING VILLAGE - WaterBot The Hackable Plant Control System

176

37:14

BIO HACKING VILLAGE - Selfie or Mugshot? The power of facial recognition technology and the implications for genetic discrimination

177

1:04:41

BCOS Monero Village - Greatest Questions

178

1:02:10

BCOS Monero Village - BCOS Keynote

179

13:30

BCOS Monero Village - Hacking a Crypto Payment Gateway

180

1:07:08

BCOS Monero Village - We Program Our Stinkin Badges

181

23:54

BCOS Monero Village - We Don't Need No Stinkin Badges

182

25:11

BCOS Monero Village - A Rundown of Security Issues in Crypto Wallets

183

32:25

BCOS Monero Village - Ring Signatures MONERO

184

53:37

BCOS Monero Village - Inside Monero

185

1:02:08

BCOS Monero Village - Scaling and Economic Implications of the Adaptive Blocksize in Monero

186

42:23

BCOS Monero Village - Monero's Emerging Applications

187

44:44

BCOS Monero Village - Monero Gameshow

188

22:16

BCOS Monero Village - The Monero Projects Vulnerability Response Process

189

1:44:26

BCOS Monero Village - An Introduction to Kovri

190

39:33

BCOS Monero Village - Welcome Speech

191

26:46

AI VILLAGE - Hunting the Ethereum Smart Contract: Color Inspired Inspection of Potential Attacks

192

22:03

AI VILLAGE - Adversarial Stickers

193

31:30

AI VILLAGE - Towards a framework to quantitatively assess AI safety – challenges, open questions and opportunities.

194

35:13

AI VILLAGE - JMPgate: Accelerating reverse engineering into hyperspace using AI

195

19:56

AI VILLAGE - StuxNNet: Practical Live Memory Attacks on Machine Learning Systems

196

27:27

AI VILLAGE - It's a Beautiful Day in the Malware Neighborhood

197

38:58

AI VILLAGE - Stop and Step Away from the Data: Rapid Anomaly Detection via Ransom Note File Classification

198

13:39

AI VILLAGE - Beyond Adversarial Learning – Security Risks in AI Implementations

199

15:27

AI VILLAGE - DeepPhish: Simulating the Malicious Use of AI

200

16:14

AI VILLAGE - The current state of adversarial machine learning

201

32:40

AI VILLAGE - Detecting Web Attacks with Recurrent Neural Networks

202

25:16

AI VILLAGE - Malware Panel

203

03:14

AI VILLAGE - Opening Remarks

204

00:52

AI VILLAGE - Closing Notes and Prizes

205

13:18

AI VILLAGE - Chatting with Your Programs to Find Vulnerabilities

206

35:34

ICA VILLAGE - Hacking firmware where you least expect it: in your tools

207

36:37

AI VILLAGE - Responsible Offensive Machine Learning

208

35:36

AI VILLAGE - The Great Power of AI Algorithmic Mirrors of Society

209

38:11

IoT VILLAGE - I'm the One Who Doesn't Knock: Unlocking Doors from the Network

210

27:28

AI VILLAGE - Automated Planning for the Automated Red Team

211

23:06

AI VILLAGE - Identifying and correlating anomalies in internet-wide scan traffic to newsworthy security events

212

13:59

AI VILLAGE - AI DevOps: Behind the Scenes of a Global Anti-Virus's Machine Learning Infrastructure

213

39:03

Replay Attacks on Ethereum Smart Contracts

214

48:38

Your Peripheral Has Planted Malware - An Exploit of NXP SOCs

215

18:20

Fasten your seatbelts: We are escaping iOS 11 Sandbox!

216

48:13

Eternal Exploits: Reverse Engineering of FuzzBuncg and MS17-010

217

18:43

Dissecting the Teddy Ruxpin: Reverse Engineering the Smart Bear

218

23:19

LoRa Water Meter Security Analysis

219

21:54

Infecting The Embedded Supply Chain

220

35:24

Attacking the macOS Kernel: Graphics Driver

221

32:00

Privacy Infrastructure: Challenges and Opportunities

222

18:31

Hacking the Brain: Customize Evil Protocol to Pwn an SDN Controller

223

37:32

SMBetray: Backdooring & Breaking Signatures

224

20:56

One-Click to OWA

225

20:45

barcOwned Popping Shells with Your Cereal Box

226

40:11

An Attacker Looks at Docker: Approaching Multi-Container Applications

227

30:17

Hacking BLE Bicycle Locks for Fun and a Small Profit

228

18:30

Dragnet: Your Social Engineering Sidekick

229

40:28

Hacking PLCs and Causing Havoc on Critical Infrastructures

230

45:40

Weaponizing Unicode: Homographs Beyond IDNs

231

44:06

DEF CON GROUPS: Panel Discussion

232

42:59

Welcome To DEF CON & Badge Maker Talk

233

1:31:43

DEF CON 26 Closing Ceremonies

234

1:56:07

Inside the Fake Science Factory

235

49:45

Your Bank's Digital Side Door

236

47:28

Jailbreaking the 3DS Through 7 Years of Hardening

237

18:03

Man-In-The-Disk

238

37:53

More MitM makes Mana mostly mediate mischievous Messages

239

47:13

Wagging the Tail: Covert Passive Surveillance

240

21:22

Playing Malware Injection with Exploit thoughts

241

45:55

sorry for the lame-ass title (all your math are belong to us)

242

43:39

Exploiting Active Directory: Administrator Insecurities

243

46:17

A Journey Into Hexagon: Dissecting Qualcomm Baseband

244

21:34

House of Roman: Using Unsorted Bin Attack to achieve a leakless RCE on PIE Binaries

245

22:26

Asura: A huge PCAP file analyzer for anomaly packets detection using massive multithreading

246

20:32

Finding Xori: Malware Analysis Triage with Automated Disassembly

247

19:45

Reaping and breaking keys at scale: when crypto meets big data

248

33:31

Thin SIM-based Attacks on Mobile Money Systems

249

43:52

NSA Talks: Cybersecurity

250

40:04

Tineola: Taking a Bite Out of Enterprise Blockchain

251

45:09

Who Controls the Controllers? Hacking Crestron IoT Automation Systems

252

45:38

The Mouse is Mightier than the Sword

253

20:06

Fire & Ice: making and breaking mac firewalls

254

39:22

It WISN't me, attacking industrial wireless mesh networks

255

50:14

DEF CON 101 - The Panel

256

44:57

Revolting Radios: Get it? It's a pun!

257

43:16

Breaking Parser Logic! Take Your Path Normalization Off and Pop 0Days Out

258

49:16

Reverse Engineering: Hacking Documentary Series

259

32:35

Relocation Bonus: Attacking the Windows Loader Makes Analysts Switch Careers

260

20:00

One Step Before Game Hackers - Instrumenting Android Emulators

261

35:16

It's assembler, Jim, but not as we know it!

262

17:30

Micro-Renovator: Bringing Processor Firmware up to Code

263

20:00

Compromising online services by cracking voicemail systems

264

35:11

Fuzzing Malware For Fun & Profit. Applying Coverage-Guided Fuzzing to Find Bugs in Modern Malware

265

20:47

Sex Work After SESTA (Stop Enabling Sex Traffickers Act)

266

46:28

I'll See Your Missle and Raise You A MIRV: An Overview of the Genesis scripting engine

267

19:17

Edge Side Include Injection: Abusing Caching Servers into SSRF and Transparent Session Hijacking

268

46:27

Trouble in the Tubes

269

42:13

Booby Trapping Boxes: Running Private Services as a High Value Target

270

51:01

Thru the Eyes of the Attacker: Designing Embedded Systems for ICS

271

39:09

Designing RF Fuzzing Tools to Expose PHY Layer Vulnerabilities

272

36:30

Your Watch Can Watch You! Gear Up for the Broken Privilege Pitfalls in the Samsung Gear Smartwatch

273

45:47

Ridealong Adventures: Critical Issues with Police Body Cameras

274

42:07

IOActive: Breaking WingOS

275

44:11

Vulnerable Out of the Box: An Evaluation of Android Carrier Devices

276

44:13

Re-Targetable Grammar Bases Test Generation

277

19:33

Securing our Nations Election Infrastructure

278

19:50

Digital Leviathan: Nation-State Big Brothers (from huge to little ones)

279

41:09

Automated Discovery of Deserialization Gadget Chains

280

42:55

Breaking Smart Speaker: We are Listening to You

281

24:46

Pwning "the toughest target": the exploit chain of winning the largest bug bounty in the history of ASR program

282

18:35

Reverse Engineering and more using X-Ray

283

42:30

For the Love of Money Finding and Exploiting Vulns in mPOS Systems

284

42:41

Owning the LAN in 2018: Defeating MacSEC and 802.1x-2010

285

42:27

De-anonymizing Programmers from Source Code and Binaries

286

45:45

Defending the 2018 Midterm Elections from Foreign Adversaries

287

21:13

Lost and Found Certificates

288

44:14

What the Fax?! Get Ready for a 1980's Hack Party!

289

21:01

In Soviet Russia Smartcard Hacks You

290

1:04:07

LØpht - Heavy Industries

291

41:57

All your family secrets belong to us - Worrisome Security Issues in Tracker Apps

292

44:54

You're Just Complaining Because You're Guilty: A DEF CON Guide to Adversarial Testing of Software Used in the Criminal Justice Systems

293

41:42

4G - Who is paying your cellular phone bill?

294

45:38

80 to 0 in Under 5 Seconds: Falsifying a Mediacal Patient's Vitals

295

35:27

Your Voice is My Passport

296

37:53

Project "The Interceptor": Owning anti-drone systems with nanodrones

297

46:40

Politics and The Surveillance State.

298

34:59

Outsmarting the Smart City

299

41:39

BLE Sniffing 101

300

21:27

The ring 0 façade: awakening the processor's inner demons

301

46:03

GOD MODE unlocked: Hardware backdoors in x86 CPUs

302

27:13

Building the Hacker Tracker

303

32:37

Last mile authentication problem: Exploiting the missing link in end-to-end secure communication

304

35:01

Rock appround the Clock: Tracking Malware Developers

305

24:23

One bite and all your dreams will come true: Analyzing and Attacking Apple Kernel Drivers

306

37:32

Playback: a TLS 1.3 story

307

46:45

Windows Offender: Reverse Engineering Windows Defender's Antivirus Emulator

308

32:21

Ring 0/-2 Rootkits: Compromising Defenses

309

18:25

Detecting Blue Team Recon With Ads

310

45:57

The Road to Resilience: How Real Hacking Redeems a Damnable Profession

311

57:19

WIRELESS VILLAGE - WEP and WPA Cracking 101

312

47:22

WIRELESS VILLAGE - Blue Sonar

313

36:12

WIRELESS VILLAGE - Can you hear me now (DEFCON)? Wireless communication for pentesters

314

29:55

WIRELESS VILLAGE - Hunting Rogue APs*

315

16:47

WIRELESS VILLAGE - PiClicker v2.0

316

16:16

WIRELESS VILLAGE - BLE CTF

317

20:39

WIRELESS VILLAGE - Attacking Gotenna Networks

318

28:25

WIRELESS VILLAGE - RFNoC: Accelerating The Spectrum with the FPGA

319

54:24

WIRELESS VILLAGE - Little Fluffy Pineapple Clouds

320

58:37

WIRELESS VILLAGE - "It's not Wi-Fi": Reverse engineering and managing radio signals

321

20:06

WIRELESS VILLAGE - Exploring the 802.15.4 attack surface

322

55:41

WIRELESS VILLAGE - Introduction to Railroad Telemetry

Automatic playback

Speech

Text

Image

00:00

BitComputer simulationSet (mathematics)Electric generatorGraphical user interfaceDecimal

00:29

InformationSimulationComputer programmingData analysisWave packetExploratory data analysisBitSlide ruleVirtual machineQuicksortReal numberDependent and independent variablesInformation securityCuboidOpen setDirect numerical simulationBlack boxMultiplication signUniform resource locatorRight angleService (economics)Data analysisWave packetExploratory data analysisMachine learningComputer animationLecture/Conference

04:23

Video gameLevel (video gaming)Phase transitionBitLine (geometry)MultilaterationLimit (category theory)ResultantKnowledge baseDirection (geometry)Endliche ModelltheorieStorage area networkComputing platformCycle (graph theory)Context awarenessCybersexPhase transitionPrime idealDirection (geometry)Software frameworkEndliche ModelltheorieComputing platformCybersexComputer animation

05:52

CausalityLine (geometry)Slide ruleDaylight saving timeCentralizer and normalizerAreaAsynchronous Transfer ModeExterior algebraHand fanPoint (geometry)Communications protocolDirect numerical simulationEndliche ModelltheorieDifferent (Kate Ryan album)Fitness functionTwitterExterior algebraCommunications protocol

07:55

Menu (computing)CuboidFocus (optics)

08:32

Row (database)SimulationPhysical systemSlide ruleSystem callProcess (computing)Meta elementDegree (graph theory)Multiplication signCybersexRight angleFault-tolerant systemWave packetKerr-LösungSoftware framework

10:21

Semiconductor memoryWave packetTheorySampling (statistics)Control flowSource codePlastikkarteDifferent (Kate Ryan album)Tracing (software)

11:16

Computer architectureRandom matrixCausalityCountingComputer architectureUniqueness quantificationMenu (computing)

11:51

Descriptive statisticsType theoryWave packetExploratory data analysisDecision theoryArithmetic meanCore dumpField (computer science)Direct numerical simulationTexture mappingSoftwareData analysisExploratory data analysisInformation security

13:16

Perspective (visual)Information securityLaptopStandard deviationData analysisExploratory data analysis

14:07

Mathematical analysisCodeData typeFeedbackSoftwareSpreadsheetVideoconferencingLink (knot theory)Bridging (networking)Frame problemGreatest elementPreprocessorOpen sourceFigurate numberData analysisExploratory data analysisComputer animation

15:55

SoftwareBitDirect numerical simulationEndliche ModelltheorieHookingMultiplication sign

16:40

AlgorithmMathematical analysisRow (database)InformationSoftwareDescriptive statisticsWave packetSoftware testingFormal verificationCausalityBitLetterpress printingMereologySlide ruleVirtual machineLink (knot theory)Information engineeringField (computer science)Bridging (networking)Social classVirtual realityChemical equationGreatest elementGraph coloringSource codeDirect numerical simulationOpen sourceSelectivity (electronic)Disk read-and-write headEndliche ModelltheorieDifferent (Kate Ryan album)Image resolutionLoginCycle (graph theory)DemosceneMachine learningProcess (computing)Virtual reality

21:44

Mathematical analysisMatrix (mathematics)Point (geometry)Frame problemDirect numerical simulationLoginMultiplication signStructural load

22:38

Type theoryError messageInformation securityPoint (geometry)IP addressFrame problemDirect numerical simulationDifferent (Kate Ryan album)TouchscreenEquals signUniform resource nameComputer animation

24:29

Row (database)Data structureMathematicsFunction (mathematics)PlotterFrequencyProduct (business)Fast Fourier transformCausalityLine (geometry)Lambda calculusResultantTable (information)Term (mathematics)Electronic visual displayQuery languageRevision controlOperator (mathematics)WeightDirectory serviceInformation securityIP addressProgrammschleifeEncryptionData conversionFrame problemDirect numerical simulationView (database)LoginLengthMultiplication signEntropie <Informationstheorie>Uniform resource locatorMessage passingDemosceneComputer-assisted translation

29:11

Row (database)Theory of relativityVariable (mathematics)Dot productTerm (mathematics)Similarity (geometry)Point (geometry)Social classCuboidOpen setChemical equationDirect numerical simulationEndliche ModelltheorieDifferent (Kate Ryan album)LoginLengthMultiplication signUniform resource locatorTracing (software)Message passingRight anglePattern languageComputer animation

32:41

CausalityHistogramVisualization (computer graphics)Scaling (geometry)Query languageFrame problemDirect numerical simulationFlagLengthRule of inferenceMessage passing

34:02

Data structureSoftwareExistential quantificationMoment (mathematics)Slide ruleTable (information)Link (knot theory)Range (statistics)RandomizationBridging (networking)Degree (graph theory)Entropie <Informationstheorie>Right angleTwitter

35:12

Mathematical analysisGene clusterComputer animation

35:55

StatisticsWave packetSoftware testingCausalityPredictabilityFrame problemEndliche ModelltheorieNatural numberWave packetSoftware testingHeegaard splittingPredictabilitySet (mathematics)Endliche ModelltheorieComputer animation

36:41

Performance appraisalCASE <Informatik>Endliche ModelltheorieThresholding (image processing)Right angle1 (number)Performance appraisalResultantEndliche ModelltheorieComputer animation

37:21

Perspective (visual)Endliche ModelltheorieMachine learningProcess (computing)Social classComputer animation

37:50

Information securityIP addressMultiplication signMathematical analysisSoftware bugAddress spaceSource codeLoginComputer animation

Transcript: English(auto-generated)

00:00

For our first speaker today, we have Brian on generating labeled data from adversary simulations with MITRE ATT&CK. Please give it up for him. Thank you very much for coming so early on Sunday morning. That's awesome. I appreciate you guys coming and listening to this for a little

00:22

bit. I wanted to talk to you about a couple of things and just give you a little bit of background on how I see this problem set. So the general premise here is that whatever I'm looking at, whether it's prologues or whatever the problem is that I'm trying to solve,

00:42

I try to recognize the biases that I have. Right. So I looked at this last week, I looked at this last month, that kind of idea. So if I can abstract away some of that bias and have a repeatable methodology, something that's based on math, maybe I can find some insights. And the interesting thing about what I'm talking about today

01:04

is for me, it's not theoretical at all. We have an internal red team that's really proficient. Is anyone here from the red team? We have a red team that sometimes will perform some activities

01:22

based on MITRE ATT&CK and whether that's DNS exfiltration, like we're talking about today or some other technique, they'll hit a canary URL first. So think about white box and overt out in the open versus black box. So if you heard of the threat hunting, the hypothesis,

01:42

you know, I think that there might be DNS exfiltration and therefore you come up with a plan and look for the artifacts. We'll get into that in a minute. But for me, it's not theoretical at all. I know based on what I saw here, that those guys, my friends that I drink beer and bourbon with, they ripped us off, they broke in and they

02:02

stole some stuff on May 18th, 2018. And that was the white box overt time where they hit the canary URL. And I know from patterns, that means that they probably broke in again in a covert black box way. So when we talk about assumed breach, completely not theoretical,

02:22

whether you believe in that philosophy or not, which I do, you know, I know that these guys that bought me a beer the other night, they're probably sitting on some data that they exfilled. So that's kind of the background that we'll get into here with the threat hunting and how this ties in. But here's the quick agenda.

02:40

Do a very quick intro. Believe it or not, the MITRE ATT&CK, it's gonna be real quick. Love MITRE ATT&CK, absolutely do. But I think a lot of CFPs, a lot of cons, a lot of stuff's getting saturated, right? So if anybody wants more information than I'm going to provide in the slides here, please come up. I'll talk about it as long as you want to afterwards,

03:02

but I'm going to trim that down just a little bit because I think everyone's probably heard a lot about it recently. So harvesting labeled training data, I'll get into what I mean by that. EDA, exploratory data analysis, machine learning, work example, and I'll talk about just very candidly,

03:21

some challenges that I've run into and a little bit about future work. So before I do that, I just want to get a quick sense of what the background is in the room. So if we could just start here and we'll go around and tell everybody what you do and name. Okay. Well, how about, could I just say a show of hands? How many people do something like threat hunting?

03:42

How many people do any kind of data analytics outside of Excel? Awesome. And how many people have some sort of a program where you're doing adversary simulation where you've got actual purple team,

04:01

both folks internal. Okay, cool. Thank you very much. That was a terrible idea. I don't know what I was thinking. All right. So real quickly about me, my name is Brian. I'm a threat hunting lead at a fortune 100 financial services company. I also help out with the threat intelligence and now security orchestration automation response or SOAR.

04:25

But the bottom line for me is that there's one prime directive. It's find evil. You know, Rob Lee from SANS talks about no normal find evil. I mean, I, I think about this all day, sometimes all night. I hear about that a little bit from my wife sometimes,

04:40

but she's been very patient with that. It's almost an obsession. So MITRE ATT&CK framework, we're probably mostly familiar with it, but just to level set, it's tactics, techniques, common knowledge, and it's a curated knowledge base and model for cyber adversary behavior

05:04

reflecting the various phases of adversaries life cycle and the platforms they're known to target. So my buddy Zach, the lead red team guy at our place in Milwaukee, we did a talk at a DerbyCon in 2016. It's a small world.

05:20

Anybody see that talk in 2016? So we were just talking about very open kimono, very transparent. Here's what we were trying to do with limited resources and budget and everything else. Cause there's a lot of techniques. Here's what we tried. Here's why we're doing it. And here were the results. So I didn't put the ATT&CK timeline on here,

05:42

but I know that we have pre-ATT&CK now, but at that point, we were just primarily focusing on the later stages of ATT&CK. So that's the context in which I'm talking about some of these techniques. So in particular, and we're talking about exfiltration over alternative protocol.

06:01

You know, I didn't blow this up cause I wanted to fit everything on there. So you don't need to see what's on there. I will, I will make sure that I have my Twitter handle, which is just at Brian Ginns. If anybody wants to watch that, I'll have all these slides up by Tuesday, Tuesday, midnight central daylight time.

06:20

So exfiltration over alternative protocol here, I'm focused on DNS. So as anybody think of any tools that you might use for DNS exfiltration, go ahead and shout it out. Somebody said iodine. Yep. Anybody else? Cobalt strike fans or write your own. So there's a lot of different ways you can do this, right? Um, interestingly,

06:41

I don't want to create an over-fitted model. Does anybody ever see that thing? There was some kind of picture I saw on Twitter from the Bay area and it was talking about over-fitting a model. I can't verify whether this is true or not, but maybe somebody in the back can tell me if you heard about this. Essentially that a lot of the models were trained on roads,

07:01

the Teslas with the self-driving car mode in the Bay area. And then when they were driving on roads that were outside of that area that didn't fit the initial kind of things that was used to, there were five salt lines that were laid down by a salt truck and that that was just messing with that. So, um, if anybody heard about that, cool. If,

07:22

if not, um, it's one example in my mind of how, if I try to detect my buddies that are breaking in and stealing data, cause if I can't detect them, I can't detect somebody else doing it right. If I focus only on what it looks like with cobalt strike,

07:41

that's too narrow, right? You use iodine, maybe I'll catch it, maybe I won't. So the point is that this is one of many techniques, but it's one that we focused on, um, because we had the instrumentation and the telemetry to dig into. So really for me, all minor attack is, it's a true North.

08:02

It's a true North where I and we can sit in front of our CISO and executive leadership and say, certainly there are compliance requirements. There are other things that we have to do and boxes that need to be checked, but let's focus on what the attackers are doing. They don't have to do something off the menu, uh, on the menu,

08:23

off the menu. It's up to them, but let's start with known TTPs and tighten up our monitoring and our detection engineering efforts. So not going to spend a lot of time on this. If you follow this stuff, you're probably familiar with, uh, Katie Nichols. Um,

08:41

and there's a couple of other folks did that, but miters got caldera, red canaries got atomic red team. Uber's got meta and games, got red team automation. There are varying degrees of what they're automating, but it's essentially helping teams. These are open source, helping teams figure out how to do a repeatable processes for adversary

09:05

simulation. And then, um, cyber war dog, uh, Roberto Rodriguez, I think is it a specter ops now and Devin care and game, right? Um, I had had an article that this will be linked to if you want to go to it when I send the slides out. Um,

09:20

basically they were using the API and hooking it in with, uh, um, basically letting you dig into that. Hang on just a minute and get some more vodka. Just kidding. And then that, that, uh, talk that we did is online if you want to look at that. So there I was minding my own business, red team sitting here,

09:42

I'm sitting on the right and red team is lighting things up, probably cobalt strike at the time. And honestly I can see them, they hit enter and they're waiting for something to call back and then starts popping up. I think that they're counting the milliseconds to boom. Okay.

10:00

I got a call back here and then they're kind of looking at me like, are your systems lighting up yet? Why aren't you guys hunting for this? Why aren't you guys looking at a ticket from Splunk or whatever semi use? I look back at it and there were 300 rows that were specifically related to what my buddy Matt and Zach had done.

10:23

Has anybody heard of a low cards exchange principle? So every contact leads a, leaves a trace that the burglar breaks in might break a window, might leave some skin, might leave some hair, some kind of sample that law enforcement could use to trace back in a DNA

10:41

sample. Um, you know, footprints outside of it's money. This is what we're trying to do with MITRE ATT&CK to identify, you know, as Chris Sanders who has the investigation theory course we brought him in, in December to do some training for our folks. And he talks about a triangle, a pyramid he's got of four different kinds of evidence sources and he

11:03

breaks it down like network, host, memory and OSINT. And I want to know what are the digital artifacts that are left when my buddies or somebody else breaks in and steals stuff. And our architecture is not like yours. Um,

11:23

your architecture probably isn't exactly like it was six months ago or a year ago. So I think there's a lot of value in seeing what this looks like. And why, why is everything moving on the screen? Because it's early. I know, I know how this goes. Um,

11:41

it was slightly amusing to do that cause it's parallax and that's, that's better than PowerPoint animation. So I think that counts. I don't think that's against the rules, but something that moves, you know, so how many people have heard of EDA exploratory data analysis? Well, let's start with, um, and again,

12:02

don't worry about trying to read the small texture. What I wanted to show is the entirety of this rectangle, which is from a core lights bro G sheet. And this is all the, uh, DNS log fields and the type and the description of that.

12:20

So in a minute, when we're trying to figure out, how do we represent the knowledge of what we're seeing on the network? How do we represent that and convert it into a feature or think of a column in a spreadsheet, right? How do we convert that into a feature that we can kind of hook into and just in the same way that, you know,

12:41

maybe you're training a child in some experiment, I don't know, to classify a fork versus a spoon. If none of that's labeled, if you don't know what the ground truth is, you're dependent on somebody coming and doing that. Otherwise, all you can do is kind of cluster them based on similarities, right?

13:01

But the first thing we have to do before we make those decisions is understand what AA is, you know, what does that mean in your environment? Um, you know, protocol, proto, and then there's some other stuff we'll get into in a minute. But when I say EDA, I'm talking about starting with that. Uh, Jupyter notebook, used to be IPython notebook.

13:22

Um, this is actually from Clarence's book that I've got here. Shameless plug. But I say that because, uh, this is from chapter two of one of my new favorite books. Anybody else buy Data-Driven Security in 2014, like the day it came out?

13:42

Yeah. Um, uh, this, this is something I've been digging into and it's very helpful and it just reminds me that, you know, there's always something to learn and I always find it extremely valuable to get somebody else's perspective here. So this is actually from the O'Reilly GitHub and it's just an example of,

14:01

you know, we're bringing in, uh, some imports, loading the data, um, and just kind of standard pieces there. Candace data frame just oversimplify if you haven't heard of it before, but, uh, you can, I like to think of it visually as an Excel spreadsheet. I'm probably going to have pitchforks and torches after that comment,

14:22

but I think it's a tape, it's tabular. So you can think of it right now, but it does much, much more. Has anybody heard of, uh, Kitware's bro analysis tools, bat? Um, so there's a guy named Brian is one, he's one of the developers there.

14:40

And I just can't say enough good things about these folks too because I wasn't at Brocon last year. I saw the video that he'd done. And again, there'll be a link in this one at the bottom and I contacted Brian because I was stuck on something on his open source code. I don't like to do that.

15:00

I don't like to ask people to Google stuff for me to figure something out. I just, you know, a few weeks ago I was playing around with something. And I said, you know, I'll figure this out. I'm just not sure how long it's going to take. And I just sent him my question and without getting into the details, it was essentially around why can't I join two of these data frames together? And it was because of something on the backend,

15:22

the way they were doing the pre-processing with bro analysis tools, which they describe as a software bridge. So you can get from bro to pandas and then from pandas to scikit learn, which we'll talk about in a minute. But he appreciated the feedback from somebody kind of in the field, in the trenches saying, this is what I'm trying to do.

15:42

And he explained to me what the work around was because it was a different data type. So just another great example of people, you know, pitching in the open source community. And I mean, I sent the dude an email at like nine or 10 at night and he responded that night by midnight.

16:02

So just it's really encouraging when you're kind of working through something and somebody else helps you out a little bit. So a feature engineering, again, we're trying to figure out what, what are the things that we can use to categorize something? You know, this is, you could talk about height, diameter, top.

16:21

So in the same way, we want to find ways to represent the knowledge to describe what's going on on the network with DNS. And hopefully that's going to allow us to figure out what features we can hook into and then train a model so we can catch my buddies the next time they break in. Okay. Griffin data science,

16:44

virtual environment. This is Charles Givray. He does a class here with Austin Taylor and sometimes with Jay Jacobs from data driven security. Awesome folks. He's got, and yeah, I don't know if this is accurate. I've always thought of it as the Kali Linux for data science.

17:02

So I use this, it's, it's pretty decent. And again, there'll be a link there. So I said we're going to do a machine learning worked example. I pulled,

17:21

I pulled some stuff out of this after listening to some of the other talks, just because I wanted to make sure that I don't cram some kind of crazy algorithm in and, and try to show everything that I'm doing. Cause again, I mentioned I'm doing a little bit of orchestration automation. So I want to kind of have that cycle going where I'm getting an internal IP

17:42

address, you know, going forward and then enriching that with friendly intelligence or okay, here's the IP address, which host grabbed that? Let's assume 24 hour lease from it. Okay. Which host, which internal host name has it? Who's the last logged in user with that, then go collect some other stuff. The more you can find out and the faster you can find out, feed that back in.

18:03

There might be a feature or a column that you can compute or some other insight. I'm going to say reputation score. I know it's a terrible example, but some other verifiable piece of information that you can create another column about and a very fine print in the bottom.

18:25

I'm being very explicit about giving credit to Charles Givry cause I literally lifted this from his slide. I just made a different color. So thank you Charles. There's different descriptions of how this works. I liked this from his training class because it's consumable for me.

18:46

You get and you clean the data, you preprocess, do the feature engineering. Now some of this stuff, this is naive of me. When I thought Bro Logs, I'm like, ah, Bro Logs are pretty structured,

19:00

right? I'm not going to have a lot of this. No, because there's a lot of getting it from where it is to where it needs to go. A lot of the data engineering and the pipeline, that kind of stuff. And believe it or not, ID dot O R I G underscore H. That's the one I'm going to think of as the source IP that initiate that DNS

19:22

request. Just the fact, can anybody think of a problem when you start doing stuff in Python and the label, the field or column name is called dot something. So you're going to throw an error, right? And it's just simple things like that where you'll see that in a few minutes

19:42

here where we have a column that's renamed, not a big deal, but I wouldn't have thought that I'd run into that. It's just a different use than maybe we originally thought of for it. But then with the preprocessing feature engineering Bro analysis tools, again, Kitware describes that as an open source software bridge that's going to kind of

20:04

do some of the behind the scenes heavy lifting. So you can just use it kind of as a gray box and move forward with what you're trying to analyze. And then advanced feature selection. Then we have the data that we're going to split into train and test.

20:21

And then we'll build the model, we'll evaluate the model. And I tried a couple of different things. So I'll show you a couple of differences. But the main thing that popped into my head when we started talking about this is if I have labeled data, if I can use labeled data when we train that model,

20:42

now I can move from unsupervised, which is clustering to supervised. Now we've got, I know from the 300 records that are from the DNS exfiltration from cobalt strike or whatever it is. I know what they did and when they did it on May 18th.

21:01

So I have another column where it's one if it's known malicious and it's zero if it's not. Is that, does that solve everything? No, there's, there's some issues there because what are the, what are the attackers and red team want to do? You want to be stealthy? And the better your trade craft is, the more stealthy you are,

21:22

the more stealthy you are and the quieter you are, the fewer artifacts that I have, which leads to something we call class imbalance and you can correct for that. You can adjust for that. But I kind of wonder sometimes do I want to, do you want to want to make that seem like it's a bigger part of the log

21:43

data than it really is? So I'm importing pandas and then we just have a as PD. I'm importing NumPy, get into the matrix. And then from bat bro analysis tools,

22:04

which he's going to change the name after they change the name at some point. So keep in mind that this will be called something else because bro itself has changed the name of their offering, but import log to data frame. And then a lot of times you'll say DF equals,

22:22

I just put DNS underscore DF equals and I'm calling log to data frame, dot log to data frame path to this is one hours worth of logs, a one hours worth of logs. And then next I see DNS DF dot rename columns.

22:44

So you see what I was talking about the ID dot origin. So if you have something dot ID dot origin, you're going to throw an error. And anything I say, there's two or three ways. You could probably get around that. This is the quickest for me,

23:00

filtered DNS DF. All I'm saying there is the data frame. We call the DNS DF. The data frame is after the equal sign. So we're saying DNS underscore DF, we're referencing that pandas data frame. And then in the square brackets, we're saying ID underscore origin underscore H.

23:21

That's the host that initiated that DNS request that string contains. And I masked this, but there is a large subnet that wasn't relevant to this. And I won't get into that for opposite reasons, of course, but the point is you might segment that through,

23:40

you might go through and convert the IPs to integers. You might, you know, you can do ranges, you can do a lot of different stuff with that. And I think that's actually covered in the data driven security book among other places. And then I did the type filtered just to make sure, you know, after I talked to Brian from kitware,

24:02

so I'm kind of leaving myself some breadcrumbs and going through, I pulled all the comments out of this just to make it easier to have less on the screen. So this is just a nuance, but filter DNS DF dot is copy equals false. It's trying to be helpful if you don't do that and say, Hey,

24:22

you keep slicing these things off and you're trying to do things on a copy. So I had to Google that and it turns out if you just do equals false, then stops throwing those errors. It sounds pretty scientific, right? It worked filter DNS DF query length.

24:42

So here what I'm doing, this is the current version of the data frame or the tabular data structure that we're dealing with. And then I say query length in quotes equals, and then I'm saying add a new column and what I want in that column for each

25:05

row is the data frame and then give me the length of what's in the query. So again, we start talking about malicious URLs. When you start talking about message length and that kind of stuff, this might seem like one of the go-to things.

25:23

However, I took a bunch of the other stuff off cause originally, you know, when I'm trying to do this in a production environment, I want to know for that IP address for that time period or maybe expanded it to longer, like 24 hours. Does that IP address and what does that look like in terms of the con dot

25:43

log? A lot of you I'm sure are familiar with con dot log, but if you're not, I think of con dot log and bro as the closest I'm going to get to a hundred percent net flow, right? So basically just the phone record instead of the phone conversation. So I'm trying to take an entity based view of this,

26:04

a user 360 and a device 360 and essentially understand what behaviors are being exhibited by that host during that timeframe. So, uh, just real quick aside, how many people have heard of a Black Hills information security is Rita?

26:22

Is anybody using Rita? Um, I think I've got a link in there. I'll make sure it is before I send it out. But I've been using that for a while. Basically you pipe in the bro logs to it. You import a bro log for a day and a directory full of bro logs for one 24 hour period. I should be more precise.

26:41

And from there you import it, it creates a Mongo DB collection and then you run analyze and it's going to tell you beaconing. Uh, and John Strand and a couple of guys did a talk at Derby con a couple of places. They're using some kind of crazy math behind the scenes like fast Fourier transform and looking at the signals and what you get,

27:03

I just use the command line, but they've got a AI Hunter product. What you get is basically a table or a CSV and when I cat it out on the command line, what I see is a score on the left. Uh, yeah, we're 99% sure this thing's beaconing. Well, there's other stuff that looks like beaconing, right?

27:21

So I don't ever want to have one view into something. I want to have a more holistic approach and enrich these things by either computing new features, you know, add a column, perform an operation on a different column. And now I know something else about that entity in that record.

27:44

So the next one, 22 in 22, I'm just, you have to put percent matplotlib in line so that you can have the plot actually display in a minute here. And the rest of this is just, uh, in 22 just the formatting for how I want that plot to look.

28:01

There's a lot of cleaner and, um, you know, more sophisticated looking things. I just wanted to have the basics out there. And then in 23, we're just saying import math and we're going to look at entropy. So entropy, you might have something along the lines of, uh, um, base 64, base 32 encoded, uh, or you might have some encryption.

28:24

So either way, I find that to be pretty helpful. So filter DNS, again, we're creating a new column entropy, we're running a Lambda. So in the data frame, you don't have to do four loops shouldn't do four loops. Uh,

28:41

you want to, you know, map or apply or use a Lambda function, and you're hitting it on that series, which is that, that column, right? So essentially very quickly I'm populating the value of this new column entropy with the results of that, uh, mathematical function.

29:01

And now I know two more things about each of these rows. I know the message length and I know the entropy. When I look at the length after I filled it out, that other, uh, those segments I didn't need were at 14,000.

29:22

In 27, do you remember I was talking about that canary URL? It's not actually called canary URL. I did a sophisticated find and replace and I masked it because that's also scientific. So I'm trying to understand length of all of that inside the parentheses, which means just to zoom back out for a minute that my friends on the red

29:44

team internally have 121 records or DNS message requests or messages that were logged during eight to 9 AM. Well, look at that 121 out of 14,000. That's what I'm talking about in terms of the class imbalance.

30:04

So MITRE ATT&CK, here's one of the ways this fits in. I can help do the detection engineering. I can help look for those artifacts. Every contact leaves a trace. I can help, uh,

30:21

that will help me dig in and start dissecting things. And I have one goal in mind. I'm trying to protect this house. I'm trying to find how they did the exfil and then see if there's any similarities that I can come up with. And if that works out mathematically, then maybe I can run that against everything else from 18th until yesterday

30:44

and see what I find. So now the point is I, I keyed in on that canary URL and now I can isolate the traces that they left based on that overt white box attack. And from there, um, this is a little bit early. You know,

31:00

I don't really need to add this column just yet, but that's where it was. And I didn't want to mess around with it. Excuse me. So essentially what I'm doing is filter DNSDF. I'm creating a new column called is malicious. This is my, uh, my label. I'm going to have, essentially it's if it contains this canary URL, um, is malicious is going to have a value of one.

31:24

Notice I did a dot map. It's going to hit everything. The other interesting thing that I saw, has anybody ever seen bro logs with DNS with DNS exfil where once in a while you'll see an API dot encrypted string,

31:40

200 characters long and then a post dot. Anybody have any ideas what that is or yeah, yes sir. No, I was hoping you'd tell me, man. Uh, no. So basically again, this is a pattern. So I don't know this, that's kind of a hail, hail Mary. When I throw that out there, um,

32:01

normally I'm going to look at the, make sure that these variables aren't related. I'm going to make sure that I look at the feature importance. We're not going to get into that right now because of time. Someone hurry up just a little bit. If it has posts in it.

32:21

Now I'm looking for a string, right? So this is the thing about the, the spy versus spy. Anybody in here in the room that sees how I'm doing this, you're going to come up with a different way around it. It's so that's why I have to keep doing this and making sure that the model doesn't degrade and I don't get lazy in the detection here.

32:42

All right. We talked about query length. So I just said for the data frame filter DNS DF and then I wanted to know about the column that has the values that we computed, which is the query length. So we computed a feature, populated the column for each row, and now we have a histogram.

33:04

So it might be kind of hard to see in the back. I didn't blow this up. Or there's some different things you can do on the scale here. And that that visualization didn't look much better, but you see a preponderance. Try to work that one, that word in on a Sunday morning, every possible chance you see a,

33:23

a high number, uh, over, you know, 14,000 it looks like of DNS requests that are what, 25, 30. And then way over there on the right, you see just a few, just a few, I'm guessing like not 121 maybe like 106 or something that are 200.

33:46

Can you write a signature? Can you write a rule that says, Hey, anything that's got a message length and DNS over 40 is malicious and flag it. What's going to happen when you do that? Yeah, it's going to light up, right? Cause there's, there's stuff that looks like that.

34:03

Now we computed two things though. We figured out the entropy entropy. Uh, you know, what's the degree of randomness and in a moment I'll, I'll show what the ranges are for the values, but in the upper right hand corner, that's weird, right?

34:21

So we've got when we look at entropy against query length, something's definitely unique about those. So I'm going to bust through a lot of this short on time. Thank you. I'll get it. Uh, anybody has any questions about this? Again, I'll have the slides up by Tuesday at Brian Gans on Twitter.

34:43

I'll put out the link there. I'm going to push through the rest of this. So I said, here are the columns that I want for features. So now I made a new data frame. I said, I may have a new tabular data structure here, but only give me the data that's in these columns series. And that's my new features underscore DF. Um,

35:04

we imported some other things. And again, the scikit learn, we had a software bridge from bro to pandas to scikit learn and bro analysis tools and bro analysis tools is doing a lot of this transformation for us. Um, again,

35:23

without us doing adversary simulation, I'm stuck with clustering because I don't have any ground truth labels, right? So where's the evil, where's Waldo, where's my buddy, Zach and Matt. Anybody have any idea which one's malicious?

35:41

You shouldn't be able to tell. Um, I mean, you might have some ideas, but this is one issue that I run into with just clustering stuff. So with MITRE ATT&CK, I've got a column that's got one if it's known malicious cause my buddy just

36:02

did it. And there's a zero if I don't know, right. I don't know that it's not. So what I'm doing here is creating, um, another data frame and I'm going to push past that. Essentially I'm going to split that into train and test sets,

36:26

train the classifier model, make predictions. I'm using logistic regression, not maybe the kind you might think about from a stats. And then I'm saying, Hey, predict, you know, how well is this model going to do once we get to the results?

36:44

And in this case it was 99.85% accurate when we look at the model evaluation or model results. But that's not the whole story. Overall it had in the 2775,

37:03

you know, we're okay with the top left and the bottom right. The four on the bottom left, that means there were four malicious ones that I told the model that those are malicious and we missed those. So again, it has to do with your threshold and how it works. So again,

37:23

that's the model we looked at. We need news flash, right? More signal, less noise. And this is, you know, this is just something that I've come across. If anybody has any other perspectives on it, come see me afterwards. I'd be interested to hear your perspective. But I just think that the more stealthy attackers are,

37:43

the fewer footprints, the fewer contacts they're going to leave, which makes it harder for me to hook into something. Future work, I'll just push through that. But like I said, looking at other bro logs, I want to generate some features based on the presence or absence of beaconing. So take the insight that I'm getting out of Rita from Black Hills Information

38:02

Security or offensive countermeasures now, enrich that and then do some other enriching IP addresses. Also very excited to look at some Neo4j and some graphistry stuff as well. So thank you very much for coming out on Sunday morning. I appreciate your time. I'll be in the back and I hope you have a great rest of the conference.