You can create a lot of dummy data using the faker module.
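pip install fake-factory
(The package was later renamed; on current versions, install it with pip install Faker.)
This time, let's create a CSV file where each row holds an id, a number of up to 10 digits, and 10 words.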
dummy.py
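from faker import Factory
import csv

# newline="" (Python 3) stops the csv module from writing blank lines between rows on Windows
with open("dummy_data.csv", "w", newline="") as f:
    csv_writer = csv.writer(f)
    fake = Factory.create()
    for i in range(10000):
        # one row: an md5 hash as the id, a number of up to 10 digits, then 10 words
        row = [fake.md5(), fake.random_number(10)]
        row.extend(fake.words(10))
        csv_writer.writerow(row)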
When you run this,
2109993cebbf9e68b5a74344798c19a3,0,sit,corrupti,eaque,perspiciatis,voluptatum,nihil,quaerat,corporis,asperiores,aut
3728284aa04584cafaaab4118fd77e58,1470,non,qui,vitae,aperiam,ut,est,facilis,perspiciatis,dolores,adipisci
ed599579acda23e99243372106f1f2f8,0,provident,sint,quidem,unde,omnis,perferendis,sint,dolorum,rerum,qui
a117e010335d11c8e88bcd8d359d9429,434500369,enim,atque,earum,nihil,voluptatem,omnis,enim,reiciendis,qui,facilis
b2524affecebe4f67f2dccfca6b6ddf2,6590,commodi,et,maxime,laudantium,eaque,nihil,omnis,perferendis,nesciunt,beatae
aefecf6b23019fbab30947f948b26a18,477210330,doloremque,fugit,est,ut,nobis,sed,aliquam,rem,asperiores,ducimus
834df95fc9e1dff879e3f1d63c870390,36,dolores,at,et,est,id,earum,nulla,ut,autem,ut
fd9a959e399b57749fcaf1b52e0388e0,13,minus,quaerat,tenetur,cumque,rerum,molestiae,repellat,autem,voluptas,repudiandae
f08d779d34eb463d9ee2653fe7f58e59,1746570,perspiciatis,maiores,saepe,porro,quia,iusto,facilis,inventore,repellat,provident
af31877a37fff42e8f624cbe5aa2ae57,5236,odit,neque,voluptatem,facere,corrupti,incidunt,est,et,id,quo
you get a CSV like this. Convenient!
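To confirm that every row really follows the id, number, 10 words layout, a small check along these lines could be run over the file. This is only a sketch: check_rows is a hypothetical helper name, and two rows copied from the output above stand in for dummy_data.csv so the snippet runs on its own.

```python
import csv
import io


def check_rows(lines):
    """Validate each CSV row and return how many rows were checked."""
    count = 0
    for row in csv.reader(lines):
        row_id, number, words = row[0], row[1], row[2:]
        if len(row_id) != 32:
            raise ValueError("id should be a 32-character md5 hex digest")
        if not number.isdigit() or len(number) > 10:
            raise ValueError("number should have at most 10 digits")
        if len(words) != 10:
            raise ValueError("expected 10 words, got %d" % len(words))
        count += 1
    return count


# Two rows from the sample output, used in place of the real file.
sample = io.StringIO(
    "2109993cebbf9e68b5a74344798c19a3,0,sit,corrupti,eaque,perspiciatis,"
    "voluptatum,nihil,quaerat,corporis,asperiores,aut\n"
    "3728284aa04584cafaaab4118fd77e58,1470,non,qui,vitae,aperiam,ut,est,"
    "facilis,perspiciatis,dolores,adipisci\n"
)
print(check_rows(sample))  # prints 2
```

To check the generated file itself, pass an open file object instead, for example open("dummy_data.csv", newline="").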
The methods available on fake, and so the kinds of dummy data you can create, look like this:
fake.add_provider fake.name
fake.address fake.null_boolean
fake.am_pm fake.numerify
fake.boolean fake.opera
fake.bothify fake.paragraph
fake.bs fake.paragraphs
fake.building_number fake.parse
fake.catch_phrase fake.phone_number
fake.century fake.postcode
fake.chrome fake.prefix
fake.city fake.provider
fake.city_prefix fake.providers
fake.city_suffix fake.pybool
fake.company fake.pydecimal
fake.company_email fake.pydict
fake.company_suffix fake.pyfloat
fake.country fake.pyint
fake.country_code fake.pyiterable
fake.credit_card_expire fake.pylist
fake.credit_card_full fake.pyset
fake.credit_card_number fake.pystr
fake.credit_card_provider fake.pystruct
fake.credit_card_security_code fake.pytuple
fake.date fake.random_digit
fake.date_time fake.random_digit_not_null
fake.date_time_ad fake.random_element
fake.date_time_between fake.random_int
fake.date_time_this_century fake.random_letter
fake.date_time_this_decade fake.random_number
fake.date_time_this_month fake.randomize_nb_elements
fake.date_time_this_year fake.safari
fake.day_of_month fake.safe_email
fake.day_of_week fake.secondary_address
fake.domain_name fake.seed
fake.domain_word fake.sentence
fake.email fake.sentences
fake.firefox fake.sha1
fake.first_name fake.sha256
fake.format fake.slug
fake.free_email fake.state
fake.free_email_domain fake.state_abbr
fake.geo_coordinate fake.street_address
fake.get_formatter fake.street_name
fake.get_providers fake.street_suffix
fake.internet_explorer fake.suffix
fake.ipv4 fake.text
fake.ipv6 fake.time
fake.iso8601 fake.timezone
fake.language_code fake.tld
fake.last_name fake.unix_time
fake.latitude fake.uri
fake.lexify fake.uri_extension
fake.linux_platform_token fake.uri_page
fake.linux_processor fake.uri_path
fake.locale fake.url
fake.longitude fake.user_agent
fake.mac_platform_token fake.user_name
fake.mac_processor fake.windows_platform_token
fake.md5 fake.word
fake.mime_type fake.words
fake.month fake.year
fake.month_name
It covers most of what you are likely to need: addresses, names, credit card numbers, dates, and so on.
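As a rough illustration of combining a few of these generators, the sketch below writes a small CSV of fake people. It uses only methods from the list above (name, address, credit_card_number, date); the function names write_people and main, the column headers, and the people.csv file name are assumptions made for this example.

```python
import csv


def write_people(fake, out, rows=5):
    """Write `rows` fake people to the file-like object `out` as CSV."""
    writer = csv.writer(out)
    writer.writerow(["name", "address", "credit_card_number", "date"])
    for _ in range(rows):
        writer.writerow([
            fake.name(),
            # addresses can span several lines, so flatten them to one line
            fake.address().replace("\n", " "),
            fake.credit_card_number(),
            fake.date(),
        ])


def main():
    # Imported here so that write_people works with any object
    # exposing the same four methods.
    from faker import Factory

    fake = Factory.create()
    with open("people.csv", "w", newline="") as f:
        write_people(fake, f, rows=5)
```

Calling main() writes a header row followed by five data rows to people.csv; the values differ on every run.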