Luca's meaningless thoughts

GC optimization for contiguous pointers to the same page

by Leandro Lucarella on 2009- 04- 01 23:41 (updated on 2009- 04- 01 23:41)
tagged d, dgc, en, gc, optimization, phobos - with 0 comment(s)

This optimization had a patch, written by Vladimir Panteleev, sitting on Bugzilla (issue #1923) for a little more than an year now. It was already included in both Tango (issue #982) and DMD 2.x but DMD 1.x was missing it.

Fortunately is now included in DMD 1.042, released yesterday.

This optimization is best seen when you do word splitting of a big text (as shown in the post that triggered the patch):

import std.file, std.string;
void main() {
    auto txt = cast(string) read("text.txt"); // 6.3 MiB of text
    auto words = txt.split();
}

Now in words we have an array of slices (a contiguous area in memory filled with pointers) about the same size of the original text, as explained by Vladimir.

The GC heap is divided in (4KiB) pages, each page contains cells of a fixed type called bins. There are bin sizes of 16 (B_16) to 4096 (B_PAGE), incrementing in steps of power of 2 (32, 64, etc.). See Understanding the current GC for more details.

For large contiguous objects (like txt in this case) multiple pages are needed, and that pages contains only one bin of size B_PAGEPLUS, indicating that this object is distributed among several pages.

Now, back with the words array, we have a range of about 3 millions interior pointers into the txt contiguous memory (stored in about 1600 pages of bins with size B_PAGEPLUS). So each time the GC needs to mark the heap, it has to follow this 3 millions pointers and find out where is the beginning of that block to see its mark-state (if it's marked or not). Finding the beginning of the block is not that slow, but when you multiply it by 3 millions, it could get a little noticeable. Specially when this is done several times as the dynamic array of words grow and the GC collection is triggered several times, so this is kind of exponential.

The optimization consist in remembering the last page visited if the bin size was B_PAGE or B_PAGEPLUS, so if the current pointer being followed points to the last visited (cached) page, we can skip this lookup (and all the marking indeed, as we know we already visited that page).

2022: Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec
years: 2020 2016 2015 2014 2013 2012 2011 2010 2009 2008
subscribe: atom
views: blog list
tags: .net 0.6s 0s 1.055 1/1000s 1/100s 1/1024s 1/10s 1/1250s 1/125s 1/13s 1/15s 1/1600s 1/160s 1/19s 1/2000s 1/200s 1/20s 1/23s 1/250s 1/25s 1/29s 1/30s 1/320s 1/400s 1/40s 1/500s 1/50s 1/5s 1/60s 1/640s 1/800s 1/80s 1/8s 10 songs of 10.0 mm 10.8 mm 11.3 mm 11.5 mm 11.7 mm 12.0 mm 12.2 mm 12.5 mm 13.0 mm 13.3 mm 13.5 mm 13.8 mm 14.7 mm 16.0 mm 16.1 mm 16.3 mm 16.6 mm 17.3 mm 18.0 mm 18.4 mm 19.1 mm 19.5 mm 19.9 mm 1978 1s 2.6.35 20.7 mm 2002 2010 2010-05-05 2010-05-09 2010-05-10 2010-05-26 2010-06-02 2011 2011-02-06 2011-02-07 2011-02-08 2011-02-10 2011-02-12 2011-02-13 2011-02-14 2011-02-15 2011-02-16 2011-02-17 2011-03-15 2011-03-20 2011-03-27 2011-03-31 2011-04-01 2011-04-03 2011-04-05 2011-04-08 2011-04-22 2011-05-04 2011-05-21 2011-05-28 2011-05-29 2011-05-31 2011-06-03 2011-06-14 2011-06-15 2011-06-22 2011-07-23 2011-08-10 2011-08-11 2011-08-13 2011-08-16 2011-08-20 2011-08-25 2011-08-29 2011-08-31 2011-10-12 2011-10-16 2011-10-20 2011-11-22 2011-12-01 2011-12-09 2011-12-18 2012-01-08 2012-01-10 2012-01-15 2012-01-16 2012-02-03 2012-03-28 2012-04-29 2012-05-07 2013-04-29 2013-05-26 2014 21.6 mm 22.4 mm 23.3 mm 23.8 mm 24.2 mm 24.7 mm 25.2 mm 25.7 mm 26.7 mm 27.8 mm 28.1 mm 29.4 mm 3.2s 3/100s 30.0 mm 30.6 mm 31.2 mm 31.5 mm 31.8 mm 33.0 mm 34.3 mm 34.9 mm 35.7 mm 36.4 mm 37.2 mm 38.7 mm 3d 40.3 mm 42.0 mm 42.9 mm 43.9 mm 45.9 mm 46.6 mm 48.0 mm 49.2 mm 5.0 mm 5.2 mm 5.6 mm 50.4 mm 51.7 mm 57.6 mm 6 degrees of black sabbath 6.0 mm 6.6 mm 6.8 mm 60.0 mm 64 bits 7.3 mm 7.4 mm 7.5 mm 7.8 mm 70.0 mm 8.0 mm 8.2 mm 8.5 mm 8.6 mm 8.8 mm 9.1 mm 9.2 mm 9.6 mm 9.8 mm CI DNI Mariano Llinás aballay abandono abort accurate activism activity adam elliot addressbook adhesion adobe adsl advertisement advertisment aeropuerto air guitar alanis morissette alberto laiseca album alemania allocation almost the truth alsa amanecer amd64 andrei alexandrescu android anfiteatro anfiteatro de puerto madero angostura animation annotation aperture-priority ae april fools day arcade fire arch architecture in helsinki argentina arnet arrayanes arroyo arroyo teno art arte artificial asado violento ascensor asm attourney audience auto avión away from keyword azulejos b babasónicos bacap backend backup bad cover version baila balnearios banda della magliana bandcamp bandera barcelona basic basket basketball basquet basura bat bate batimóvil batman battleground bbc bear beastie boys beep belle & sebastian belvedere benchmark benjamin biolay berlin beta bewelcome bicentenario bici bicicleta bicicletería bicycle bicyclederailleurhangers.com bike billy corgan bin binary compatibility binding binutils bios bison bitbucket bitcoin black keys blitiri blog blu bluetooth bobrow bohemian rhapsody bolsón bondage fairies boogie book bosque bpython brasero brasil breaking bad bret mckenzie brian storming buenos aires bugzilla build system building buraco burj dubai burner burocracia bvg c c++ c++0x c++11 caba cache cactus caff california cambio de fecha camino canary sky canon canon digital canon powershot sx120 is canon powershot sx210 is car carlos muñoz cartel cascaya incayal casciari castellano catupecu machu cavaliers cazadores de sueños cbs cc cc matienzo cdgc censorship centro cultural recoleta centros de canje cepillo ceretti cerro belvedere cha cha cha channel 4 chile china chist! chris milk christmas church ciencias naturales cine ciudad emergente clang cleveland cli cloud nothings clouds cloudy club cultural matienzo cobra starship coca-cola cocorosie coima cold cold cut cold war coldcut colectivo collection comancheros comedy comedy central comic como matar al intermediario compare compiler compression concert conclusion concurrent connections conservative consulado italiano consumidor content distribution network copying copyleft copyright coral corneta corto costanera sur couchsurfing coupling cousteau cover coverage cpufreq craig minowa creative commons creativecommons critical mass cronograma cs cuadrícula cuevana curb your enthusiasm curiosidad curiosity currency curses cycles cycling cámara d d.net daniel aráoz daniel lopez dante dante-server danted data deduplication david byrne david simon david walliams daylight debian debug declassified decorey young defensa defensa del consumidor deferred delegate delirious depeche mode deporte derailleur hanger derechoaleer desecho design deutsch deutscheland devel development model dgc dgcbench diarios de bicicleta dicky james dictadura dil discarded disco diy dmc-zs7 dmd dmdfe dnet documental documentary dog doite dolar dom done dopádromo doses download draft drago dragón drama drew pearce druntime duarte dunk dvd dwarf dynamic día 1 día 10 día 11 día 12 día 2 día 3 día 5 día 6 día 7 día 8 día 9 día de la condena errada e-mail e4 eager allocation earthology records ed burns egresado el alberto el choque urbano el cielo de canarias el eternauta el hombre de al lado el hombre sin miedo el hongo el milagro del padre flato el paraíso el rati horror show el teatro elección elephone elevator ellos me quieren llevar ellum en encuentro engineer english eric giler error es espejo chico espoiler espon lx-86 estación eva perón evento evolution exception exit through the gift shop experimental extension f.a.t. f/2.8 f/3.1 f/3.2 f/3.5 f/4.0 f/4.3 f/4.5 f/5.0 f/5.6 f/5.9 f/6.3 facultad facultad de medicina fail falkor fall-through fan fandango faq fast-export fat fatboy slim fcnym feed feel good ghosts fernsehturm festival fibertel fifa fight fight for your right figures file:line film filmus final finalization firefox first orbit fitito fiuba fix flaming lips flare flash flashlight flat volumes flattr flickr flight of the conchords floss fluorescent folclore fontanarrosa food fork foto fotografía fotonovela fotopedia foxyproxy freak folk free culture freedom freedroidrpg freeing frequency fresnel fridge friedman fritzclub fs fsf fuerzabruta fugate fujitsu full auto fun funny futurama fútbol game garage sale gateway gato gato pajero gaucho sónico gay gc gcc gcx gdb gdc geba gecko gema generación dorada gentoo george carlin germany gettext ginobili git gmail gmane gml go go neko! go-neko! god golang gold google google translate gorillaz gradiente graffiti graph grey oceans gripe grml grog xd groove grooveshark groups guitar hero guy ritchie gwene h1n1 hack hacking hard drive hardware harvie krumpet hbo hdd heimathafen helicopter heligoland henderson hernan casciari hielo azul hijo de puta hipster historia history homosexual hostel hot festival how i met your mother howlin' for you howto hp scanjet 4c html html5 huemul hugin humor humour hurricane heart attacks huxley's neue welt identity correction image immix import imán in symmetry in-edit incayal incredible machine indiefolks inglaterra initrd inline inlining innocent project argentina integration intel internet interpreter intro iplan isat island records iso iso100 iso114 iso125 iso160 iso1600 iso200 iso214 iso320 iso3200 iso400 iso4000 iso500 iso640 iso80 iso800 isp issue tracker iterative itv2 jabber james burke jane's addiction japanese jem bendell jemaine clement joichi ito jpg juego karl pilkington keep on running kernel klee knowledge web konex kreuzberg kseniya simonova la la angostura la angostura y espejo chico la plata la rural la trastienda la tribu labels lago lake lamb landscape lang language largo larry david law law and oracle lazy lazy boy lazy freeing ldc legalización lego lemonade len chi lengua les mentettes les mentettes orchestra lessfs let england shake let's go to bed libclang libebook library libreta libro lido ligh limited rc link links linux literatura little britain live llvm llvm developer meeting lnb lonko lorentz los angeles los coholins los justos los reyes paraguayos loudquietloud low light lowe alpine lto luciana tagliapietra lucrecia martel luis scola lumix luna luna park lyrics lübars macri madness madres maemo mafalda magnet club mail makalu make makefile manual map mapa mapuche mar del plata mariposa mark mark-compact mark-region mark-sweep markup marlon brando marriage marvell maría masa crítica massacre massive attack master master and servant matienzo matrix matt lucas mauerpark mauerwerg mazziblog mc. giver mcabber medieval medio ambiente memoria memory memory layout mensaje mercurial merge messi metro metrotel mgmt michael moore michel colombier micropayment migration mini-farthing miniature effect minujin misfits mit mitte mkmutest mobile mochila mompox money monkey island mono montagne montaña montiel @ complejo 25 de mayo (15 de marzo 2011) monty python moon movie moving mozilla mr. orkester mundial museo music mutest muto mutt my kid could paint that n900 naive napoleón puppy nathan native nature nba neukoelln nevada new album news newton faulkner night night scene nikko nntp no heroics no tocar noalcanon noche nokia non-moving northland notification nsa nuclear nude nueva zelanda nuevo hogar numbeo obelisco obvio okupas ombú oms ondemand online opdispatch opensuse optimization orchestra orsai oscar osso otff outside p2p p9000 package painting panasonic pancho pankow panorama paper papers papo parado parental advisory park güell parque centenario parque sarmiento parser generator partial parts pasaporte italiano patch patricio rey y sus redonditos de ricota pattern paul abbott paulo coelho pause time pavement peligrosos gorriones pelusa suero performance perrin perro personal personal information pescador peter callesen peter sunde peter sundes philip sheppard phobos phone photo physics pic pichi traful pierre henry pila pintura pixies piñeyro pj harvey plagiarism plan playa playlist plug computing plugin plural png politics política pool porcentaje mentira portege porto alegre portugués postbahnhof poster powershot precise price primer printer privacy program program ae programming prohibido fumar project project gutenberg promoción property proposition infinity protest proxy psi psyché rock public transport publicidad publicity pulp pulse audio pulseaudio pybugz python quilmes rock quino r830 radiohead rae raleigh rant rc reality reallocation reciclable reciclado reciclar recovery recursive refugio registro no llame release religion reloj relojería reposición repository request rescue research resynthesizer retiro retorno retro review revisited revolución de mayo richard jones ricky gervais rincón del arte river robert sheehan robot robotics rocknrolla rodriguez saa roger & me romanzo criminale rompe pepe rompe ror rsa rsanimate rss rst rsync ruben río azul s06 s6240 sacheen littlefeather sadba safe sand santa satellite scanner schrödinger science scola script sculpture sean kelly sebastian uul secundario securitykiss seinfeld self sen to chihiro sequre sequreisp sergio denis en berlín sergio pángaro serie series service shaman shaman y los hombres en llamas shameless shane carruth share sharing-cli short short film sign simplicity sk8 skate skaters skype slow smashing pumpkins snow sociomantic labs socks software solar system soldiers song sound space spam specs spectrum spell spotify sr. tomate! stand up standard static data statistics status area display blanking applet stephen merchant stereophonics steven moffat stl stop motion stop-motion stop-the-world streaming street string import stuart murdoch studio ghibli subdivx subdivxget subdownloader subliminal subte subtitle subtítulos sudáfrica sun sun airway sur sur 2011 surreal surveillance sweep switch sx210 is syntax error südstern tag tales of mere existence talk tango tapa tapita teardrop teatro 25 de mayo teatro colón vs perl jam technology tecnópolis ted tedx teflon telecentro telecom telemarketing template tero terry gilliam tesis test testdisk the architecture of open source applications the attack of the killer app the beautiful south the black keys the canterbury distribution the cure the d programming language the day we fight back the dø the flaming lips the it crowd the king of the limbs the lady and the reaper the liberty of norton folgate the magnetic zeros the mighty boosh the money mith the new pornographers the pirate bay the rapture the ricky gervais show the shins the suburbs the wilderness downtown the wire the yes men the yes men fix the world theme theremin thievery corporation third album tick tock ticket ticketek ticketportal tilt time lapse time travel todo together tomas lindquist olsen torre de babel torrent toshiba tower tpb tpb afk tracing trailer translate trastienda travel tree tren treses tricky trip-hop trisagio del soltero true story trusted tucan tungsten turn blue turquía tv two dogs dining in a busy restaurant tyc typeinfo título u-bahn uba ubuntu ucep uk ulrich drepper una especie de documental uncooperative environment undelete understanding the current gc unesco unknown (36) unknown (40) unknown (48) unstaged update upgrade upload upstream tracker uriburu 950 us usa vaqueros paganos vaticano ventana vfat vialibre video video youtube vienna villa urquiza vim vimeo vince gilligan virtual vm vocals vodo volume voronoi voz vsevolod volkov vuvuzela walas walter bright wander wildner wapaq war we love life we used to wait weak pointers web webgl website werk9 western digital white trash fast food berlin whiteboard wiki wikileaks wikipedia win window wise witricity work world digital library wtf x86-64 x86_64 xkcd xml yacc yak yikebike youtube youtuve yt yuri gagarin z830 zct zuiikin zx80