From skaneko ＠ a2.mbn.or.jp Wed Feb 6 22:40:12 2013
From: skaneko ＠ a2.mbn.or.jp (Seiji Kaneko)
Date: Wed, 06 Feb 2013 22:40:12 +0900
Subject: [JF-gofer:10028] [Draft] Linux-3.6 filesystems/xfs-delayed-logging-design
Message-ID: <51125D3C.3010607@a2.mbn.or.jp>

Kaneko here. This is the draft of the document named in the subject line.
I would appreciate your review.
# It is quite a difficult one.

---------- >8 ---------- >8
#XFS Delayed Logging Design
#--------------------------
XFS Delayed Logging Design
--------------------------

#Introduction to Re-logging in XFS
#---------------------------------
Introduction to Re-logging in XFS
---------------------------------

#XFS logging is a combination of logical and physical logging. Some objects,
#such as inodes and dquots, are logged in logical format where the details
#logged are made up of the changes to in-core structures rather than on-disk
#structures. Other objects - typically buffers - have their physical changes
#logged. The reason for these differences is to reduce the amount of log space
#required for objects that are frequently logged. Some parts of inodes are more
#frequently logged than others, and inodes are typically more frequently logged
#than any other object (except maybe the superblock buffer) so keeping the
#amount of metadata logged low is of prime importance.
Logging in XFS is a combination of logical logging and physical logging.
Some objects, such as inodes and dquots, are logged in logical format: the
details logged are the changes to the in-core (in-memory) structures, not the
changes to the on-disk structures. Other objects, typically buffers, have
their physical changes logged. The reason for the difference is to reduce the
log space required by objects that are logged frequently. Some parts of an
inode are logged more often than others, and inodes are generally logged more
often than any other object (except perhaps the superblock buffer), so keeping
the amount of metadata logged small is very important.

#The reason that this is such a concern is that XFS allows multiple separate
#modifications to a single object to be carried in the log at any given time.
#This allows the log to avoid needing to flush each change to disk before
#recording a new change to the object. XFS does this via a method called
#"re-logging". Conceptually, this is quite simple - all it requires is that any
#new change to the object is recorded with a *new copy* of all the existing
#changes in the new transaction that is written to the log.
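The re-logging rule quoted above can be modelled in a few lines. The Python
sketch below is illustrative only: `ToyLog`, `log_change` and
`relog_sequence` are hypothetical names, not XFS code, and the model tracks a
single object while ignoring log sequence numbers, log space and writeback.

```python
from dataclasses import dataclass, field


@dataclass
class ToyLog:
    """Toy model of re-logging for one object (an assumption, not XFS code)."""

    # Changes made to the object so far, in order.
    pending: list = field(default_factory=list)
    # Contents of every transaction written to the log, in order.
    transactions: list = field(default_factory=list)

    def log_change(self, change):
        # Re-logging: the new transaction carries a fresh copy of all the
        # existing changes together with the new change.
        self.pending.append(change)
        self.transactions.append(tuple(self.pending))


def relog_sequence(changes):
    """Return the contents of each transaction as strings such as 'A+B'."""
    log = ToyLog()
    for change in changes:
        log.log_change(change)
    return ["+".join(contents) for contents in log.transactions]


if __name__ == "__main__":
    for contents in relog_sequence(["A", "B", "C", "D"]):
        print(contents)
```

Each transaction in this model grows by one change, which is why the amount
of data logged per object matters when the object is modified repeatedly.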
This is such a concern because XFS allows multiple separate modifications to
a single object to be carried in the log at any given time. As a result, the
log does not need to flush each change to disk before a new change to the
object is recorded. XFS achieves this with a technique called "re-logging".
Conceptually it is quite simple: it only requires that any new change to an
object be recorded, in the new transaction written to the log, together with
a *new copy* of all the changes already made.

#That is, if we have a sequence of changes A through to F, and the object was
#written to disk after change D, we would see in the log the following series
#of transactions, their contents and the log sequence number (LSN) of the
#transaction:
As an example, suppose there is a sequence of changes A through F, and the
object is written to disk after change D. The log would then contain the
following series of transactions, shown with their contents and log sequence
number (LSN):

#	Transaction		Contents	LSN
#	   A			   A		   X
#	   B			  A+B		  X+n
#	   C			 A+B+C		 X+n+m
#	   D			A+B+C+D		X+n+m+o
#